Writing review for MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models

Scalable Parallel Computing Lab, SPCL @ ETH Zurich

via YouTube

Your review helps other learners like you discover great courses. Only review the course if you have taken or started taking this course.

Cancel