Learn the Skills Netflix, Meta, and Capital One Actually Hire For
Learn Backend Development Part-Time, Online
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
This conference talk reveals critical insights about deploying and scaling open-source Large Language Models (LLMs) for production workloads. Andrey Cheptsov, founder of dstack, examines the truth behind performance benchmarks, hardware requirements, and optimization techniques like quantization and LoRA. Discover when and why open-source LLMs can provide a competitive edge over proprietary models. Learn about the current state of open-source LLM performance compared to closed-source models, practical hardware considerations and optimization strategies for scaling, key frameworks and tools for pre-training, post-training, and inference, and strategic criteria for choosing open-source LLMs for business applications. The 44-minute presentation includes a transcript and invites viewers to share their own LLM optimization experiences.
Syllabus
Open Source LLMs: Costly Myths & Real-World Scaling
Taught by
InfoQ