Gain a Splash of New Skills - Coursera+ Annual Nearly 45% Off
35% Off Finance Skills That Get You Hired - Code CFI35
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn how to deploy and customize open source models for production-ready inference in this conference talk from the AI Engineer World's Fair. Explore practical strategies for transitioning from experimental AI models to scalable production systems, with insights on leveraging PyTorch technologies for high-performance, cost-effective large language model inference. Discover approaches for interactive experimentation and productionization of large models, drawing from real-world experience in bringing PyTorch from research environments to production applications across various AI use cases. Gain understanding of the technical considerations and best practices for implementing customized inference solutions that can handle enterprise-scale deployments while maintaining efficiency and reliability.
Syllabus
Customized, production ready inference with open source models: Dmytro (Dima) Dzhulgakov
Taught by
AI Engineer