Overview
Learn how to deploy and customize open-source models for production-ready inference in this conference talk from the AI Engineer World's Fair. Explore practical strategies for moving from experimental AI models to scalable production systems, with insights on using PyTorch technologies for high-performance, cost-effective large language model inference. Discover approaches to interactive experimentation with large models and to putting them into production, drawn from real-world experience in bringing PyTorch from research environments to production applications across a range of AI use cases. Gain an understanding of the technical considerations and best practices for building customized inference solutions that handle enterprise-scale deployments while remaining efficient and reliable.
Syllabus
Customized, production ready inference with open source models: Dmytro (Dima) Dzhulgakov
Taught by
AI Engineer