Power BI Fundamentals - Create visualizations and dashboards from scratch
Build GenAI Apps from Scratch — UCSB PaCE Certificate Program
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Learn how to deploy and customize open source models for production-ready inference in this conference talk from the AI Engineer World's Fair. Explore practical strategies for transitioning from experimental AI models to scalable production systems, with insights on leveraging PyTorch technologies for high-performance, cost-effective large language model inference. Discover approaches for interactive experimentation and productionization of large models, drawing from real-world experience in bringing PyTorch from research environments to production applications across various AI use cases. Gain understanding of the technical considerations and best practices for implementing customized inference solutions that can handle enterprise-scale deployments while maintaining efficiency and reliability.
Syllabus
Customized, production ready inference with open source models: Dmytro (Dima) Dzhulgakov
Taught by
AI Engineer