NVIDIA Dynamo Platform - Scale and Serve Generative AI
MLOps World: Machine Learning in Production via YouTube
Overview
Learn how to scale and serve generative AI applications with NVIDIA's Dynamo Platform in this 29-minute conference talk from MLOps World. Discover the key components and capabilities of the platform, NVIDIA's infrastructure solution for deploying and managing generative AI workloads at enterprise scale. Explore practical approaches to optimizing performance, managing resources, and reliably serving large language models and other generative AI systems. Gain insight into best practices for production deployment, scaling strategies, and operational considerations, and see how NVIDIA's tools and frameworks can streamline the transition from model development to production-ready generative AI services.
Syllabus
NVIDIA Dynamo Platform Scale and Serve Generative AI - Chris Alexiuk
Taught by
MLOps World: Machine Learning in Production