Overview
Explore AI performance benchmarking in this 43-minute webinar on NVIDIA Dynamo and AIPerf, two open-source tools for optimizing large language model (LLM) inference. Learn how Dynamo accelerates LLM inference across different hardware platforms while integrating with popular frameworks including PyTorch, TensorRT-LLM, and vLLM. Discover AIPerf, a benchmarking tool that measures generative AI model performance across various inference endpoints and reports detailed metrics through a command-line interface. Follow along with hands-on demonstrations of practical implementation and optimization strategies for deploying AI models. Gain insight into performance comparison methodologies and learn how to apply these open-source tools to your own AI inference workflows.
Syllabus
AIPerf benchmarking - Dynamo and other LLM endpoints
Taught by
NVIDIA Developer