Overview
Explore AI performance benchmarking in this 43-minute webinar on NVIDIA Dynamo and AIPerf, two open-source tools for optimizing large language model (LLM) inference. Learn how Dynamo accelerates LLM inference across different hardware platforms while integrating with popular frameworks, including PyTorch, TensorRT-LLM, and vLLM. Discover AIPerf, a benchmarking tool that measures generative AI model performance across various inference endpoints and provides detailed metrics through a command-line interface and its reporting capabilities. Follow hands-on demonstrations of practical implementation and optimization strategies for deploying AI models, and gain insight into performance comparison methodologies you can apply to your own AI inference workflows.
Syllabus
AIPerf benchmarking - Dynamo and other LLM endpoints
Taught by
NVIDIA Developer