AI Performance Benchmarking - Dynamo and Other LLM Endpoints

Nvidia via YouTube

Overview

Explore AI performance benchmarking in this 43-minute webinar on NVIDIA Dynamo and AIPerf, two open-source tools for optimizing and measuring large language model (LLM) inference. Learn how Dynamo accelerates LLM inference across different hardware platforms and integrates with frameworks including PyTorch, TensorRT-LLM, and vLLM. Discover AIPerf, a benchmarking tool that measures generative AI model performance across inference endpoints and reports detailed metrics through a command-line interface. Follow hands-on demonstrations of practical implementation and optimization strategies for deploying AI models, and learn methods for comparing performance.

Syllabus

AI Perf benchmarking - Dynamo and other LLM endpoints

Taught by

NVIDIA Developer
