NY State-Licensed Certificates in Design, Coding & AI — Online
PowerBI Data Analyst - Create visualizations and dashboards from scratch
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Learn best practices for TensorRT-LLM performance analysis and optimization in this 54-minute technical presentation from Nvidia experts. Discover how to analyze TensorRT-LLM performance using specialized tools, interpret profiling results effectively, identify performance bottlenecks, and implement optimization strategies. Gain insights into the systematic approach for enhancing large language model inference performance through TensorRT-LLM's optimization capabilities, with practical guidance on utilizing profiling tools and understanding performance metrics to achieve better computational efficiency.
Syllabus
The practice of doing performance analysis/optimization with TensorRT-LLM
Taught by
NVIDIA Developer