Overview
Learn to maximize the performance of large language models (LLMs) on AWS purpose-built accelerators through profiling and custom kernel development with the Neuron Kernel Interface (NKI). See how to analyze what happens inside a language model during inference, identify performance bottlenecks, and build custom kernels that address them. Explore practical performance-engineering approaches for speeding up LLM inference workloads on AWS Neuron-powered infrastructure.
Syllabus
AWS re:Invent 2025 - Performance engineering on Neuron: How to optimize your LLM with NKI (AIM414)
Taught by
AWS Events