Get 20% off all career paths from fullstack to AI
AI, Data Science & Cloud Certificates from Google, IBM & Meta
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Learn to maximize the performance of large language models (LLMs) on AWS purpose-built accelerators through profiling techniques and custom kernel development using the Neuron Kernel Interface (NKI). Discover how to analyze what happens inside language models during inference, identify performance bottlenecks, and build optimized custom kernels to achieve superior performance. Explore practical approaches to performance engineering that can significantly speed up LLM inference workloads on AWS Neuron-powered infrastructure.
Syllabus
AWS re:Invent 2025 - Performance engineering on Neuron: How to optimize your LLM with NKI (AIM414)
Taught by
AWS Events