Performance Engineering on Neuron - How to Optimize Your LLM with NKI

AWS Events via YouTube

Overview

Learn to maximize the performance of large language models (LLMs) on AWS purpose-built accelerators through profiling and custom kernel development with the Neuron Kernel Interface (NKI). See how to analyze what happens inside a language model during inference, identify performance bottlenecks, and write custom kernels that address them. The session covers practical performance-engineering approaches for speeding up LLM inference workloads on AWS Neuron-powered infrastructure.

Syllabus

AWS re:Invent 2025 - Performance engineering on Neuron: How to optimize your LLM with NKI (AIM414)

Taught by

AWS Events
