Power BI Fundamentals - Create visualizations and dashboards from scratch
JavaScript Programming for Beginners
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn to maximize the performance of large language models (LLMs) on AWS purpose-built accelerators through profiling techniques and custom kernel development using the Neuron Kernel Interface (NKI). Discover how to analyze what happens inside language models during inference, identify performance bottlenecks, and build optimized custom kernels to achieve superior performance. Explore practical approaches to performance engineering that can significantly speed up LLM inference workloads on AWS Neuron-powered infrastructure.
Syllabus
AWS re:Invent 2025 - Performance engineering on Neuron: How to optimize your LLM with NKI (AIM414)
Taught by
AWS Events