Scaling Foundation Model Inference on Amazon SageMaker AI
AWS Events via YouTube
Overview
Discover how to optimize and deploy popular open-source foundation models like Qwen3, GPT-OSS, and Llama 4 using advanced inference engines such as vLLM on Amazon SageMaker in this 53-minute conference talk from AWS re:Invent 2025. Explore key features, including bidirectional streaming for audio and text applications, while learning proven optimization techniques for model inference. Master performance-boosting strategies through live demonstrations covering KV caching, intelligent routing, and autoscaling to maintain system stability under varying workloads. Learn to build agentic workflows by integrating SageMaker AI with LangChain and Amazon Bedrock AgentCore, and pick up best practices that will help you confidently move from prototype to production-ready AI experiences that deliver real user value.
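To make the deployment topic concrete, here is a minimal client-side sketch of calling a vLLM-backed SageMaker endpoint. It builds an OpenAI-compatible chat request, the format vLLM-based serving containers commonly accept; the endpoint name and parameter values are illustrative assumptions, not details from the talk, and the actual `invoke_endpoint` call is shown only in comments since it requires live AWS credentials.

```python
import json

# Hypothetical endpoint name -- an assumption for illustration only.
ENDPOINT_NAME = "qwen3-vllm-endpoint"

def build_chat_payload(prompt, max_tokens=256, temperature=0.7, stream=False):
    """Build an OpenAI-compatible chat-completion request body,
    the shape vLLM-backed serving containers commonly accept."""
    return {
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
        "temperature": temperature,
        "stream": stream,  # set True for token-by-token streaming responses
    }

payload = build_chat_payload("Summarize KV caching in one sentence.")
body = json.dumps(payload)

# In practice you would send `body` to the deployed endpoint with boto3:
#   smr = boto3.client("sagemaker-runtime")
#   resp = smr.invoke_endpoint(EndpointName=ENDPOINT_NAME,
#                              ContentType="application/json",
#                              Body=body)
print(body)
```

Setting `stream=True` pairs with the bidirectional streaming features discussed in the session, where partial tokens are returned as they are generated rather than after the full completion.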
Syllabus
AWS re:Invent 2025 - Scaling foundation model inference on Amazon SageMaker AI (AIM424)
Taught by
AWS Events