Coursera Plus Annual is 40% Off — Ends July 6.

View

Class Central

Rankings
Career Certificates

Subjects

View all

Technology
Business
Creative
STEM
Health & Wellness
People & Society
Personal Growth & Lifestyle

View all Subjects

Universities
The Report

Courses from 1000+ universities

Rankings

Best Courses

Best of All Time
Best of the Year

Most Popular Courses

Most Popular of All Time
Most Popular of the Year

Career Certificates

Generative AI
UX Design
Data Science
Finance
DevOps
Project Management
View all Career Certificates

Technology

Computer Science
Artificial Intelligence
Data Science
Web Development
Programming
Databases
IT & Networking
DevOps
UX/UI Design
Generative AI
Cloud Computing
Cybersecurity
Game Development
Product Management
View all Technology

Business

Management & Leadership
Finance
Entrepreneurship
Marketing
Strategic Management
Industry Specific
Project Management
Sales
Business Software
Real Estate
Professional Development
Accounting
Supply Chain Management
Business Strategy
Human Resources
View all Business

Creative

Music
Digital Media
Visual Arts
Design
Crafts
Performing Arts
AI Art
View all Creative

STEM

Mathematics
Engineering
Science
Statistics & Probability
Chemistry
Physics
Environmental Science
Astronomy
Biology
View all STEM

Health & Wellness

Nutrition & Wellness
Disease & Disorders
Public Health
Health Care
Nursing
Mental Health
Continuing Medical Education (CME)
Medicine
Wellness
Nutrition
View all Health & Wellness

People & Society

Humanities
Education & Teaching
Social Sciences
History
Literature
Sociology
Economics
Psychology
Anthropology
Political Science
Law
Language Learning
Writing
Philosophy
Religion
Sustainability
View all People & Society

Personal Growth & Lifestyle

Personal Development
Sports & Recreation
Personal Finance
Parenting & Family
Food & Drink
Self-Defense & Martial Arts
Gardening
Productivity & Time Management
Games
Study Skills
Travel
Pets & Pet Care
Beauty & Makeup
Critical Thinking
View all Personal Growth & Lifestyle

The Report

A Simplilearn Certificate Goes on LinkedIn Each Minute; Here’s How Krishna Kumar Built It From a Blog

17 years ago, Krishna Kumar started offering free PMP prep online. Today, it’s a leading digital upskilling platform that helps millions upskill in AI, cybersecurity, data science, and more.

Class Central Team Jun 22, 2026

Latest

8 Best SolidWorks Courses for 2026
The Business of Online Education: Khan Academy Tax Returns Analysis (2008–2025)
10 Best Spring Boot Courses for 2026: Beginner to Advanced
Massive List of Online Learning Platforms in China
Best SQL Courses for 2026: Top 14 from 3,300+

Write for The Report

Visit The Report

600 Free Google Certifications
Trending

Most common

Popular subjects

Computer Science
52,161 courses
Artificial Intelligence
32,440 courses
Digital Skills
437 courses

Popular courses

Introduction to HTML5
University of Michigan
Rome: A Virtual Tour of the Ancient City
University of Reading
The Modern World, Part One: Global History from 1760 to 1910
University of Virginia

Organize and share your learning with Class Central Lists.

View our Lists Showcase

0 Reviews

Start learning

Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Technology
Computer Science
Machine Learning
Transformer Models

Technology
Artificial Intelligence
Natural Language Processing (NLP)

STEM
Mathematics
Graph Theory

Technology
Computer Science
Machine Learning
Model Optimization

Technology
Computer Science
Machine Learning
Transformer Models

Technology
Computer Science
Machine Learning
Model Compression

Technology
Computer Science
Deep Learning

Technology
Computer Science
Machine Learning

Efficient Inference of Extremely Large Transformer Models

Toronto Machine Learning Series (TMLS) via YouTube

Write review

Start learning Write review

Details

Start learning

Provider

YouTube
Pricing

Free Video
Languages

English
Effort

28 minutes
Sessions

Self-Paced
Level

Advanced

Found in

Transformer Models Courses
Natural Language Processing (NLP) Courses
Graph Theory Courses
Model Optimization Courses
Model Compression Courses
Deep Learning Courses
Machine Learning Courses

Power BI Fundamentals - Create visualizations and dashboards from scratch

Learn More →

Build with Azure OpenAI, Copilot Studio & Agentic Frameworks — Microsoft Certified

Learn More →

Overview

AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off

One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.

Unlock All Certificates

Explore the challenges and solutions for efficient inference of massive transformer-based language models in this 28-minute Toronto Machine Learning Series (TMLS) talk. Dive into the world of multi-billion-parameter models and learn how they are optimized for production environments. Discover key techniques for making these behemoth models faster, smaller, and more cost-effective, including model compression, efficient attention mechanisms, and optimal model parallelism strategies. Gain insights from Bharat Venkitesh, Senior Machine Learning Engineer at Cohere, as he discusses the establishment of the inference tech stack and the latest advancements in handling extremely large transformer models.

Syllabus

Efficient Inference of Extremely Large Transformer Models

Taught by

Toronto Machine Learning Series (TMLS)

Related Courses

Ad

Master AI and Machine Learning: From Neural Networks to Applications

Learn More →
LLM Inference Optimization
Quantization Techniques for Efficient Large Language Model Inference
Transformer Models and BERT Model
Fine-Tuning & Optimizing Large Language Models
BitNet.cpp - CPU Inference Framework for 1-bit Large Language Models

AI, Data Science & Cloud Certificates from Google, IBM & Meta Ad
11 Best Embeddings & Transformer Models Courses in 2026 (Free & Paid): Word2Vec, Vector Search, and RAG
9 Best Vector Database Courses for 2026: Build RAG Apps and Semantic Search
Write Prompts That Actually Work: ZTM’s Prompt Engineering Bootcamp Review
14 Best Artificial Intelligence Courses for 2026
16 Best Machine Learning Courses for 2026: Scikit-learn, TensorFlow, and more