Pushing Limits of Supercomputing Innovation on Azure AI Infrastructure
Overview
Syllabus
0:00 - History of model evolution from 2019 to present
00:08:07 - Fundamental AI infrastructure stack: compute, network, storage, and managed services
00:10:37 - Announcement of GB300 GPU general availability on Azure
00:16:37 - Core pillars of cloud infrastructure: compute, network, and storage
00:17:21 - Introduction to Data Ingestion and its Scale in Azure Cloud
00:25:32 - Performance Growth Over Time and Azure Production Scale with GPU Supercomputers
00:28:25 - Large-Scale Validation with GRAC 314B Model
00:31:11 - GPU Generations: GB200/GB300 vs H100 Workloads
00:35:00 - Summary of Azure’s AI Infrastructure and Year-over-Year Improvements
Taught by
Microsoft Ignite