Learn the Skills Netflix, Meta, and Capital One Actually Hire For
AI Engineer - Learn how to integrate AI into software applications
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Learn about critical infrastructure software components needed for managing hyperscale AI and GPU clusters in this technical talk from the 2024 OCP APAC Summit. Explore key considerations and solutions for deploying and operating large-scale artificial intelligence infrastructure, with a focus on software systems that enable efficient GPU cluster management at hyperscale. Gain insights into the challenges and best practices for building robust AI computing environments that can scale effectively to meet demanding computational requirements.
Syllabus
Infrastructure Software for Hyperscale AI and GPU Clusters
Taught by
Open Compute Project