Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Distributed Training of Modern AI Systems

IEEE Signal Processing Society via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn about distributed training techniques for modern AI systems in this comprehensive webinar that explores the computational challenges and solutions for training large-scale artificial intelligence models. Discover how to leverage distributed computing architectures to overcome the limitations of single-machine training when working with massive datasets and complex neural networks. Explore various distributed training paradigms including data parallelism, model parallelism, and pipeline parallelism, understanding when and how to apply each approach effectively. Examine the communication overhead challenges in distributed systems and learn optimization strategies to minimize bottlenecks while maximizing training efficiency. Understand the trade-offs between different distributed training frameworks and tools, including considerations for fault tolerance, scalability, and resource utilization. Gain insights into synchronous and asynchronous training methods, gradient aggregation techniques, and the impact of batch size on convergence in distributed settings. Analyze real-world case studies demonstrating successful implementations of distributed training for state-of-the-art AI models, including large language models and computer vision systems.

Syllabus

ASI Webinar | Usman Khan | Distributed Training of modern AI systems

Taught by

IEEE Signal Processing Society

Reviews

Start your review of Distributed Training of Modern AI Systems

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.