Machine Learning Using Various GPU Technologies with Kubeflow
CNCF [Cloud Native Computing Foundation] via YouTube
Stuck in Tutorial Hell? Learn Backend Dev the Right Way
Learn Excel & Financial Modeling the Way Finance Teams Actually Use Them
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore advanced GPU technologies for efficient machine learning in this 32-minute conference talk by Jihye Choi from SAMSUNG SDS. Discover how to optimize GPU utilization and enhance distributed learning in Kubeflow environments. Learn about Multi-Instance GPU technology for the NVIDIA A100, which allows splitting a single GPU into up to 7 instances, maximizing resource efficiency for simplified models. Delve into the benefits of GPUDirect RDMA, a high-performance networking technology that enables direct GPU memory communication without CPU intervention, improving GPU utilization and performance in distributed training scenarios. Gain valuable insights on combining these cutting-edge technologies with Kubeflow to overcome limitations in cost and GPU resources for MLOps practitioners.
Syllabus
Machine Learning Using Various GPU Technology With Kubeflow. - Jihye Choi, SAMSUNG SDS
Taught by
CNCF [Cloud Native Computing Foundation]