Overview
Learn how Amazon SageMaker HyperPod's new governance capabilities optimize GPU resource allocation for foundation model development in this 18-minute talk from AWS re:Invent 2024. Discover how to manage competing demands from teams across an organization for accelerated compute when training new models, fine-tuning with custom data, and running inference at scale. Explore how to dynamically allocate shared compute resources so that critical foundation model projects take priority and underutilized resources do not cause cost overruns. Gain insight into implementing task governance that keeps projects on schedule and within budget, even when GPU availability is finite.
Syllabus
Maximize GPU utilization with SageMaker HyperPod task governance | AWS OnAir re:Invent 2024
Taught by
AWS Events