Learn Python with Generative AI - Self Paced Online
Python, Prompt Engineering, Data Science — Build the Skills Employers Want Now
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Learn how Amazon SageMaker HyperPod's new governance capabilities optimize GPU resource allocation for foundation model development in this 18-minute AWS re:Invent 2024 conference talk. Discover solutions for managing competing demands across organizational teams for accelerated compute resources when training new models, fine-tuning with custom data, and running inference at scale. Explore how to dynamically allocate shared compute resources to prioritize critical foundation model development projects while preventing cost overruns from underutilized resources. Gain insights into implementing effective task governance that ensures timely project completion within budget constraints, even with finite GPU availability.
Syllabus
Maximize GPU utilization with SageMaker HyperPod task governance | AWS OnAir re:Invent 2024
Taught by
AWS Events