Scaling Foundation Model Inference to Hundreds of Models with Amazon SageMaker
AWS Events via YouTube
AI Engineer - Learn how to integrate AI into software applications
Learn Excel & Financial Modeling the Way Finance Teams Actually Use Them
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Learn how to deploy foundation models at scale cost-effectively in this AWS re:Invent 2023 conference session. Discover deployment strategies for large-scale generative AI inferencing in SaaS environments using Amazon SageMaker, with detailed insights on architecting solutions that optimize both performance and cost efficiency. Master techniques for rolling out hundreds of foundation models while maintaining robust performance, particularly valuable for SaaS providers serving multiple customers. Gain practical knowledge about maximizing scaling capabilities and implementing cost-effective solutions for enterprise-level model deployment through Amazon SageMaker's comprehensive toolset.
Syllabus
AWS re:Invent 2023 - Scaling FM inference to hundreds of models with Amazon SageMaker (AIM327)
Taught by
AWS Events