Scaling Foundation Model Inference to Hundreds of Models with Amazon SageMaker
AWS Events via YouTube
PowerBI Data Analyst - Create visualizations and dashboards from scratch
AI Adoption - Drive Business Value and Organizational Impact
Overview
Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Learn how to deploy foundation models at scale cost-effectively in this AWS re:Invent 2023 conference session. Discover deployment strategies for large-scale generative AI inferencing in SaaS environments using Amazon SageMaker, with detailed insights on architecting solutions that optimize both performance and cost efficiency. Master techniques for rolling out hundreds of foundation models while maintaining robust performance, particularly valuable for SaaS providers serving multiple customers. Gain practical knowledge about maximizing scaling capabilities and implementing cost-effective solutions for enterprise-level model deployment through Amazon SageMaker's comprehensive toolset.
Syllabus
AWS re:Invent 2023 - Scaling FM inference to hundreds of models with Amazon SageMaker (AIM327)
Taught by
AWS Events