Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Scaling Foundation Model Inference to Hundreds of Models with Amazon SageMaker

AWS Events via YouTube

Overview

Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Learn how to deploy foundation models at scale cost-effectively in this AWS re:Invent 2023 conference session. Discover deployment strategies for large-scale generative AI inferencing in SaaS environments using Amazon SageMaker, with detailed insights on architecting solutions that optimize both performance and cost efficiency. Master techniques for rolling out hundreds of foundation models while maintaining robust performance, particularly valuable for SaaS providers serving multiple customers. Gain practical knowledge about maximizing scaling capabilities and implementing cost-effective solutions for enterprise-level model deployment through Amazon SageMaker's comprehensive toolset.

Syllabus

AWS re:Invent 2023 - Scaling FM inference to hundreds of models with Amazon SageMaker (AIM327)

Taught by

AWS Events

Reviews

Start your review of Scaling Foundation Model Inference to Hundreds of Models with Amazon SageMaker

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.