Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Balance Cost, Performance and Reliability for AI at Enterprise Scale - AIM3304

AWS Events via YouTube

Overview

Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Learn to architect hybrid inference strategies for deploying generative AI at enterprise scale by balancing performance, cost, and reliability across diverse business use cases. Explore Amazon Bedrock's comprehensive portfolio of inference options, including on-demand cross-region inference for elastic scaling, on-demand service tiers for optimizing performance-cost ratios, prompt caching techniques for improving latency while reducing costs, and batch inference for cost-effective bulk processing. Discover the tools and approaches needed to maximize price-performance ratios as AI workloads scale across enterprise environments. Master the strategic considerations for selecting appropriate inference methods based on specific business requirements and learn how to implement optimization techniques that maintain reliability while controlling costs. Gain insights into architecting scalable AI solutions that can adapt to varying workload demands while maintaining operational efficiency and cost-effectiveness in production environments.

Syllabus

AWS re:Invent 2025 - Balance cost, performance & reliability for AI at enterprise scale (AIM3304)

Taught by

AWS Events

Reviews

Start your review of Balance Cost, Performance and Reliability for AI at Enterprise Scale - AIM3304

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.