Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore strategies for scaling generative AI workloads through efficient model selection in this 40-minute conference talk from AWS Summit Zurich 2025. Discover how to optimize your AI implementations by making informed decisions about model architecture and deployment strategies that balance performance, cost, and scalability requirements. Learn from AWS experts about best practices for evaluating different generative AI models, understanding their computational requirements, and implementing them effectively in production environments. Gain insights into the trade-offs between model complexity and operational efficiency, and understand how to leverage AWS services to support large-scale generative AI deployments while maintaining cost-effectiveness and performance standards.
Syllabus
AWS Summit Zurich 2025 - Scaling generative AI workloads with efficient model choice
Taught by
AWS Events