Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore a 51-minute conference session from AWS re:Invent 2024 that delves into overcoming the computational and cost challenges of generative AI through AWS-designed AI chips. Discover the innovative developments in AWS Trainium2 and AWS Inferentia2, examining breakthroughs across silicon, server, and data center architectures. Learn from real-world implementations as AWS customer Poolside and Amazon's internal teams share their experiences deploying Rufus, Amazon's generative AI assistant, and scaling foundation models using AWS AI chips. Gain valuable insights into how these purpose-built chips are enabling organizations to harness the transformative power of generative AI while managing performance, costs, and scalability effectively.
Syllabus
AWS re:Invent 2024 - Conquer AI performance, cost, and scale with AWS AI chips (CMP209)
Taught by
AWS Events