Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Building GenAI That Doesn't Cost the Earth - Sustainable AI Development on AWS

Conf42 via YouTube

Overview

Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Learn to build sustainable and cost-effective generative AI solutions on AWS in this 20-minute conference talk that addresses the environmental impact of AI development. Explore the energy challenges posed by growing model sizes and increasing data center demand, then discover how AWS reduces carbon footprint through renewable energy and efficient data centers. Master the fundamentals of sustainable AI architecture by starting with managed and serverless services including Amazon Bedrock, EKS, and SageMaker. Understand strategic model selection principles to avoid using unnecessarily large language models and leverage Amazon Bedrock Evaluations for proper model assessment. Compare different customization approaches including prompting, retrieval-augmented generation (RAG), fine-tuning, and training from scratch to determine the most efficient option for your use case. Optimize your infrastructure by choosing appropriate silicon solutions with AWS Trainium for training workloads and Graviton processors for inference tasks. Implement advanced inference optimization techniques through model compression, compilation, and distillation to reduce computational overhead. Establish comprehensive observability and guardrails to monitor cost, performance metrics, and responsible AI practices throughout your GenAI pipeline.

Syllabus

Welcome + Why Sustainable GenAI on AWS Matters
The Energy Spike: Model Growth & Data Center Demand
How AWS Cuts Carbon Footprint Renewables + Efficient Data Centers
Start with Managed & Serverless Services Bedrock, EKS, SageMaker
Model Selection: Don’t Use a Bigger LLM Than You Need
Evaluate Models with Amazon Bedrock Evaluations
Customize the Model: Prompting vs RAG vs Fine-Tuning vs Training from Scratch
Choose Efficient Silicon: Trainium for Training, Graviton for Inference
Inference Optimization: Compression, Compilation & Distillation
Observability & Guardrails: Monitor Cost, Performance, and Responsible AI
Recap + Continuous Improvement + Further Reading

Taught by

Conf42

Reviews

Start your review of Building GenAI That Doesn't Cost the Earth - Sustainable AI Development on AWS

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.