Overview
Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Discover how AI Reliability Engineering (AIRE) can dramatically reduce incident response times from hours to minutes through specialized AI agents in this conference talk from SREcon25 Europe/Middle East/Africa. Learn to build three essential AI agents that automate SRE workflows: a Terraform agent for generating AWS-compliant configurations, a GitOps agent for managing complete pull request workflows from creation to deployment, and an infrastructure validation agent for verifying post-deployment resources. Explore practical implementation strategies including agent instruction design, MCP server integration, and testing methodologies that any platform engineering team can adopt. See live demonstrations of how these agents collaborate to eliminate manual troubleshooting of certificate rotation failures, load balancer misconfigurations, and database connection issues, potentially saving hours of operational work while transforming traditional SRE incident response processes.
Syllabus
SREcon25 Europe/Middle East/Africa - From 4 Hours to 8 Minutes with AI Agents That Transform SRE...
Taught by
USENIX