Overview
Syllabus
Welcome & Talk Overview: Safe LLM Agents for Incident Response
Why Incident Response Feels Impossible Now Alert Fatigue & Complexity
Why LLMs Matter for SRE: Pattern Recognition, Synthesis, Reasoning
The Real Blocker: Production Safety, Hallucinations & Trust
A Production-Tested Architecture: Human-in-the-Loop by Design
Multi-Source Data Ingestion: Building the Right Context
What the LLM Should Output: Summaries, Hypotheses, Mitigations
Safety-First Guardrails: Data Hygiene, Privilege Boundaries, Verification
Data Hygiene & Context Control: Redaction + Narrow, Relevant Windows
Privilege Boundaries & Tooling: Read-Only Defaults + Audit Trails
Verification Gates: Shadow Runs, Counterfactuals, Decision Ledger
From Reactive to Proactive: First-Line Triage & Evidence Assembly
Practical Pilot Blueprint: Start Small, Measure Trust, Expand Carefully
Key Takeaways + Closing, Q&A, and How to Connect
Taught by
Conf42