Learn Backend Development Part-Time, Online
Learn AI, Data Science & Business — Earn Certificates That Get You Hired
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Learn to build robust evaluation frameworks for scaling AI agents from experimental prototypes to enterprise-grade production systems in this executive-level conference talk. Discover practical strategies for designing evaluation processes that drive measurable business impact, identify and mitigate performance bottlenecks, and implement observability practices to maintain reliability over time. Explore real-world deployment insights that highlight common pitfalls and best practices for iterative improvement, covering text-based agents, multimodal agents, routers, memory systems, and traces. Gain actionable insights for transforming AI agents into reliable, production-ready tools that deliver tangible business results and align with organizational objectives, whether you're shaping your organization's GenAI strategy or looking to unlock the full potential of AI agents at scale.
Syllabus
Introduction
Textbased agents
Multimodal agents
Routers
Memory
Traces
Router
Example
Taught by
AI Engineer