Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

AI Agent Evals - From Testing to Trust

MLOps World: Machine Learning in Production via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Discover how to build reliable AI systems that transition successfully from demo environments to production through comprehensive evaluation strategies in this 27-minute conference talk. Learn why evaluation serves as the cornerstone of trust in AI agents and explore how leading teams integrate testing and monitoring throughout their entire development lifecycle. Examine practical approaches for optimizing context using write, select, compress, and isolate strategies while understanding real-world implementation challenges faced by both startups and enterprises. Gain insights into designing scalable LLM evaluation workflows that maintain quality and reliability across different environments, with specific focus on closing the loop between data collection, decision-making, and user trust. Explore proven methodologies for integrating pre-launch experimentation with post-deployment monitoring to create robust AI agent evaluation pipelines that perform consistently in production settings.

Syllabus

AI Agent Evals: From Testing to Trust | Vaibhavi Gangwar, Maxim AI

Taught by

MLOps World: Machine Learning in Production

Reviews

Start your review of AI Agent Evals - From Testing to Trust

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.