Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Linux Foundation

From Lab To Life - Practical AI System Evaluation

Linux Foundation via YouTube

Overview

AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off your first 3 months — limited time.
Unlock All Certificates
Explore a comprehensive conference talk that addresses the critical challenge of evaluating agentic AI systems as they transition from laboratory environments to real-world applications. Learn about the significant operational, reputational, and financial risks that enterprises face when deploying dynamic AI systems, and understand why traditional static benchmarks like MMLU fail to capture the complexities of real-world AI behavior. Discover a practical evaluation framework inspired by the University of Michigan's "Evaluation Framework for AI Systems in the Wild" that integrates performance, fairness, and ethics considerations. Examine how this risk-adjusted evaluation approach combines continuous, outcome-oriented methods with both human and automated assessments to increase stakeholder trust and transparency. Gain actionable insights into implementing these evaluation methodologies using open-source technologies throughout the entire AI system development lifecycle, from initial conception through ongoing real-world monitoring and assessment.

Syllabus

From Lab To Life: Practical AI System Evaluation - Sharon Dashet & Vincent Caldeira, Red Hat

Taught by

Linux Foundation

Reviews

Start your review of From Lab To Life - Practical AI System Evaluation

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.