Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

AI Evaluation from First Principles - You Can't Manage What You Can't Measure

Databricks via YouTube

Overview

Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Learn to build effective evaluation systems for GenAI applications through this 39-minute research session that addresses the critical challenge of measuring AI quality in organizations. Discover fundamental principles of GenAI evaluation and gain practical frameworks for establishing reliable metrics, even for subjective assessments. Explore techniques for calibrating LLM judges to create cost-effective and scalable evaluation processes that can adapt as your AI capabilities evolve. Master actionable approaches for defining meaningful quality metrics tailored to your specific use cases, transforming uncertain AI development into measurable, systematic improvement. Understand how to build evaluation systems that clearly identify what's working and what needs improvement in your AI implementations. Presented by Jonathan Frankle, Chief Scientist - Neural Networks at Databricks, and Pallavi Koppol, Research Scientist at Databricks, this session provides essential knowledge for developers, AI solution implementers, and technical leaders seeking to move beyond guesswork toward data-driven AI quality management using Databricks tools and methodologies.

Syllabus

AI Evaluation from First Principles: You Can't Manage What You Can't Measure

Taught by

Databricks

Reviews

Start your review of AI Evaluation from First Principles - You Can't Manage What You Can't Measure

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.