Evaluating AI Agents with Arize AI - Part 3: Agent as a Judge

Evaluating AI Agents with Arize AI - Part 3: Agent as a Judge

Data Science Dojo via YouTube Direct link

0:00 – Introduction and Series Recap

1 of 9

1 of 9

0:00 – Introduction and Series Recap

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Evaluating AI Agents with Arize AI - Part 3: Agent as a Judge

Automatically move to the next video in the Classroom when playback concludes

  1. 1 0:00 – Introduction and Series Recap
  2. 2 1:30 – Why Reasoning Paths Matter
  3. 3 3:45 – Evaluating Multi-Agent Collaboration
  4. 4 7:10 – Planning in Hierarchical and Crew-Based Agents
  5. 5 10:02 – Measuring Convergence and Execution Efficiency
  6. 6 13:34 – Using Agents as Judges: Peer Review + Self-Eval
  7. 7 18:25 – Demo: Agent-as-Judge in Arize Phoenix
  8. 8 23:17 – Applying Evaluation Methods in Production
  9. 9 27:50 – Wrap-Up and Next Steps

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.