Learn AI, Data Science & Business — Earn Certificates That Get You Hired
Stuck in Tutorial Hell? Learn Backend Dev the Right Way
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Learn to evaluate AI agent reliability through a hands-on workshop that introduces the Agent GPA (Goal-Plan-Action) framework using the open source TruLens library. Code along with the instructor to measure agent performance across three critical dimensions: goals, plans, and actions, while identifying internal errors such as hallucinations, poor tool usage, and missed planning steps. Gain practical experience implementing evaluation metrics that assess how well AI agents understand objectives, formulate coherent plans, and execute appropriate actions. Access provided code notebooks and slides to follow along with real-world examples and build your own agent evaluation systems using industry-standard tools and methodologies.
Syllabus
What’s your Agent's GPA? A Framework for Evaluating AI Agent Reliability with Josh Reini
Taught by
Open Data Science