Overview
Learn to evaluate AI agent reliability in a hands-on workshop that introduces the Agent GPA (Goal-Plan-Action) framework using the open-source TruLens library. Code along with the instructor to measure agent performance across three dimensions (goals, plans, and actions) and to identify internal errors such as hallucinations, poor tool usage, and missed planning steps. Gain practical experience implementing evaluation metrics that assess how well AI agents understand objectives, formulate coherent plans, and execute appropriate actions. Use the provided code notebooks and slides to follow along with real-world examples and build your own agent evaluation systems using industry-standard tools and methodologies.
Syllabus
What's Your Agent's GPA? A Framework for Evaluating AI Agent Reliability with Josh Reini
Taught by
Open Data Science