2025 is the Year of Evals! Just like 2024, and 2023, and … - Enterprise AI/ML Evaluation and Monitoring
AI Engineer via YouTube
Syllabus
00:00 Introduction to Arthur AI and Mozilla AI
00:46 2025: The Year of Evals
01:15 AI/ML monitoring and evaluation
02:48 The Year of the Agent
03:26 The need for 'evals' wasn't obvious to the C-suite
04:15 Pre-ChatGPT launch
06:06 Venture capitalists' predictions
07:03 Macroeconomic side of things
08:06 OpenAI launching ChatGPT
09:15 2023: The Year of GenAI
09:39 2024: GenAI applications in production
10:22 2025: Scaling and autonomy
11:35 Definition of an agent
12:06 Connecting to downstream business KPIs
14:40 Shift to multi-agent systems monitoring
15:42 Q&A
16:16 Discussion on domain expertise in evaluations
18:13 Discussion on LLMs as judges
Taught by
AI Engineer