Completed
10:22 2025: Scaling and autonomy
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
2025 is the Year of Evals! Just like 2024, and 2023, and … - Enterprise AI/ML Evaluation and Monitoring
Automatically move to the next video in the Classroom when playback concludes
- 1 00:00 Introduction to Arthur AI and Mozilla AI
- 2 00:46 2025: The Year of Evals
- 3 01:15 AI/ML monitoring and evaluation
- 4 02:48 The Year of the Agent
- 5 03:26 The need for 'evals' wasn't obvious to the C-suite
- 6 04:15 Pre-ChatGPT launch
- 7 06:06 Venture capitalists' predictions
- 8 07:03 Macroeconomic side of things
- 9 08:06 OpenAI launching ChatGPT
- 10 09:15 2023: The Year of GenAI
- 11 09:39 2024: GenAI applications in production
- 12 10:22 2025: Scaling and autonomy
- 13 11:35 Definition of an agent
- 14 12:06 Connecting to downstream business KPIs
- 15 14:40 Shift to multi-agent systems monitoring
- 16 15:42 Q&A
- 17 16:16 Discussion on domain expertise in evaluations
- 18 18:13 Discussion on LLMs as judges