Completed
intro
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
You Get an LLM, Everyone Gets an LLM, But Does It Work? - Evaluating LLM Performance
Automatically move to the next video in the Classroom when playback concludes
- 1 intro
- 2 preamble
- 3 evaluations
- 4 what makes a good evaluation framework?
- 5 public benchmark vs golden datasets
- 6 your use case is likely well defined
- 7 good ol' metrics
- 8 llm evaluates llm
- 9 metrics evaluate llm
- 10 closing the gap
- 11 available frameworks
- 12 all you need is your own test/eval set
- 13 thank you!