Completed
your use case is likely well defined
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
You Get an LLM, Everyone Gets an LLM, But Does It Work? - Evaluating LLM Performance
Automatically move to the next video in the Classroom when playback concludes
- 1 intro
- 2 preamble
- 3 evaluations
- 4 what makes a good evaluation framework?
- 5 public benchmark vs golden datasets
- 6 your use case is likely well defined
- 7 good ol' metrics
- 8 llm evaluates llm
- 9 metrics evaluate llm
- 10 closing the gap
- 11 available frameworks
- 12 all you need is your own test/eval set
- 13 thank you!