Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Build Custom LLM Benchmarks for Your Application
- 1 0:00 Creating a custom benchmarking dataset
- 2 0:31 Video Overview and Scripts: https://trelis.com/ADVANCED-evals
- 3 1:06 Quick-start with YourBench from HuggingFace
- 4 7:47 Running YourBench locally to create a benchmark
- 5 20:59 Advanced data generation notes: PDF conversion, estimating difficulty, citations, chunking, multi-hop, filtering
- 6 29:23 Evaluating a custom dataset using LightEval
- 7 36:29 Evaluation and Data Inspection with Trelis ADVANCED-evals
- 8 46:01 Conclusion