Build Custom LLM Benchmarks for Your Application

Build Custom LLM Benchmarks for Your Application

Trelis Research via YouTube Direct link

46:01 Conclusion

8 of 8

8 of 8

46:01 Conclusion

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Build Custom LLM Benchmarks for Your Application

Automatically move to the next video in the Classroom when playback concludes

  1. 1 0:00 Creating a custom benchmarking dataset
  2. 2 0:31 Video Overview and Scripts https://trelis.com/ADVANCED-evals
  3. 3 1:06 Quick-start with YourBench from HuggingFace
  4. 4 7:47 Running YourBench locally to create a benchmark
  5. 5 20:59 Advanced data generation notes pdf conversion, estimating difficulty, citations, chunking, multi-hop, filtering
  6. 6 29:23 Evaluating a custom dataset using LightEval
  7. 7 36:29 Evaluation and Data Inspection with Trelis ADVANCED-evals
  8. 8 46:01 Conclusion

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.