Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Evaluate and Choose the Best LLM Using Automatic Metrics on Custom Datasets

Venelin Valkov via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Discover effective methods for evaluating Large Language Models (LLMs) using automated metrics on custom datasets in this 22-minute tutorial. Explore best practices for selecting the optimal LLM for specific projects and assess their performance across various tasks. Gain insights into different evaluation approaches, available tools, and metrics. Follow along with a hands-on demonstration using Google Colab, covering dataset preparation, model prediction generation, naive evaluation techniques, and leveraging AI for AI evaluation. Conclude with a comprehensive evaluation report to make informed decisions when choosing the best LLM for your needs.

Syllabus

- Intro
- Text tutorial on MLExpert.io
- LLM evaluation approaches
- Available tools & metrics
- Evaluation process
- Google Colab setup
- Dataset
- Generate model predictions
- Naive evaluation
- Use AI to evaluate AI
- Evaluation report
- Conclusion

Taught by

Venelin Valkov

Reviews

Start your review of Evaluate and Choose the Best LLM Using Automatic Metrics on Custom Datasets

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.