Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Improving Large Language Model Performance - Evaluation and Testing at Enterprise Scale

Conf42 via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn how to enhance large language model performance through comprehensive evaluation strategies in this 14-minute conference talk from Conf42 ML 2025. Explore the fundamental challenges of testing LLM applications at enterprise scale, moving beyond traditional metrics to understand real business impact. Discover why robust evaluation frameworks are crucial for LLM deployment success and examine the critical role of human oversight in testing processes. Understand how to implement versatile testing approaches and establish effective guardrails for production environments. Gain insights into compliance requirements, documentation best practices, and continuous evaluation methodologies that ensure sustained model performance. Follow a practical implementation roadmap that addresses enterprise-scale challenges while maintaining quality standards. Master the balance between automated testing and human validation to create reliable, business-ready LLM applications that deliver consistent value in production environments.

Syllabus

00:00 Introduction and Background
00:26 Why We Are Here: LLM Applications
01:26 Challenges in Testing LLM Applications
02:15 Evaluating LLM Applications
02:58 Enterprise Scale Challenges
04:18 Traditional Metrics for LLM Evaluation
05:40 Business Impact of Robust Evaluation
08:13 Human Element in LLM Testing
09:06 Versatile Testing and Guardrails
10:37 Compliance, Documentation, and Continuous Evaluation
12:17 Implementation Roadmap
14:10 Conclusion and Future Outlook

Taught by

Conf42

Reviews

Start your review of Improving Large Language Model Performance - Evaluation and Testing at Enterprise Scale

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.