Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn how to test AI systems using AI-powered evaluation methods in this conference talk from Conf42 LLMs 2025. Explore the evolving landscape of software testing for intelligent applications and discover why traditional testing approaches fall short when dealing with AI systems. Master automated evaluation techniques through hands-on demonstrations, including building a quiz generator application and implementing it using Google Colab. Understand how to leverage AI to evaluate other AI systems, moving beyond conventional testing paradigms. Dive into advanced automated evaluation strategies and learn how to integrate AI testing into CI/CD pipelines for continuous deployment of AI applications. Gain practical insights into the changing needs of software testing in the age of artificial intelligence and acquire the skills to implement robust testing frameworks for your own AI projects.
Syllabus
00:00 Introduction to AI Testing
00:30 Understanding Intelligent Applications
01:19 Changing Needs in Software Testing
02:34 Automated Evaluations
03:28 Demo: Building a Quiz Generator
05:04 Hands-On Coding with Google Colab
11:03 Evaluating AI with AI
17:01 Advanced Automated Evaluations
23:02 CI/CD Pipeline for AI Applications
24:30 Conclusion and Q&A
Taught by
Conf42