Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Udemy

Test AI & LLM App with DeepEval, RAGAs & more using Ollama

via Udemy

Overview

2026 - Path to AI QA Engineer to test LLMs and AI Apps using DeepEval, RAGAs and HF Evaluate with Local LLMs like Ollama

What you'll learn:
  • Understand the purpose of Testing LLM and LLM based Application
  • Understand DeepEval and RAGAs in detail from complete ground up
  • Understand different metrics and evaluations to evaluate LLMs and LLM based app using DeepEval and RAGAs
  • Understand the advanced concepts of DeepEval and RAGAs
  • Testing RAG based application using DeepEval and RAGAs
  • Testing AI Agents using DeepEval to understand how tool callings can be tested

Testing AI & LLM App with DeepEval, RAGAs & more using Ollama and Local Large Language Models (LLMs)

Master the essential skills for testing and evaluating AI applications, particularly Large Language Models (LLMs). This hands-on course equips QA, AIQA, Developers, data scientists, and AI practitioners with cutting-edge techniques to assess AI performance, identify biases, and ensure robust application development.



Topics Covered:

  • Section 1: Foundations of AI Application Testing (Introduction to LLM testing, AI application types, evaluation metrics, LLM evaluation libraries).

  • Section 2: Local LLM Deployment with Ollama (Local LLM deployment, AI models, running LLMs locally, Ollama implementation, GUI/CLI, setting up Ollama as API).

  • Section 3: Environment Setup (Jupyter Notebook for tests, setting up Confident AI).

  • Section 4: DeepEval Basics (Traditional LLM testing, first DeepEval code for AnswerRelevance, Context Precision, evaluating in Confident AI, testing with local LLM, understanding LLMTestCases and Goldens).

  • Section 5: Advanced LLM Evaluation (LangChain for LLMs, evaluating Answer Relevancy, Context Precision, bias detection, custom criteria with GEval, advanced bias testing).

  • Section 6: RAG Testing with DeepEval (Introduction to RAG, understanding RAG apps, demo, creating GEval for RAG, testing for conciseness & completeness).

  • Section 7: Advanced RAG Testing with DeepEval (Creating multiple test data, Goldens in Confident AI, actual output and retrieval context, LLMTestCases from dataset, running evaluation for RAG).

  • Section 8: Testing AI Agents and Tool Callings (Understanding AI Agents, working with agents, testing agents with and without actual systems, testing with multiple datasets).

  • Section 9: Evaluating LLMs using RAGAS (Introduction to RAGAS, Context Recall, Noise Sensitivity, MultiTurnSample, general purpose metrics for summaries and harmfulness).

  • Section 10: Testing RAG applications with RAGAS (Introduction and setup, creating retrievers and vector stores, MultiTurnSample dataset for RAG, evaluating RAG with RAGAS).



Syllabus

  • Introduction
  • Running LLM locally using Ollama
  • Complete course Source code
  • Environment step required for Testing/Evaluating LLM Apps and LLMs
  • Understanding the Basics of DeepEval (Building Blocks)
  • Evaluating Real LLMs (Locally) as the Source and creating Dataset with LLMs
  • Testing RAG (Retrieval-Augmented Generation) application using DeepEval
  • Testing RAG application with DeepEval (Advanced)
  • Testing AI Agents and Tool Callings with Local LLMs and DeepEval
  • Evaluating/Testing LLMs using RAGAs
  • Testing RAG applications with RAGAs
  • Functional Testing of Large Language Models (LLMs) with HuggingFace and Python
  • Evaluate LLMs using HuggingFace Evaluate
  • Component Testing of RAG LLMs Application with DeepEval
  • Component Testing of AI Agent and Tool Calling with DeepEval

Taught by

Karthik KK

Reviews

4.4 rating at Udemy based on 402 ratings

Start your review of Test AI & LLM App with DeepEval, RAGAs & more using Ollama

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.