What is RAG? The Complete Tutorial - From Scratch to Deployed API on Production - LangChain and Ollama

Venelin Valkov via YouTube

Overview

Learn to build a complete Retrieval-Augmented Generation (RAG) system from scratch in this 30-minute tutorial, which goes from basic concepts to production deployment. RAG lets a Large Language Model answer questions about your private documents by retrieving the relevant passages and passing them to the model at generation time, which avoids the limits of stuffing entire documents into the context window. Start with RAG fundamentals and set up a Python development environment with the required dependencies. Build a document retriever with Python and Scikit-learn, implement a simple RAG pipeline, and then extend it to chat with PDF files. Refactor the system with LangChain, and add tracing and observability with MLflow to monitor its behavior. Expose the RAG system as a streaming REST API with FastAPI, containerize it with Docker and Docker Compose, and deploy it to production. A sample PDF file and the complete GitHub repository are available to follow along, so the tutorial suits developers who want to implement RAG systems in real-world applications.
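To give a rough idea of what the retrieval step involves, here is a minimal sketch, not the code from the video. The tutorial builds its retriever with Scikit-learn; this stand-in uses only the Python standard library and a hand-rolled TF-IDF with cosine similarity so it runs without extra dependencies. The sample chunks and the names `CHUNKS`, `retrieve`, and `build_prompt` are hypothetical and chosen for illustration only.

```python
import math
import re
from collections import Counter

# Hypothetical sample chunks; in a real system these come from your own documents.
CHUNKS = [
    "RAG combines a retriever with a language model.",
    "Docker Compose starts several containers with one command.",
    "MLflow can record traces of each step in a pipeline.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_idf(chunks):
    """Compute a smoothed inverse document frequency for every term."""
    n = len(chunks)
    doc_freq = Counter()
    for chunk in chunks:
        doc_freq.update(set(tokenize(chunk)))
    return {t: math.log((1 + n) / (1 + df)) + 1 for t, df in doc_freq.items()}


def vectorize(text, idf):
    """Turn text into a sparse TF-IDF vector, ignoring unknown terms."""
    counts = Counter(tokenize(text))
    return {t: c * idf[t] for t, c in counts.items() if t in idf}


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, chunks, k=1):
    """Return the k chunks most similar to the query."""
    idf = build_idf(chunks)
    query_vec = vectorize(query, idf)
    ranked = sorted(
        chunks,
        key=lambda c: cosine(query_vec, vectorize(c, idf)),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, chunks, k=1):
    """Place the retrieved chunks ahead of the question for the model."""
    context = "\n".join(retrieve(query, chunks, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


if __name__ == "__main__":
    print(build_prompt("How do I trace a pipeline with MLflow?", CHUNKS))
```

The prompt returned by `build_prompt` is what would be sent to the language model. Only the best-matching chunks are included, which is the difference between RAG and stuffing every document into the context.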

Syllabus

00:00 - What is RAG?
05:58 - Project setup and dependencies
07:04 - Build a retriever
10:52 - Simple RAG
13:25 - Chat with PDF file
15:31 - Tracing and observability with MLflow
19:00 - RAG REST API with FastAPI
24:02 - Docker container and compose
25:56 - Deploy to production
28:36 - Conclusion

Taught by

Venelin Valkov
