Overview

Retrieval Augmented Generation (RAG) improves large language model (LLM) responses by retrieving relevant data from knowledge bases—often private, recent, or domain-specific—and using it to generate more accurate, grounded answers. In this course, you’ll learn how to build RAG systems that connect LLMs to external data sources. You’ll explore core components like retrievers, vector databases, and language models, and apply key techniques at both the component and system level. Through hands-on work with real production tools, you’ll gain the skills to design, refine, and evaluate reliable RAG pipelines—and adapt to new methods as the field advances. Across five modules, you'll complete hands-on programming assignments that guide you through building each core part of a RAG system, from simple prototypes to production-ready components. Through hands-on labs, you’ll: - Build your first RAG system by writing retrieval and prompt augmentation functions and passing structured input into an LLM. - Implement and compare retrieval methods like semantic search, BM25, and Reciprocal Rank Fusion to see how each impacts LLM responses. - Scale your RAG system using Weaviate and a real news dataset—chunking, indexing, and retrieving documents with a vector database. - Develop a domain-specific chatbot for a fictional clothing store that answers FAQs and provides product suggestions based on a custom dataset. - Improve chatbot reliability by handling real-world challenges like dynamic pricing and logging user interactions for monitoring and debugging. - Develop a domain-specific chatbot using open-source LLMs hosted by Together AI for a fictional clothing store that answers FAQs and provides product suggestions based on a custom dataset. You’ll apply your skills using real-world data from domains like media, healthcare, and e-commerce. By the end of the course, you’ll combine everything you’ve learned to implement a fully functional, more advanced RAG system tailored to your project’s needs.

Syllabus

RAG Overview

Learn foundational RAG concepts, get familiar with the main components of a RAG system, including the LLM, knowledge base, and retriever, and start building your first functional RAG system.

Information Retrieval and Search Foundations

Learn foundational information retrieval techniques, including keyword search, semantic search, and metadata filtering. Then build and evaluate a hybrid search pipeline that combines all three techniques.

Information Retrieval with Vector Databases

Learn how vector databases scale up search and techniques to improve retrieval, such as chunking, query parsing, and reranking.

LLMs and Text Generation

Learn all about large language models, how they work, as well as techniques like prompt engineering, hallucination detection, agentic system design, and fine-tuning, to further improve their performance in a RAG system.

RAG Systems in Production

Learn how to monitor and evaluate a RAG system both at the component level and end-to-end and consider the tradeoffs in system performance, cost, capability, and security faced by production RAG systems.