Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Building Performant RAG Applications for Production

GOTO Conferences via YouTube

Overview

Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Learn how to build production-ready Retrieval Augmented Generation (RAG) applications in this 32-minute conference talk by David Carlos Zachariae at GOTO Copenhagen 2024. Discover why RAG is essential for enhancing Large Language Models with specific knowledge outside their training data, and explore a step-by-step approach to developing performant RAG systems. Follow along with practical demonstrations through four iterations: starting with a simple implementation, progressing to handling multiple documentation categories, managing unstructured documentation, and finally implementing dynamic context and actions. The presentation includes live demos at each stage, showing how to overcome common challenges in transitioning RAG applications from prototyping to production. Gain valuable insights on creating flexible, reliable, predictable, and scalable RAG pipelines that can handle diverse and complex tasks in real-world scenarios.

Syllabus

00:00 Intro
00:50 Agenda
01:42 Why use RAG?
03:52 Performant RAG?
04:44 Use-case
05:18 First iteration: The simple case
07:27 Demo
09:48 Second iteration: Multiple categories of documentation
12:50 Demo
14:43 Third iteration: Unstructured documentation
17:50 Demo
19:56 Fourth iteration: Dynamic context & actions
24:22 Demo
29:15 Take-aways
31:33 Outro

Taught by

GOTO Conferences

Reviews

Start your review of Building Performant RAG Applications for Production

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.