Making Semantic Search and RAG Real - How to Build a Production-Ready Application
AWS Events via YouTube
Become an AI & ML Engineer with Cal Poly EPaCE — IBM-Certified Training
MIT Sloan: Lead AI Adoption Across Your Organization — Not Just Pilot It
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off your first 3 months — limited time.
Unlock All Certificates
Discover in this AWS re:Invent 2023 conference talk how to build production-ready applications incorporating semantic search and Retrieval Augmented Generation (RAG). Explore the complexities of implementing private data retrieval with Large Language Models (LLMs) at scale, moving beyond basic kNN search implementations. Learn to develop robust search experiences that prioritize simplicity, repeatability, and security while implementing elegant APIs capable of managing data across various stores, middleware, and internal applications. Master the implementation of essential enterprise features including Role-Based Access Control (RBAC), High Availability (HA), and Disaster Recovery (DR) across cloud and on-premises environments. Gain valuable insights from Elastic, an AWS Partner, on operationalizing enterprise-grade AI search solutions with LLMs, ensuring your applications meet production-level requirements for performance, security, and scalability.
Syllabus
AWS re:Invent 2023 - Making semantic search & RAG real: How to build a prod-ready app (AIM201)
Taught by
AWS Events