Advanced Retrieval Pipeline for RAG - HyDE, Hybrid Search, Reranking - Build 100% Local Retrieval
-
19
-
- Write review
NY State-Licensed Certificates in Design, Coding & AI — Online
Master Production-Ready Machine Learning, Step by Step
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Learn to build an advanced retrieval pipeline for Retrieval-Augmented Generation (RAG) systems that goes beyond basic vector search to handle real-world production challenges. Discover why simple semantic embeddings fail when users search for specific IDs, acronyms, and keywords, and master the implementation of a comprehensive solution using Python and PostgreSQL. Explore the combination of Vector Search for semantic understanding with Full-Text Search for keyword matching, enhanced by reranking algorithms to achieve optimal retrieval performance. Configure pgvector and Full-Text Search in PostgreSQL, write SQL hybrid search functions using Reciprocal Rank Fusion (RRF), implement Hypothetical Document Embeddings (HyDE) for query enhancement, and integrate FlashRank for result reranking. Build a complete 100% local retrieval pipeline that addresses the limitations of traditional RAG tutorials and delivers production-ready performance for complex search scenarios.
Syllabus
- Why Vectors aren't enough
- The Advanced Retrieval Architecture HyDE + Hybrid + Rerank
- Configuring pgvector & Full-Text Search in PostgreSQL
- Writing the SQL Hybrid Search Function RRF
- Implementing HyDE
- Reranking with FlashRank
- Complete Retrieval Pipeline
- Conclusion
Taught by
Venelin Valkov