Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn to build an end-to-end multimodal RAG (Retrieval-Augmented Generation) system using Langchain that processes PDF documents containing both text and images. Follow a comprehensive step-by-step approach to extract and utilize visual content alongside textual information from PDF sources, implementing advanced techniques for handling mixed-media documents in your RAG pipeline. Master the integration of image processing capabilities with traditional text-based retrieval systems, enabling your applications to understand and respond to queries involving both textual and visual elements from PDF documents.
Syllabus
Step By Step Process To Build MultiModal RAG With Langchain(PDF And Images)
Taught by
Krish Naik