IBM

What is Multimodal RAG? - Unlocking LLMs with Vector Databases

IBM via YouTube

Overview

Explore how Large Language Models can process and retrieve information across multiple data types, including text, images, and other modalities, through Multimodal Retrieval-Augmented Generation (RAG) in this 11-minute video. IBM experts Martin Keen and Josh Spurgin show how vector databases enable LLMs to work across modalities, beyond traditional text-only processing. The video covers the differences between hybrid and full multimodal approaches, the technical implementation of multimodal RAG systems, and their practical applications in modern AI workflows.

Syllabus

What is Multimodal RAG? Unlocking LLMs with Vector Databases

Taught by

IBM Technology
