Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Google

Inspect Rich Documents with Gemini Multimodality and Multimodal RAG

Google via Google Skills

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Complete the intermediate Inspect Rich Documents with Gemini Multimodality and Multimodal RAG skill badge course to demonstrate skills in the following: using multimodal prompts to extract information from text and visual data, generating a video description, and retrieving extra information beyond the video using multimodality with Gemini; building metadata of documents containing text and images, getting all relevant text chunks, and printing citations by using Multimodal Retrieval Augmented Generation (RAG) with Gemini.

Syllabus

  • Inspect Rich Documents with Gemini Multimodality and Multimodal RAG
    • Multimodality with Gemini
    • Using Gemini for Multimodal Retail Recommendations
    • Multimodal Retrieval Augmented Generation (RAG) using the Gemini API in Vertex AI
    • Inspect Rich Documents with Gemini Multimodality and Multimodal RAG: Challenge Lab
  • Your Next Steps
    • Course Badge

Reviews

Start your review of Inspect Rich Documents with Gemini Multimodality and Multimodal RAG

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.