Overview
Complete the intermediate Inspect Rich Documents with Gemini Multimodality and Multimodal RAG skill badge course to demonstrate two sets of skills. With Gemini multimodality: using multimodal prompts to extract information from text and visual data, generating a video description, and retrieving information beyond the video itself. With Multimodal Retrieval Augmented Generation (RAG) and Gemini: building metadata for documents that contain text and images, retrieving all relevant text chunks, and printing citations.
Syllabus
- Inspect Rich Documents with Gemini Multimodality and Multimodal RAG
- Multimodality with Gemini
- Using Gemini for Multimodal Retail Recommendations
- Multimodal Retrieval Augmented Generation (RAG) using the Gemini API in Vertex AI
- Inspect Rich Documents with Gemini Multimodality and Multimodal RAG: Challenge Lab
- Your Next Steps
- Course Badge