Extracting Structured Information from Images with LangChain and Multimodal LLMs
The Machine Learning Engineer via YouTube
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn to extract structured information from PDFs using LangChain and Multimodal Large Language Models in this 56-minute technical tutorial. Master both local implementation using OLLAMA with LLama 3.2 Vision 11B and cloud-based approach with Gemini Pro 1.5 flash. Follow along with practical demonstrations and access the complete implementation code through the provided GitHub repository, which includes detailed Jupyter notebooks showcasing the extraction process and information structuring techniques.
Syllabus
RAG: How to Extract Structured Information Images with LangChain & Multimodal LLMs #machinelearning
Taught by
The Machine Learning Engineer