Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn about the fascinating intersection of vision and language in Large Language Models (LLMs) through this comprehensive lecture that explores how these advanced AI systems process and understand both visual and textual information simultaneously. Delve into the technical architecture, capabilities, and real-world applications of multimodal LLMs, examining how they bridge the gap between computer vision and natural language processing to enable more sophisticated AI interactions.
Syllabus
Vision-and-Language LLMs
Taught by
UofU Data Science