Overview
This video explores visual reasoning capabilities in AI systems, examining both the latest research algorithms and real-world applications. It analyzes "Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning," a paper published by researchers from Peking University, Beijing Academy of Artificial Intelligence, Chinese Academy of Sciences, and University of Chinese Academy of Sciences. It then illustrates the current limitations of visual reasoning in Vision Language Models (VLMs) through the presenter's personal experiences with commercial AI systems. The 22-minute presentation highlights the gap between research claims and the practical performance of visual reasoning in AI systems.
Syllabus
Failure of AI "Visual Reasoning" in VLMs
Taught by
Discover AI