Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Neuro-Symbolic AI for Visual Reasoning - Agent0-VL

Discover AI via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore cutting-edge developments in neuro-symbolic AI for visual reasoning through this 44-minute video presentation. Delve into three groundbreaking research papers that advance the field of vision-language models and multi-modal AI systems. Learn about Chain-of-Visual-Thought methodology from UC Berkeley and UCLA researchers, which teaches vision-language models to enhance their visual perception and reasoning capabilities using continuous visual tokens. Discover the technical innovations behind Qwen3-VL, a state-of-the-art vision-language model developed by the Qwen Team, and understand its architectural improvements and performance benchmarks. Examine Agent0-VL, a novel self-evolving agent framework from UNC-Chapel Hill that integrates tools for sophisticated vision-language reasoning tasks. Gain insights into how these approaches combine symbolic reasoning with neural networks to tackle complex visual understanding challenges, representing significant advances in artificial intelligence research from leading institutions including UC Berkeley, UCLA, Panasonic AI Research, and UNC-Chapel Hill.

Syllabus

Neuro-Symbolic AI for Visual Reasoning: Agent0-VL

Taught by

Discover AI

Reviews

Start your review of Neuro-Symbolic AI for Visual Reasoning - Agent0-VL

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.