Scale Can't Overcome Pragmatics - The Impact of Reporting Bias on Vision-Language Reasoning

USC Information Sciences Institute via YouTube

Overview

Attend this 59-minute research seminar exploring how reporting bias in training data fundamentally limits the reasoning capabilities of Vision-Language Models (VLMs), despite their massive scale. Examine the theoretical foundations from pragmatics that explain why people naturally omit tacit information when describing visual content, which leaves reasoning skills underrepresented in web-scale and synthetically generated training corpora. Discover how this communication pattern affects VLM performance across model and data scales, and learn about potential solutions based on more intentional training data curation rather than relying solely on scale for emergent reasoning capabilities. Gain insights from PhD candidate Amita Kamath's research at UCLA and the University of Washington, conducted in collaboration with the Allen Institute for AI, as she presents findings that challenge conventional approaches to developing reasoning abilities in vision-language systems.

Syllabus

Scale Can’t Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning

Taught by

USC Information Sciences Institute
