Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

New York University (NYU)

Towards Generalizable and Intelligent System for Robotic Manipulation

New York University (NYU) via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Attend this AI seminar exploring UniVLA, a groundbreaking framework for developing cross-embodiment vision-language-action policies in robotic manipulation. Learn how this innovative approach addresses the limitations of current robotic systems that struggle with generalization across different environments and embodiments by deriving task-centric action representations from videos using a latent action model. Discover how the framework leverages extensive data across diverse embodiments and perspectives while incorporating language instructions within the DINO feature space to mitigate task-irrelevant dynamics. Examine the state-of-the-art results achieved across multiple manipulation and navigation benchmarks, including real-robot deployments, where UniVLA demonstrates superior performance over OpenVLA using significantly less computational resources - requiring less than 1/20 of pretraining compute and 1/10 of downstream data. Explore how continuous performance improvements emerge when heterogeneous data, including human videos, are integrated into the training pipeline, highlighting UniVLA's potential for scalable and efficient robot policy learning that can facilitate the development of truly generalizable robotic systems.

Syllabus

ECE AI SEMINAR: Towards Generalizable and Intelligent System for Robotic Manipulation

Taught by

NYU Tandon School of Engineering

Reviews

Start your review of Towards Generalizable and Intelligent System for Robotic Manipulation

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.