Towards Generalizable and Intelligent System for Robotic Manipulation
New York University (NYU) via YouTube
AI Adoption - Drive Business Value and Organizational Impact
Earn Your Business Degree, Tuition-Free, 100% Online!
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Attend this AI seminar exploring UniVLA, a groundbreaking framework for developing cross-embodiment vision-language-action policies in robotic manipulation. Learn how this innovative approach addresses the limitations of current robotic systems that struggle with generalization across different environments and embodiments by deriving task-centric action representations from videos using a latent action model. Discover how the framework leverages extensive data across diverse embodiments and perspectives while incorporating language instructions within the DINO feature space to mitigate task-irrelevant dynamics. Examine the state-of-the-art results achieved across multiple manipulation and navigation benchmarks, including real-robot deployments, where UniVLA demonstrates superior performance over OpenVLA using significantly less computational resources - requiring less than 1/20 of pretraining compute and 1/10 of downstream data. Explore how continuous performance improvements emerge when heterogeneous data, including human videos, are integrated into the training pipeline, highlighting UniVLA's potential for scalable and efficient robot policy learning that can facilitate the development of truly generalizable robotic systems.
Syllabus
ECE AI SEMINAR: Towards Generalizable and Intelligent System for Robotic Manipulation
Taught by
NYU Tandon School of Engineering