Towards Generalizable and Intelligent System for Robotic Manipulation
New York University (NYU) via YouTube
Master Windows Internals - Kernel Programming, Debugging & Architecture
AI Engineer - Learn how to integrate AI into software applications
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Attend this AI seminar exploring UniVLA, a groundbreaking framework for developing cross-embodiment vision-language-action policies in robotic manipulation. Learn how this innovative approach addresses the limitations of current robotic systems that struggle with generalization across different environments and embodiments by deriving task-centric action representations from videos using a latent action model. Discover how the framework leverages extensive data across diverse embodiments and perspectives while incorporating language instructions within the DINO feature space to mitigate task-irrelevant dynamics. Examine the state-of-the-art results achieved across multiple manipulation and navigation benchmarks, including real-robot deployments, where UniVLA demonstrates superior performance over OpenVLA using significantly less computational resources - requiring less than 1/20 of pretraining compute and 1/10 of downstream data. Explore how continuous performance improvements emerge when heterogeneous data, including human videos, are integrated into the training pipeline, highlighting UniVLA's potential for scalable and efficient robot policy learning that can facilitate the development of truly generalizable robotic systems.
Syllabus
ECE AI SEMINAR: Towards Generalizable and Intelligent System for Robotic Manipulation
Taught by
NYU Tandon School of Engineering