VisionTasker - Mobile Task Automation Using Vision-Based UI Understanding and LLM Task Planning
Association for Computing Machinery (ACM) via YouTube
Overview
Watch a 12-minute conference talk from the 37th Annual ACM Symposium on User Interface Software and Technology (UIST 2024) presenting VisionTasker, an approach to mobile task automation that combines vision-based UI understanding with LLM task planning to streamline user interactions on mobile devices.
Syllabus
VisionTasker: Mobile Task Automation Using Vision-Based UI Understanding and LLM Task Planning
Taught by
ACM SIGCHI