Build Motion Control Interfaces with Multimodal LLMs

JavaScript Conferences by GitNation via YouTube

Overview

This 29-minute conference talk from JSNation 2025 shows how to build motion-controlled interfaces with multimodal Large Language Models. Charlie Gerard works at the intersection of multimodal AI and human-computer interaction, demonstrating how gesture recognition can control websites and IoT devices. The talk begins with TensorFlow.js models for motion detection and gesture-based controls that need little traditional input, including a look at how motion control could reshape coding interfaces. It then examines how LLMs can interpret and respond to physical movements, with hands-on examples that use Gemini for webcam-based gesture recognition.

The demonstrations include controlling WiFi light bulbs through hand gestures, managing multiple devices with positional awareness, using live API functions for directional control, and integrating TensorFlow.js for color control. The talk closes with custom gesture databases and the potential for autonomous interfaces that respond to natural human movement and can be tailored to individual users and specific use cases.

Syllabus

00:00 Introduction to Motion Control with Multimodal AI
01:24 Exploring TensorFlow.js Models for Motion Control
02:47 Enhancing User Interactions with Gesture Control
04:45 Redefining Coding Interfaces with Minimal Inputs
06:58 Leveraging LLMs for Gesture-Based Interaction
08:55 Investigating LLMs for Motion Control Experiences
10:18 Exploring Gemini for Gesture Recognition
12:34 Webcam-Based Gesture Recognition with Gemini
14:28 Controlling Light States with Gemini Functions
15:46 Interacting with Gemini and WiFi Light Bulbs
17:44 Enabling Multi-Light Control with ID and Position
19:05 Custom Light Names and Gestural Control Logic
20:13 Live API Functions and Directional Light Control
22:57 Integrating TensorFlow.js for Color Control
24:08 Color Changes with Hand Gestures
26:07 Custom Gesture Databases and Autonomous Interfaces
28:28 Personalized Interfaces and Motion Control

Taught by

JavaScript Conferences by GitNation
