Build the Finance Skills That Lead to Promotions — Not Just Certificates
Google AI Professional Certificate - Learn AI Skills That Get You Hired
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore the latest Qwen 2.5 Omni model in this video tutorial that showcases a powerful multimodal AI capable of processing text, images, video, and audio inputs while generating text or audio outputs in real-time. Learn about the model's architecture through demonstrations of its audio and video processing capabilities. Follow along as the presenter examines the official blog, demonstrates the chat interface, explores the model on Hugging Face, reviews the technical paper, and walks through a demonstration Colab notebook. Access resources including the official blog, chat interface, Hugging Face model page, and Colab notebook to experiment with this versatile AI model yourself. Perfect for developers and AI enthusiasts interested in cutting-edge multimodal language models.
Syllabus
00:00 Intro
00:23 Qwen2.5 Omni Blog
00:29 Qwen2.5 Omni Architecture
00:59 Chat: Demo - Audio
03:17 Chat: Demo - Video
05:24 Hugging Face
05:33 Qwen 2.5 Omni Paper
11:05 Qwen 2.5 Demo Colab
Taught by
Sam Witteveen