Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Qwen 2.5 Omni - Your New Open Omni Powerhouse

Sam Witteveen via YouTube

Overview

Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Explore the latest Qwen 2.5 Omni model in this video tutorial that showcases a powerful multimodal AI capable of processing text, images, video, and audio inputs while generating text or audio outputs in real-time. Learn about the model's architecture through demonstrations of its audio and video processing capabilities. Follow along as the presenter examines the official blog, demonstrates the chat interface, explores the model on Hugging Face, reviews the technical paper, and walks through a demonstration Colab notebook. Access resources including the official blog, chat interface, Hugging Face model page, and Colab notebook to experiment with this versatile AI model yourself. Perfect for developers and AI enthusiasts interested in cutting-edge multimodal language models.

Syllabus

00:00 Intro
00:23 Qwen2.5 Omni Blog
00:29 Qwen2.5 Omni Architecture
00:59 Chat: Demo - Audio
03:17 Chat: Demo - Video
05:24 Hugging Face
05:33 Qwen 2.5 Omni Paper
11:05 Qwen 2.5 Demo Colab

Taught by

Sam Witteveen

Reviews

Start your review of Qwen 2.5 Omni - Your New Open Omni Powerhouse

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.