Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Meta MoCha - The First Movie Generation Model

AI Bites via YouTube

Overview

Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Explore the groundbreaking Meta MoCha model in this 12-minute video that explains how video generation technology has evolved to combine both speech and text inputs for creating movie-grade content. Learn about the progression from basic video generation to this potentially revolutionary model that could transform Hollywood filmmaking. The video breaks down the technical aspects including prior work in the field, how talking characters are generated, the MoCha model architecture, Flow Matching techniques, speech video window attention mechanisms, multi-character conversation capabilities, training strategies, and experimental results. Gain insights into whether this technology represents the long-awaited breakthrough that could lead to fully generated feature films.

Syllabus

0:00 - Intro
0:55 - Prior work
2:58 - Talking Characters
3:50 - MoCha model
5:15 - Flow Matching
6:08 - Speech video window attention
7:47 - Multi-character conversation
9:20 - Training Strategy
10:24 - Experiments

Taught by

AI Bites

Reviews

Start your review of Meta MoCha - The First Movie Generation Model

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.