Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

MIT OpenCourseWare

Generative AI - Text-to-Image Models - Lecture 11

MIT OpenCourseWare via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore text-conditional diffusion models using transformer architecture in this comprehensive lecture from MIT's Hands-On Deep Learning course. Delve into the mechanics of how these models generate images from textual descriptions, examining the underlying diffusion processes and transformer-based architectures that enable this capability. Learn about various text-to-image generation techniques and discover how these principles extend to text-to-video applications. Gain insights into the mathematical foundations, implementation considerations, and practical applications of these generative AI systems. Understand the role of conditioning mechanisms that allow models to interpret and respond to natural language prompts, creating coherent visual outputs that match textual descriptions. Examine real-world examples and case studies that demonstrate the capabilities and limitations of current text-to-image generation technologies.

Syllabus

11: Generative AI – Text-to-Image Models

Taught by

MIT OpenCourseWare

Reviews

Start your review of Generative AI - Text-to-Image Models - Lecture 11

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.