Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Text-to-Image Diffusion AI Model Implementation - A Line-by-Line Code Tutorial

Neural Breakdown with AVB via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn to build and understand a Text-to-Image Diffusion AI model from the ground up in this comprehensive 25-minute video tutorial that breaks down the code line-by-line. Master the fundamentals of Generative AI Diffusion models, explore Latent Diffusion Models (LDMs), and discover how Text-to-Image conditional Latent diffusion models work through 15 key concepts. Follow along with a practical PyTorch implementation trained on a laptop, complete with detailed explanations and demonstrations. Access supplementary materials including a code repository, condensed 15-minute walkthrough, detailed 4000+ word script, 15+ animations, and presentation slides through the creator's Patreon. Explore related concepts through linked videos covering topics like Text-to-Video Diffusion models, Attention mechanisms, Latent Space, CNNs, U-Net architecture, NLP history, and Multimodal Models. Reference key research papers including DDPM, CLIP, and LDMs, while working with the CelebA dataset available on Kaggle.

Syllabus

- Intro
- 1
- 2
- 3
- 4
- 5
- 6
- 7
- 8
- 9
- 10
- 11
- 12
- 13
- 14
- 15

Taught by

Neural Breakdown with AVB

Reviews

Start your review of Text-to-Image Diffusion AI Model Implementation - A Line-by-Line Code Tutorial

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.