35% Off Finance Skills That Get You Hired - Code CFI35
Our career paths help you become job ready faster
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore how Transformers revolutionize diffusion models for image generation in this Stanford CS25 lecture delivered by Sayak Paul from Hugging Face. Begin with essential preliminaries on diffusion models and their training processes, then examine the traditional UNet-based architectures that previously dominated the field. Discover how transformer-based architectures emerged as superior alternatives, learning about their fundamental building blocks and the various degrees of freedom available for ablation across different conditional settings. Delve into cutting-edge attention mechanisms and connected components utilized in state-of-the-art open models for diverse applications. Gain insights into subject-driven generation, preference alignment, and evaluation methodologies for diffusion models. Conclude with an exploration of promising future directions focused on efficiency improvements and emerging techniques in the rapidly evolving landscape of AI-generated content.
Syllabus
Stanford CS25: V5 I Transformers in Diffusion Models for Image Generation and Beyond
Taught by
Stanford Online