MIT Sloan AI Adoption: Build a Playbook That Drives Real Business ROI
AI, Data Science & Cloud Certificates from Google, IBM & Meta
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off your first 3 months — limited time.
Unlock All Certificates
Explore a comprehensive presentation on LayerSkip, an innovative LLM acceleration method, delivered by Mostafa Elhoushi and Akshat Shrivastava from Meta. Dive into the intricacies of this technique that enables early exit inference and self-speculative decoding, achieving approximately 2x speed-ups on various tasks. Learn about the key components of LayerSkip, including the introduction of dropouts during training, an early exit loss to model representations of early layers, and self-speculating decoding to enhance early prediction accuracy. Gain insights into the paper "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding" and its potential impact on AI inference optimization. Discover additional resources for further exploration of AI research, industry trends, and the AI deployment stack through provided links to The Deep Dive newsletter and Unify's blog.
Syllabus
LayerSkip Explained
Taught by
Unify