AI Engineer - Learn how to integrate AI into software applications
The Investment Banker Certification
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore a search-based deep learning compiler designed for CPUs, GPUs, and ASICs in this 25-minute conference talk. Learn how Luminal takes an innovative search-first approach to automatically discover efficient kernels, including advanced techniques like flash attention, revolutionizing the way deep learning models are optimized across different hardware platforms. Discover the technical foundations behind this compiler's ability to automatically identify and implement high-performance computational patterns, understand its cross-platform compatibility spanning traditional CPUs to specialized ASICs, and gain insights into how search-based optimization can significantly improve deep learning model performance without manual kernel development.
Syllabus
Luminal - Search-Based Deep Learning Compilers - Joe Fioti
Taught by
AI Engineer