Overview
Explore a 40-minute conference talk from the Storage Developer Conference (SDC) 2023 on reducing the power consumption of generative AI and large language model computation. The talk describes an approach to breaking the von Neumann bottleneck by integrating SRAM memory cells with interleaved programmable processors on a single die. Learn about the challenges of running power-intensive language models in datacenters, discover the "In-SRAM" computing architecture, and review recent developments in compressed data types for large-scale deep learning models. Presenter George Williams of GSI Technology covers mixed-precision mathematics and extreme low-bit quantization of model parameters and activations, all aimed at a lower in-silicon power profile for next-generation AI applications.
Syllabus
SDC 2023 - In-SRAM Compute For Generative AI and Large Language Models
Taught by
SNIAVideo