Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore a 16-minute video presentation from PLDI 2023 conference that introduces Sparse Register Tiling, a novel technique for optimizing unstructured sparse neural networks. Learn how this method, combining unroll-and-sparse-jam transformation with data compression, significantly improves the performance of sparse matrix multiplication (SpMM) in machine learning models. Discover the challenges of register reuse in sparse matrices and how this approach overcomes them, particularly for pruned neural networks with 60-95% sparsity. Examine the impressive speedups achieved across various transformer and convolutional models, including a 2.12× improvement for MobileNetV1 on ARM processors. Gain insights into the potential impact of this technique on reducing execution time and improving efficiency for ML models on both multicore CPUs and edge devices.
Syllabus
[PLDI'23] Register Tiling for Unstructured Sparsity in Neural Network Inference
Taught by
ACM SIGPLAN