Mon 19 Jun 2023 16:40 - 17:00 at Royal - PLDI: Machine Learning Chair(s): Yaniv David

Unstructured sparse neural networks are an important class of machine learning (ML) models, as they compact model size and reduce floating point operations. The execution time of these models is frequently dominated by the sparse matrix multiplication (SpMM) kernel, C = A × B, where A is a sparse matrix, and B and C are dense matrices. The unstructured sparsity pattern of matrices in pruned machine learning models along with their sparsity ratio has rendered useless the large class of libraries and systems that optimize sparse matrix multiplications. Reusing registers is particularly difficult because accesses to memory locations should be known statically. This paper proposes Sparse Register Tiling, a new technique composed of an unroll-and-sparse-jam transformation followed by data compression that is specifically tailored to sparsity patterns in ML matrices. Unroll-and-sparse-jam uses sparsity information to jam the code while improving register reuse. Sparse register tiling is evaluated across 2396 weight matrices from transformer and convolutional models with a sparsity range of 60-95% and provides an average speedup of 1.72× and 2.65× over MKL SpMM and dense matrix multiplication, respectively, on a multicore CPU processor. It also provides an end-to-end speedup of 2.12× for MobileNetV1 with 70% sparsity on an ARM processor commonly used in edge devices.

Mon 19 Jun

Displayed time zone: Eastern Time (US & Canada) change

16:00 - 18:00
PLDI: Machine LearningPLDI Research Papers at Royal
Chair(s): Yaniv David Columbia University

#pldi-mon-1600-ml-royal Discord icon small YouTube icon small

16:00
20m
Talk
Scallop: A Language for Neurosymbolic Programming
PLDI Research Papers
Ziyang Li UPenn, Jiani Huang UPenn, Mayur Naik University of Pennsylvania
DOI
16:20
20m
Talk
Abstract Interpretation of Fixpoint Iterators with Applications to Neural Networks
PLDI Research Papers
Mark Niklas Müller ETH Zurich, Marc Fischer ETH Zurich, Robin Staab ETH Zurich, Martin Vechev ETH Zurich
DOI
16:40
20m
Talk
Register Tiling for Unstructured Sparsity in Neural Network Inference
PLDI Research Papers
Lucas Wilkinson University of Toronto, Kazem Cheshmi McMaster University, Maryam Mehri Dehnavi University of Toronto
DOI
17:00
20m
Talk
Architecture-Preserving Provable Repair of Deep Neural Networks
PLDI Research Papers
Zhe Tao University of California, Davis, Stephanie Nawas University of California, Davis, Jacqueline Mitchell University of California, Davis, Aditya V. Thakur University of California at Davis
DOI Pre-print
17:20
20m
Talk
Incremental Verification of Neural Networks
PLDI Research Papers
Shubham Ugare University of Illinois at Urbana-Champaign, Debangshu Banerjee UIUC, Sasa Misailovic University of Illinois at Urbana-Champaign, Gagandeep Singh University of Illinois at Urbana-Champaign
DOI
17:40
20m
Talk
Prompting Is Programming: A Query Language for Large Language Models
PLDI Research Papers
Luca Beurer-Kellner ETH Zurich, Marc Fischer ETH Zurich, Martin Vechev ETH Zurich
DOI