Matrix Decompositions over Database Joins
In this talk I will overview recent efforts to bridge between relational algebra and linear algebra, in particular how to leverage the sparsity of relational data for efficient and numerically accurate decompositions of matrices defined by natural joins over multiple relational tables. I will consider three important matrix decompositions, which form the staple for a myriad of computations in linear algebra and machine learning: QR Decomposition, SVD and PCA.
The key insight of our approach is the ability to push the decompositions past the joins and can thus avoid the materialize the join output. This leads to several desirable properties. For acyclic joins, it takes time linear in the size of the input database and independent of the size of the join output. The number of rounding errors relative to the classical decomposition algorithms is on par with the database size relative to the join output size.
A suite of experiments validate that this approach can outperform both in runtime performance and numerical accuracy the LAPACK library Intel MKL by a factor proportional to the gap between the sizes of the join output and input.
Sat 17 JunDisplayed time zone: Eastern Time (US & Canada) change
09:00 - 11:00 | |||
09:00 20mTalk | Matrix Decompositions over Database Joins DRAGSTERS Dan Olteanu University of Zurich, Nils Vortmeier Ruhr University Bochum, Dorde Zivanovic University of Oxford | ||
09:20 20mTalk | NASOQ: Numerically Accurate Sparsity-Oriented QP Solver DRAGSTERS | ||
09:40 20mTalk | UniSparse: An Intermediate Language and Compiler for General Sparse Format Customization DRAGSTERS Jie Liu Cornell University, Zhongyuan Zhao , Zijian Ding Peking University, Benjamin Brock Parallel Computing Lab (PCL), Intel, Hongbo Rong Intel Labs, Zhiru Zhang Cornell University, USA | ||
10:00 20mTalk | Unification as a means of completing partial data structures DRAGSTERS Joachim Kristensen University of Oslo, Robin Kaarsgaard University of Southern Denmark, Michael Kirkedal Thomsen University of Oslo & University of Copenhagen | ||
10:20 20mTalk | Formalizing DRAGSTERS DRAGSTERS Scott Kovach Stanford University | ||
10:40 20mTalk | Scaling Decision--Theoretic Probabilistic Programming Through Factorization DRAGSTERS |