Static Analysis of Data Transformations in Jupyter Notebooks (Virtual)
Jupyter notebooks used to pre-process and polish raw data for data science and machine learning processes are challenging to analyze. Their data-centric code manipulates dataframes through calls to library functions with complex semantics, and the properties to track over this code vary widely depending on the verification task. This paper presents a novel abstract domain that simplifies writing analyses for such programs by extracting from the notebook a single CFG that contains all transformations applied to the data. Several properties can then be determined by analyzing this CFG, which is simpler than the original Python code. We present a first use case that exploits our analysis to infer the required shape of the dataframes manipulated by the notebook.
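To give a rough intuition for what inferring the "required shape" of a dataframe can mean, the sketch below collects the columns that a piece of straight-line notebook code reads before it ever assigns them. It is only a toy illustration and not the abstract domain or CFG construction presented in the talk: the function name `required_columns`, the variable name `df`, and the restriction to top-level statements with `df["col"]` accesses are all assumptions made for this example (it also assumes Python 3.9 or later).

```python
import ast


def required_columns(source: str, df_name: str = "df") -> set[str]:
    """Return the columns read from `df_name` before being assigned in `source`.

    Hypothetical helper: handles only top-level statements, in order, and
    only accesses of the form df["col"] with a string literal key.
    """
    required: set[str] = set()
    defined: set[str] = set()

    def is_column_access(node: ast.AST) -> bool:
        return (
            isinstance(node, ast.Subscript)
            and isinstance(node.value, ast.Name)
            and node.value.id == df_name
            and isinstance(node.slice, ast.Constant)
            and isinstance(node.slice.value, str)
        )

    for stmt in ast.parse(source).body:
        accesses = [n for n in ast.walk(stmt) if is_column_access(n)]
        # Reads are evaluated before the assignment takes effect.
        for node in accesses:
            if isinstance(node.ctx, ast.Load) and node.slice.value not in defined:
                required.add(node.slice.value)
        # Columns written by this statement are available afterwards.
        for node in accesses:
            if isinstance(node.ctx, ast.Store):
                defined.add(node.slice.value)
    return required


cell = '''
df["total"] = df["price"] * df["qty"]
df["discounted"] = df["total"] * 0.9
'''
print(sorted(required_columns(cell)))  # ['price', 'qty']
```

Here `total` is not reported because the code creates it before reading it, so only `price` and `qty` must be present in the input. A real analysis would also need to handle control flow, method calls such as renames or drops, and aliasing, which this sketch ignores.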
Sat 17 Jun. Displayed time zone: Eastern Time (US & Canada)
09:00 - 11:00 | SOAP: Session 1 - Static Analysis | SOAP at Magnolia 18 | Chair(s): Vincenzo Arceri (University of Parma, Italy)
09:00 | 30m | Talk | Combining E-Graphs with Abstract Interpretation | Samuel Coward (Imperial College London, UK / Intel Corporation); George A. Constantinides (Imperial College London, UK); Theo Drane (Intel Corporation, USA)
09:30 | 30m | Talk | Static Analysis of Data Transformations in Jupyter Notebooks (Virtual) | Luca Negrini (Ca’ Foscari University of Venice, Corvallis S.r.l.); Guruprerana Shabadi (École Polytechnique, Institut Polytechnique de Paris); Caterina Urban (Inria & École Normale Supérieure, Université PSL)
10:00 | 30m | Talk | Speeding up Static Analysis with the Split Operator | Vincenzo Arceri (University of Parma, Italy); Greta Dolcetti (University of Parma - Department of Mathematical, Physical, and Computer Sciences); Enea Zaffanella (University of Parma, Italy)
10:30 | 30m | Talk | When Long Jumps Fall Short: Control-Flow Tracking and Misuse Detection for Non-local Jumps in C | Michael Schwarz (Technische Universität München); Julian Erhard (Technical University of Munich); Vesal Vojdani (University of Tartu); Simmo Saan (University of Tartu, Estonia); Helmut Seidl (Technische Universität München)