Posts

Oct 18, 2023 ‐ I will join the newly formed Journal of Data-centric Machine Learning Research (DMLR) as an Action Editor.

Aug 30, 2023 ‐ I will give a keynote on Directions Towards Resource-Efficient Machine Learning Systems in e-Commerce at the BIFOLD Weizenbaum Summer School on Artificial Intelligence and Ecological Sustainability in Berlin.

Jun 22, 2023 ‐ I am part of the large group of contributors to Apache Flink, who won the ACM SIGMOD Systems Award 2023! Furthermore, Stefan, Shubha and me won an ACM SIGMOD Best Demo Runner Up Award for our demo on Proactively Screening Machine Learning Pipelines with ArgusEyes.

Feb 22, 2023 ‐ We will present a paper on Automating and Optimizing Data-Centric What-If Analyses on Native Machine Learning Pipelines and a demo on Proactively Screening Machine Learning Pipelines with ArgusEyes at SIGMOD in Seattle.

Feb 2, 2023 ‐ We will present a vision paper on the impact of data cleaning on the fairness of ML models at the new special track of ICDE, originating from our ongoing collaboration with the Center for Responsible AI at New York University.

Jan 18, 2023 ‐ I gave a short presentation at CIDR on our ideas for using provenance to debug the data flowing through machine learning pipelines.

Dec 6, 2022 ‐ Stefan will intern with the Gray Systems Lab at Microsoft next year, working on data systems for the Azure cloud.

Dec 4, 2022 ‐ Congratulations to David for winning the best-paper runner-up award at the Table Representation Learning workshop at NeurIPS in New Orleans. NeurIPS made a recording of his presentation available.

Nov 10, 2022 ‐ I am on parental leave until January 2023. I will be back for CIDR in Amsterdam.

Nov 7, 2022 ‐ David is going to present our work on Parameter-Efficient Automation of Data Wrangling Tasks with Prefix-Tuning at the Table Representation Learning workshop at NeurIPS in New Orleans.