Feb 2, 2023 ‐ We will present a vision paper on the impact of data cleaning on the fairness of ML models at the new special track of ICDE. The paper originates from our ongoing collaboration with the Center for Responsible AI at New York University.
Jan 18, 2023 ‐ I gave a short presentation at CIDR on our ideas for using provenance to debug the data flowing through machine learning pipelines.
Dec 6, 2022 ‐ Stefan will intern with the Gray Systems Lab at Microsoft next year, working on data systems for the Azure cloud.
Oct 10, 2022 ‐ Zeyu Zhang starts as a new PhD student, working on responsible data management and natural language processing for mental health research. He will be jointly supervised with Iacer Calixto from the Amsterdam Medical Center.
Aug 18, 2022 ‐ I gave an invited talk on "Data Provenance as a Foundation for AI Governance" at Megagon Labs.