Posts

  • Feb 2, 2023 ‐ We will present a vision paper on the impact of data cleaning on the fairness of ML models at the new special track of ICDE, originating from our ongoing collaboration with the Center for Responsible AI at New York University.
  • Jan 18, 2023 ‐ I gave a short presentation at CIDR on our ideas for using provenance to debug the data flowing through machine learning pipelines.
  • Dec 6, 2022 ‐ Stefan will intern with the Gray Systems Lab at Microsoft next year, working on data systems for the Azure cloud.
  • Dec 4, 2022 ‐ Congratulations to David for winning the best-paper runner-up award at the Table Representation Learning workshop at NeurIPS in New Orleans. NeurIPS made a recording of his presentation available.
  • Nov 10, 2022 ‐ I am on parental leave until January 2023. I will be back for CIDR in Amsterdam.
  • Nov 7, 2022 ‐ David is going to present our work on Parameter-Efficient Automation of Data Wrangling Tasks with Prefix-Tuning at the Table Representation Learning workshop at NeurIPS in New Orleans.
  • Oct 18, 2022 ‐ Mozhdeh’s paper on A Personalized Neighborhood-based Model for Within-basket Recommendation in Grocery Shopping has been accepted at WSDM. This is another piece of evidence that kNN models provide state-of-the-art prediction quality and low-latency inference for sequential recommendation tasks.
  • Oct 10, 2022 ‐ Zeyu Zhang starts as a new PhD student, working on responsible data management and natural language processing for mental health research. He will be jointly supervised with Iacer Calixto from the Amsterdam Medical Center.
  • Aug 18, 2022 ‐ I gave an invited talk on Data Provenance as a Foundation for AI Governance at Megagon Labs.