Posts

  • Nov 15, 2020 ‐ Our work on scalable data validation has been integrated into the new Amazon SageMaker Model Monitor service for concept drift detection. Here is the announcement at the AWS Reinvent conference:

    .
  • Sep 25, 2020 ‐ I gave the keynote at the workshop on Online Recommender Systems at ACM RecSys, check out the recording.
  • Sep 4, 2020 ‐ Julia Stoyanovich mentioned our work on the FairPrep framework in her keynote at VLDB 2020.
  • Jun 25, 2020 ‐ I gave a talk on Unit Tests for Data with Deequ at the database group of CWI, checkout the video recording.
  • Jun 14, 2020 ‐ We ran the 4th edition of the Workshop on Data Management for End-to-End Machine Learning (DEEM) with more than 120 attendees via Zoom this year. Checkout the videos of the presentations and invited talks on our website.
  • Jun 1, 2020 ‐ Our paper summarizing the Apache Mahout project called Apache Mahout: Machine Learning on Distributed Dataflow Systems has been accepted to the open source track of JMLR.
  • Nov 1, 2019 ‐ Our paper on Datawig: Missing Value Imputation for Tables has been accepted at the open source track of the Journal of Machine Learning Research (JMLR).