1

Predicting Political Party Affiliation from Text

Tracking The Trackers: A Large-Scale Analysis of Embedded Web Trackers

Optimistic Recovery for Iterative Dataflows in Action

Efficient Sample Generation for Scalable Meta Learning

Factorbird - a Parameter Server Approach to Distributed Matrix Factorization

Scaling Data Mining in Massively Parallel Dataflow Systems

'All Roads Lead to Rome:' Optimistic Recovery for Distributed Iterative Data Processing

Distributed Matrix Factorization with MapReduce using a series of Broadcast-Joins

Iterative Parallel Data Processing with Stratosphere: An Inside Look

Collaborative Filtering with Apache Mahout