A key challenge in crowdsourcing is inferring the ground truth from noisy and unreliable data. To do so, existing approaches rely on collecting redundant information from the crowd, and aggregating it with some probabilistic method. However, oftentimes such methods are computationally inefficient, are restricted to some specific settings, or lack theoretical guarantees. In this paper, we revisit the problem of binary classification from crowdsourced data. Specifically we propose Streaming Bayesian Inference for Crowdsourcing (SBIC), a new algorithm that does not suffer from any of these limitations. First, SBIC has low complexity and can be used in a real-time online setting. Second, SBIC has the same accuracy as the best state-of-the-art algorithms in all settings. Third, SBIC has provable asymptotic guarantees both in the online and offline settings.
Edoardo Manino (University of Southampton)
Edoardo Manino is a research fellow at the University of Southampton. Currently, he is finishing his PhD in machine learning and crowdsourcing under the supervision of Prof. Nicholas R. Jennings and Dr. Long Tran-Thanh. His research interests range from Bayesian learning to algorithmic game theory and, more recently, influence maximisation on social networks.