Competitive analysis of the top-K ranking problem

Motivated by applications in recommender systems, web search, social choice and crowdsourcing, we consider the problem of identifying the set of top items from noisy pairwise comparisons. In our setting, we are non-actively given pairwise comparisons between each pair of items, where each comparison has noise constrained by a very general noise model called the strong stochastic transitivity (SST) model. We analyze the competitive ratio of algorithms for the top- problem. In particular, we present a linear time algorithm for the top- problem which has a competitive ratio of ; i.e. to solve any instance of top-, our algorithm needs at most times as many samples needed as the best possible algorithm for that instance (in contrast, all previous known algorithms for the top- problem have competitive ratios of or worse). We further show that this is tight: any algorithm for the top- problem has competitive ratio at least .
View on arXiv