677

The Offset Tree for Learning with Partial Labels

Abstract

We present an algorithm, called the Offset Tree, for learning in situations where a loss associated with different decisions is not known, but was randomly probed. The algorithm is an {optimal} reduction from this problem to binary classification. In particular, it has regret at most (k1)(k-1) times the regret of the binary classifier it uses, where kk is the number of decisions, and no reduction to binary classification can do better. This reduction is also computationally optimal, both at training and test time, requiring just O(log2k)O(\log_2 k) work to train on an example or make a prediction. We test the Offset Tree empirically and discover that it generally results in better performance than several plausible alternative approaches.

View on arXiv
Comments on this paper