The Offset Tree for Learning with Partial Labels
Abstract
We present an algorithm, called the Offset Tree, for learning in situations where a loss associated with different decisions is not known, but was randomly probed. The algorithm is an {optimal} reduction from this problem to binary classification. In particular, it has regret at most times the regret of the binary classifier it uses, where is the number of decisions, and no reduction to binary classification can do better. This reduction is also computationally optimal, both at training and test time, requiring just work to train on an example or make a prediction. We test the Offset Tree empirically and discover that it generally results in better performance than several plausible alternative approaches.
View on arXivComments on this paper
