34
16

Fast Rates for Online Prediction with Abstention

Abstract

In the setting of sequential prediction of individual {0,1}\{0, 1\}-sequences with expert advice, we show that by allowing the learner to abstain from the prediction by paying a cost marginally smaller than 12\frac 12 (say, 0.490.49), it is possible to achieve expected regret bounds that are independent of the time horizon TT. We exactly characterize the dependence on the abstention cost cc and the number of experts NN by providing matching upper and lower bounds of order logN12c\frac{\log N}{1-2c}, which is to be contrasted with the best possible rate of TlogN\sqrt{T\log N} that is available without the option to abstain. We also discuss various extensions of our model, including a setting where the sequence of abstention costs can change arbitrarily over time, where we show regret bounds interpolating between the slow and the fast rates mentioned above, under some natural assumptions on the sequence of abstention costs.

View on arXiv
Comments on this paper