34
20
v1v2 (latest)

Uniform regret bounds over RdR^d for the sequential linear regression problem with the square loss

Abstract

We consider the setting of online linear regression for arbitrary deterministic sequences, with the square loss. We are interested in the aim set by Bartlett et al. (2015): obtain regret bounds that hold uniformly over all competitor vectors. When the feature sequence is known at the beginning of the game, they provided closed-form regret bounds of 2dB2lnT+OT(1)2d B^2 \ln T + \mathcal{O}_T(1), where TT is the number of rounds and BB is a bound on the observations. Instead, we derive bounds with an optimal constant of 11 in front of the dB2lnTd B^2 \ln T term. In the case of sequentially revealed features, we also derive an asymptotic regret bound of dB2lnTd B^2 \ln T for any individual sequence of features and bounded observations. All our algorithms are variants of the online non-linear ridge regression forecaster, either with a data-dependent regularization or with almost no regularization.

View on arXiv
Comments on this paper