Bandit optimisation of functions in the Matérn kernel RKHS

Abstract
We consider the problem of optimising functions in the reproducing kernel Hilbert space (RKHS) of a Mat\érn kernel with smoothness parameter over the domain under noisy bandit feedback. Our contribution, the -GP-UCB algorithm, is the first practical approach with guaranteed sublinear regret for all and . Empirical validation suggests better performance and drastically improved computational scalablity compared with its predecessor, Improved GP-UCB.
View on arXivComments on this paper