Minimax Regret for Bandit Convex Optimisation of Ridge Functions

Abstract
We analyse adversarial bandit convex optimisation with an adversary that is restricted to playing functions of the form for convex and . We provide a short information-theoretic proof that the minimax regret is at most where is the number of interactions, the dimension and is the diameter of the constraint set. Hence, this class of functions is at most logarithmically harder than the linear case.
View on arXivComments on this paper