v1v2 (latest)
Minimax Regret for Bandit Convex Optimisation of Ridge Functions

Abstract
We analyse adversarial bandit convex optimisation with an adversary that is restricted to playing functions of the form for convex and unknown that is homogeneous over time. We provide a short information-theoretic proof that the minimax regret is at most where is the number of interactions, the dimension and is the diameter of the constraint set.
View on arXivComments on this paper