Online Newton Method for Bandit Convex Optimisation

Abstract
We introduce a computationally efficient algorithm for zeroth-order bandit convex optimisation and prove that in the adversarial setting its regret is at most with high probability where is the dimension and is the time horizon. In the stochastic setting the bound improves to where is a constant that depends on the geometry of the constraint set and the desired computational properties.
View on arXivComments on this paper