20
0

Optimistic Online Convex Optimization in Dynamic Environments

Abstract

In this paper, we study the optimistic online convex optimization problem in dynamic environments. Existing works have shown that Ader enjoys an O((1+PT)T)O\left(\sqrt{\left(1+P_T\right)T}\right) dynamic regret upper bound, where TT is the number of rounds, and PTP_T is the path length of the reference strategy sequence. However, Ader is not environment-adaptive. Based on the fact that optimism provides a framework for implementing environment-adaptive, we replace Greedy Projection (GP) and Normalized Exponentiated Subgradient (NES) in Ader with Optimistic-GP and Optimistic-NES respectively, and name the corresponding algorithm ONES-OGP. We also extend the doubling trick to the adaptive trick, and introduce three characteristic terms naturally arise from optimism, namely MTM_T, M~T\widetilde{M}_T and VT+1L2ρ(ρ+2PT)ϱ2VTDTV_T+1_{L^2\rho\left(\rho+2 P_T\right)\leqslant\varrho^2 V_T}D_T, to replace the dependence of the dynamic regret upper bound on TT. We elaborate ONES-OGP with adaptive trick and its subgradient variation version, all of which are environment-adaptive.

View on arXiv
Comments on this paper