
Smoothness, Low-Noise and Fast Rates

Abstract

We establish an excess risk bound of order $H\mathcal{R}_n^2 + \sqrt{H L^*}\,\mathcal{R}_n$ for ERM with an $H$-smooth loss function and a hypothesis class with Rademacher complexity $\mathcal{R}_n$, where $L^*$ is the best risk achievable by the hypothesis class. For typical hypothesis classes where $\mathcal{R}_n = \sqrt{R/n}$, this translates to a learning rate of order $RH/n$ in the separable ($L^* = 0$) case and $RH/n + \sqrt{L^* RH/n}$ more generally. We also provide similar guarantees for online and stochastic convex optimization of a smooth non-negative objective.
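For instance, the stated learning rates follow by substituting the typical complexity $\mathcal{R}_n = \sqrt{R/n}$ into the general bound (a direct calculation using only the quantities defined above):

$$H\mathcal{R}_n^2 + \sqrt{H L^*}\,\mathcal{R}_n \;=\; H \cdot \frac{R}{n} + \sqrt{H L^*} \cdot \sqrt{\frac{R}{n}} \;=\; \frac{RH}{n} + \sqrt{\frac{L^* R H}{n}},$$

which reduces to $RH/n$ in the separable case $L^* = 0$.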
