ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1603.05953
144
582
v1v2v3v4v5v6 (latest)

Katyusha: The First Direct Acceleration of Stochastic Gradient Methods

18 March 2016
Zeyuan Allen-Zhu
    ODL
ArXiv (abs)PDFHTML
Abstract

We introduce Katyusha\mathtt{Katyusha}Katyusha, the first direct stochastic gradient method that has an accelerated convergence rate. Given an objective that is an average of nnn convex and smooth functions, Katyusha\mathtt{Katyusha}Katyusha converges to an ε\varepsilonε-approximate minimizer using O((n+nκ)⋅log⁡f(x0)−f(x∗)ε)O((n + \sqrt{n \kappa})\cdot \log\frac{f(x_0)-f(x^*)}{\varepsilon})O((n+nκ​)⋅logεf(x0​)−f(x∗)​) stochastic iterations, where κ\kappaκ is the condition number. Katyusha\mathtt{Katyusha}Katyusha is a direct primal method. In contrast, previous accelerated stochastic methods are either based on dual coordinate descent which are more restrictive, or based on outer-inner loops which make them "blind" to the underlying stochastic nature of the optimization process. Katyusha\mathtt{Katyusha}Katyusha is the first algorithm that incorporates acceleration directly into the stochastic gradient updates. Katyusha\mathtt{Katyusha}Katyusha supports proximal updates, non-Euclidean norm smoothness, non-uniform sampling, as well as mini-batch sampling. It also improves the best known convergence rates on many interesting classes of convex objectives, including smooth objectives (e.g., Lasso, Logistic Regression), strongly-convex objectives (e.g., SVM), and non-smooth objectives (e.g., L1SVM). The main ingredient behind our result is Katyusha momentum, a clever "negative momentum on top of momentum" that can be added on top of a variance-reduction based algorithm and speed it up. As a result, since variance reduction has been successfully applied to a fast growing list of practical problems, our paper suggests that in each of such cases, one had better hurry up and give Katyusha a hug.

View on arXiv
Comments on this paper