ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1806.08782
12
23

Finding Local Minima via Stochastic Nested Variance Reduction

22 June 2018
Dongruo Zhou
Pan Xu
Quanquan Gu
ArXivPDFHTML
Abstract

We propose two algorithms that can find local minima faster than the state-of-the-art algorithms in both finite-sum and general stochastic nonconvex optimization. At the core of the proposed algorithms is One-epoch-SNVRG+\text{One-epoch-SNVRG}^+One-epoch-SNVRG+ using stochastic nested variance reduction (Zhou et al., 2018a), which outperforms the state-of-the-art variance reduction algorithms such as SCSG (Lei et al., 2017). In particular, for finite-sum optimization problems, the proposed SNVRG++Neon2finite\text{SNVRG}^{+}+\text{Neon2}^{\text{finite}}SNVRG++Neon2finite algorithm achieves O~(n1/2ϵ−2+nϵH−3+n3/4ϵH−7/2)\tilde{O}(n^{1/2}\epsilon^{-2}+n\epsilon_H^{-3}+n^{3/4}\epsilon_H^{-7/2})O~(n1/2ϵ−2+nϵH−3​+n3/4ϵH−7/2​) gradient complexity to converge to an (ϵ,ϵH)(\epsilon, \epsilon_H)(ϵ,ϵH​)-second-order stationary point, which outperforms SVRG+Neon2finite\text{SVRG}+\text{Neon2}^{\text{finite}}SVRG+Neon2finite (Allen-Zhu and Li, 2017) , the best existing algorithm, in a wide regime. For general stochastic optimization problems, the proposed SNVRG++Neon2online\text{SNVRG}^{+}+\text{Neon2}^{\text{online}}SNVRG++Neon2online achieves O~(ϵ−3+ϵH−5+ϵ−2ϵH−3)\tilde{O}(\epsilon^{-3}+\epsilon_H^{-5}+\epsilon^{-2}\epsilon_H^{-3})O~(ϵ−3+ϵH−5​+ϵ−2ϵH−3​) gradient complexity, which is better than both SVRG+Neon2online\text{SVRG}+\text{Neon2}^{\text{online}}SVRG+Neon2online (Allen-Zhu and Li, 2017) and Natasha2 (Allen-Zhu, 2017) in certain regimes. Furthermore, we explore the acceleration brought by third-order smoothness of the objective function.

View on arXiv
Comments on this paper