Natasha: Faster Non-Convex Stochastic Optimization Via Strongly Non-Convex Parameter

Abstract
Given a nonconvex function $f(x)$ that is an average of $n$ smooth functions, we design stochastic first-order methods to find its approximate stationary points. The convergence of our new methods depends on the smallest (negative) eigenvalue $-\sigma$ of the Hessian; this parameter $\sigma$ describes how strongly nonconvex the function is. Our methods outperform known results for a range of the parameter $\sigma$, and can be used to find approximate local minima. Our result implies an interesting dichotomy: there exists a threshold $\sigma_0$ so that the currently fastest methods for $\sigma > \sigma_0$ and for $\sigma < \sigma_0$ have different behaviors: the former scales with $n^{2/3}$ and the latter scales with $n^{3/4}$.
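To make the abstract's quantities concrete, here is a minimal numpy sketch. It is not the paper's Natasha algorithm; the toy objective, step sizes, and all constants are illustrative assumptions. It builds a finite-sum objective $f(x) = \frac{1}{n}\sum_i f_i(x)$, measures the strongly nonconvex parameter $\sigma$ (the magnitude of the most negative Hessian eigenvalue), and runs plain SGD toward an approximate stationary point, i.e. a point with small $\|\nabla f(x)\|$.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 50, 10  # number of components, dimension (arbitrary toy sizes)

# Component curvature matrices A_i (symmetric, some indefinite), so each
# f_i(x) = 0.5 x^T A_i x - b_i^T x + 0.25 ||x||^4 is smooth but nonconvex.
A = np.stack([rng.normal(size=(d, d)) for _ in range(n)])
A = (A + A.transpose(0, 2, 1)) / 2  # symmetrize each A_i
b = rng.normal(size=(n, d))
A_bar, b_bar = A.mean(axis=0), b.mean(axis=0)

def grad_i(x, i):
    # Stochastic gradient: gradient of a single component f_i.
    return A[i] @ x - b[i] + (x @ x) * x

def full_grad(x):
    # Gradient of the average f(x) = (1/n) sum_i f_i(x).
    return A_bar @ x - b_bar + (x @ x) * x

# sigma: how strongly nonconvex f is. The Hessian of f at x is
# A_bar + ||x||^2 I + 2 x x^T, whose smallest eigenvalue is minimized at
# x = 0, so sigma = max(0, -lambda_min(A_bar)).
sigma = max(0.0, -np.linalg.eigvalsh(A_bar).min())
print(f"strongly nonconvex parameter sigma ~ {sigma:.3f}")

# Plain SGD with decaying step size, then check how close to stationarity
# we got (||grad f(x)|| small means an approximate stationary point).
x = np.zeros(d)
for t in range(20000):
    x -= 0.05 / np.sqrt(t + 1) * grad_i(x, rng.integers(n))
print(f"||grad f(x)|| after SGD = {np.linalg.norm(full_grad(x)):.3e}")
```

The sketch only illustrates the definitions; the paper's contribution is that methods smarter than plain SGD can exploit $\sigma$, with running times that scale as $n^{2/3}$ when $\sigma$ is above the threshold $\sigma_0$ and as $n^{3/4}$ below it.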