ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.01512
14
3

A fast and simple modification of Newton's method helping to avoid saddle points

2 June 2020
T. Truong
T. Tô
T. H. Nguyen
Thu Hang Nguyen
H. Nguyen
M. Helmy
ArXivPDFHTML
Abstract

We propose in this paper New Q-Newton's method. The update rule is very simple conceptually, for example xn+1=xn−wnx_{n+1}=x_n-w_nxn+1​=xn​−wn​ where wn=prAn,+(vn)−prAn,−(vn)w_n=pr_{A_n,+}(v_n)-pr_{A_n,-}(v_n)wn​=prAn​,+​(vn​)−prAn​,−​(vn​), with An=∇2f(xn)+δn∣∣∇f(xn)∣∣2.IdA_n=\nabla ^2f(x_n)+\delta _n||\nabla f(x_n)||^2.IdAn​=∇2f(xn​)+δn​∣∣∇f(xn​)∣∣2.Id and vn=An−1.∇f(xn)v_n=A_n^{-1}.\nabla f(x_n)vn​=An−1​.∇f(xn​). Here δn\delta _nδn​ is an appropriate real number so that AnA_nAn​ is invertible, and prAn,±pr_{A_n,\pm}prAn​,±​ are projections to the vector subspaces generated by eigenvectors of positive (correspondingly negative) eigenvalues of AnA_nAn​. The main result of this paper roughly says that if fff is C3C^3C3 (can be unbounded from below) and a sequence {xn}\{x_n\}{xn​}, constructed by the New Q-Newton's method from a random initial point x0x_0x0​, {\bf converges}, then the limit point is a critical point and is not a saddle point, and the convergence rate is the same as that of Newton's method. The first author has recently been successful incorporating Backtracking line search to New Q-Newton's method, thus resolving the convergence guarantee issue observed for some (non-smooth) cost functions. An application to quickly finding zeros of a univariate meromorphic function will be discussed. Various experiments are performed, against well known algorithms such as BFGS and Adaptive Cubic Regularization are presented.

View on arXiv
Comments on this paper