433
v1v2v3v4 (latest)

FOSI: Hybrid First and Second Order Optimization

International Conference on Learning Representations (ICLR), 2023
Main:9 Pages
13 Figures
Bibliography:3 Pages
2 Tables
Appendix:11 Pages
Abstract

Popular machine learning approaches forgo second-order information due to the difficulty of computing curvature in high dimensions. We present FOSI, a novel meta-algorithm that improves the performance of any base first-order optimizer by efficiently incorporating second-order information during the optimization process. In each iteration, FOSI implicitly splits the function into two quadratic functions defined on orthogonal subspaces, then uses a second-order method to minimize the first, and the base optimizer to minimize the other. We formally analyze FOSI's convergence and the conditions under which it improves a base optimizer. Our empirical evaluation demonstrates that FOSI improves the convergence rate and optimization time of first-order methods such as Heavy-Ball and Adam, and outperforms second-order methods (K-FAC and L-BFGS).

View on arXiv
Comments on this paper