Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.19964
Cited By
Understanding Adam Requires Better Rotation Dependent Assumptions
25 October 2024
Lucas Maes
Tianyue H. Zhang
Alexia Jolicoeur-Martineau
Ioannis Mitliagkas
Damien Scieur
Simon Lacoste-Julien
Charles Guille-Escuret
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Understanding Adam Requires Better Rotation Dependent Assumptions"
2 / 2 papers shown
Title
Towards Quantifying the Hessian Structure of Neural Networks
Zhaorui Dong
Yushun Zhang
Z. Luo
Jianfeng Yao
Ruoyu Sun
28
0
0
05 May 2025
Understanding Why Adam Outperforms SGD: Gradient Heterogeneity in Transformers
Akiyoshi Tomihari
Issei Sato
ODL
61
0
0
31 Jan 2025
1