2210.11693
Cited By
Amos: An Adam-style Optimizer with Adaptive Weight Decay towards Model-Oriented Scale
21 October 2022
Ran Tian, Ankur P. Parikh
ODL
Papers citing "Amos: An Adam-style Optimizer with Adaptive Weight Decay towards Model-Oriented Scale"
3 / 3 papers shown
Title
Poisson Process for Bayesian Optimization
Xiaoxing Wang, Jiaxing Li, Chao Xue, Wei Liu, Weifeng Liu, Xiaokang Yang, Junchi Yan, Dacheng Tao
25
1
0
05 Feb 2024
A Theoretical and Empirical Study on the Convergence of Adam with an "Exact" Constant Step Size in Non-Convex Settings
Alokendu Mazumder, Rishabh Sabharwal, Manan Tayal, Bhartendu Kumar, Punit Rathore
17
0
0
15 Sep 2023
Scaling Laws for Neural Language Models
Jared Kaplan, Sam McCandlish, T. Henighan, Tom B. Brown, B. Chess, R. Child, Scott Gray, Alec Radford, Jeff Wu, Dario Amodei
261
4,489
0
23 Jan 2020