Papers
Communities
Organizations
Events
Blog
Pricing
Feedback
Contact Sales
Search
Open menu
Home
Papers
2402.17457
Cited By
Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning
27 February 2024
Lorenzo Noci
Alexandru Meterez
Thomas Hofmann
Antonio Orvieto
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning"
1 / 1 papers shown
Title
Scaling Optimal LR Across Token Horizons
Johan Bjorck
Alon Benhaim
Vishrav Chaudhary
Furu Wei
Xia Song
218
9
0
30 Sep 2024
1