ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
  • Feedback
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.17457
  4. Cited By
Why do Learning Rates Transfer? Reconciling Optimization and Scaling
  Limits for Deep Learning

Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning

27 February 2024
Lorenzo Noci
Alexandru Meterez
Thomas Hofmann
Antonio Orvieto
ArXiv (abs)PDFHTML

Papers citing "Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning"

1 / 1 papers shown
Title
Scaling Optimal LR Across Token Horizons
Scaling Optimal LR Across Token Horizons
Johan Bjorck
Alon Benhaim
Vishrav Chaudhary
Furu Wei
Xia Song
218
9
0
30 Sep 2024
1