2408.11029
Scaling Law with Learning Rate Annealing
20 August 2024
Howe Tissue, Venus Wang, Lu Wang
Papers citing "Scaling Law with Learning Rate Annealing"
3 / 3 papers shown
Learning Dynamics in Continual Pre-Training for Large Language Models
Xingjin Wang, Howe Tissue, Lu Wang, Linjing Li, D. Zeng
CLL
41
0
0
12 May 2025
Scaling Laws for Predicting Downstream Performance in LLMs
Yangyi Chen, Binxuan Huang, Yifan Gao, Zhengyang Wang, Jingfeng Yang, Heng Ji
LRM
64
9
0
11 Oct 2024
Scaling Laws for Neural Language Models
Jared Kaplan, Sam McCandlish, T. Henighan, Tom B. Brown, B. Chess, R. Child, Scott Gray, Alec Radford, Jeff Wu, Dario Amodei
270
4,576
0
23 Jan 2020