Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.12811
Cited By
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
17 March 2025
Kairong Luo
Haodong Wen
Shengding Hu
Zhenbo Sun
Zhiyuan Liu
Maosong Sun
Kaifeng Lyu
Wenguang Chen
CLL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules"
1 / 1 papers shown
Title
Learning Dynamics in Continual Pre-Training for Large Language Models
Xingjin Wang
Howe Tissue
Lu Wang
Linjing Li
D. Zeng
CLL
29
0
0
12 May 2025
1