Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.22757
Cited By
Pre-Training Curriculum for Multi-Token Prediction in Language Models
28 May 2025
Ansar Aynetdinov
Alan Akbik
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Pre-Training Curriculum for Multi-Token Prediction in Language Models"
2 / 2 papers shown
Title
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Alex Warstadt
Aaron Mueller
Leshem Choshen
E. Wilcox
Chengxu Zhuang
...
Rafael Mosquera
Bhargavi Paranjape
Adina Williams
Tal Linzen
Ryan Cotterell
202
121
0
10 Apr 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM
VLM
OffRL
AI4TS
LRM
395
2,028
0
22 Jan 2025
1