Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.20571
Cited By
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
29 April 2025
Yiping Wang
Qing Yang
Zhiyuan Zeng
Liliang Ren
Lucas Liu
Baolin Peng
Hao Cheng
Xuehai He
Kuan Wang
Jianfeng Gao
Weizhu Chen
Shuohang Wang
Simon Shaolei Du
Yelong Shen
OffRL
ReLM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Reinforcement Learning for Reasoning in Large Language Models with One Training Example"
Title
No papers