Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.11343
Cited By
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
15 April 2025
Wei Xiong
Jiarui Yao
Yuhui Xu
Bo Pang
Lei Wang
Doyen Sahoo
Junnan Li
Nan Jiang
Tong Zhang
Caiming Xiong
Hanze Dong
OffRL
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce"
3 / 3 papers shown
Title
Beyond Áha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models
Zhiyuan Hu
Yixuan Wang
Hanze Dong
Yuhui Xu
Amrita Saha
Caiming Xiong
Bryan Hooi
Junnan Li
LRM
24
0
0
15 May 2025
Scalable Chain of Thoughts via Elastic Reasoning
Yuhui Xu
Hanze Dong
Lei Wang
Doyen Sahoo
Junnan Li
Caiming Xiong
OffRL
LRM
51
2
0
08 May 2025
Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
Jiarui Yao
Yifan Hao
Hanning Zhang
Hanze Dong
Wei Xiong
Nan Jiang
Tong Zhang
LRM
62
0
0
05 May 2025
1