Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.14286
Cited By
SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM
19 April 2025
X. Zhang
J. Wang
Zifei Cheng
Wenhao Zhuang
Zheng Lin
Minglei Zhang
Shaojie Wang
Yinghan Cui
Chao Wang
J. Peng
Shimiao Jiang
Shiqi Kuang
Shouyu Yin
Chaohang Wen
Haotian Zhang
Bin Chen
Bing Yu
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM"
1 / 1 papers shown
Title
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Haoran Xu
Baolin Peng
Hany Awadalla
Dongdong Chen
Yen-Chun Chen
...
Yelong Shen
S. Wang
Weijian Xu
Jianfeng Gao
Weizhu Chen
ReLM
LRM
75
1
0
30 Apr 2025
1