ResearchTrend.AI


SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM


19 April 2025
X. Zhang
J. Wang
Zifei Cheng
Wenhao Zhuang
Zheng Lin
Minglei Zhang
Shaojie Wang
Yinghan Cui
Chao Wang
J. Peng
Shimiao Jiang
Shiqi Kuang
Shouyu Yin
Chaohang Wen
Haotian Zhang
Bin Chen
Bing Yu
    LRM

Papers citing "SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM"

1 / 1 papers shown
Title
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Haoran Xu
Baolin Peng
Hany Awadalla
Dongdong Chen
Yen-Chun Chen
...
Yelong Shen
S. Wang
Weijian Xu
Jianfeng Gao
Weizhu Chen
ReLM
LRM
75
1
0
30 Apr 2025