Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2506.14731
Cited By
v1
v2 (latest)
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs
17 June 2025
Ling Team
Bin Hu
Cai Chen
Deng Zhao
Ding Liu
dingnan jin
Feng Zhu
Hao Dai
Hongzhi Luan
Jia Guo
Jiaming Liu
J. Wu
Jun Mei
Jun Zhou
Junbo Zhao
Junwu Xiong
Kaihong Zhang
Kuan Xu
Lei Liang
Liang Jiang
Liangcheng Fu
Longfei Zheng
Qiang Gao
Qing Cui
Quan Wan
Shaomian Zheng
Shuaicheng Li
Tongkai Yang
Wang Ren
X. Yan
Xiaopei Wan
Xiaoyun Feng
Xin Zhao
Xinxing Yang
Xinyu Kong
Xuemin Yang
Yang Li
Y. Wu
Y. Liu
Zhankai Xu
Zhenduo Zhang
Zhenglei Zhou
Zhenyu Huang
Zhiqiang Zhang
Zihao Wang
Zujie Wen
OffRL
MoE
ALM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs"
Title
No papers