ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.14731
  4. Cited By
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs
v1v2 (latest)

Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs

17 June 2025
Ling Team
Bin Hu
Cai Chen
Deng Zhao
Ding Liu
dingnan jin
Feng Zhu
Hao Dai
Hongzhi Luan
Jia Guo
Jiaming Liu
J. Wu
Jun Mei
Jun Zhou
Junbo Zhao
Junwu Xiong
Kaihong Zhang
Kuan Xu
Lei Liang
Liang Jiang
Liangcheng Fu
Longfei Zheng
Qiang Gao
Qing Cui
Quan Wan
Shaomian Zheng
Shuaicheng Li
Tongkai Yang
Wang Ren
X. Yan
Xiaopei Wan
Xiaoyun Feng
Xin Zhao
Xinxing Yang
Xinyu Kong
Xuemin Yang
Yang Li
Y. Wu
Y. Liu
Zhankai Xu
Zhenduo Zhang
Zhenglei Zhou
Zhenyu Huang
Zhiqiang Zhang
Zihao Wang
Zujie Wen
    OffRLMoEALMLRM
ArXiv (abs)PDFHTML

Papers citing "Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs"

Title
No papers