Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.14135
Cited By
Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective
18 December 2024
Zhiyuan Zeng
Qinyuan Cheng
Zhangyue Yin
Bo Wang
Shimin Li
Yunhua Zhou
Qipeng Guo
Xuanjing Huang
Xipeng Qiu
ELM
AI4TS
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective"
5 / 5 papers shown
Title
Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL
Che Liu
Haozhe Wang
J. Pan
Zhongwei Wan
Yong Dai
Fangzhen Lin
Wenjia Bai
Daniel Rueckert
Rossella Arcucci
OffRL
LRM
ELM
28
0
0
23 May 2025
MARFT: Multi-Agent Reinforcement Fine-Tuning
Junwei Liao
Muning Wen
Jun Wang
Weinan Zhang
OffRL
57
1
0
21 Apr 2025
InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models
Yuchen Yan
Yongliang Shen
Yuhang Liu
Jin Jiang
Hao Fei
Jian Shao
Yueting Zhuang
LRM
ReLM
67
5
0
09 Mar 2025
DISC: Dynamic Decomposition Improves LLM Inference Scaling
Jonathan Light
Wei Cheng
Wu Yue
Masafumi Oyamada
Mengdi Wang
Santiago Paternain
Haifeng Chen
ReLM
LRM
71
2
0
23 Feb 2025
Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking
Jinyang Wu
Mingkuan Feng
Shuai Zhang
Ruihan Jin
Feihu Che
Zengqi Wen
J. Tao
LRM
72
11
0
04 Feb 2025
1