arXiv:2505.17005
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
22 May 2025
Huatong Song
Jinhao Jiang
Wenqing Tian
Zhongfu Chen
Yuhuan Wu
Jiahao Zhao
Yingqian Min
Wayne Xin Zhao
Lei Fang
Ji-Rong Wen
RALM
KELM
AI4TS
LRM
Papers citing "R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning"
5 / 5 papers shown
Title
Towards Effective Code-Integrated Reasoning
Fei Bai
Yingqian Min
Beichen Zhang
Zhipeng Chen
Wayne Xin Zhao
Lei Fang
Zheng Liu
Zhongyuan Wang
Ji-Rong Wen
OffRL
LRM
18
0
0
30 May 2025
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Bowen Jin
Hansi Zeng
Zhenrui Yue
Sercan O. Arik
Dong Wang
Hamed Zamani
Jiawei Han
RALM
ReLM
KELM
OffRL
AI4TS
LRM
224
122
0
12 Mar 2025
Atom of Thoughts for Markov LLM Test-Time Scaling
Fengwei Teng
Zhaoyang Yu
Quan Shi
Jiayi Zhang
Chenglin Wu
Yuyu Luo
MU
LRM
134
23
0
17 Feb 2025
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Tianzhe Chu
Yuexiang Zhai
Jihan Yang
Shengbang Tong
Saining Xie
Dale Schuurmans
Quoc V. Le
Sergey Levine
Yi-An Ma
OffRL
245
128
0
28 Jan 2025
FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research
Jiajie Jin
Yutao Zhu
Xinyu Yang
Chenghao Zhang
Tong Zhao
Zhao Yang
Zhicheng Dou
Ji-Rong Wen
VLM
165
72
0
22 May 2024