2504.13958
Cited By
ToolRL: Reward is All Tool Learning Needs
16 April 2025
Cheng Qian
Emre Can Acikgoz
Qi He
Hongru Wang
Xiusi Chen
Dilek Hakkani-Tur
Gokhan Tur
Heng Ji
OffRL
LRM
Papers citing "ToolRL: Reward is All Tool Learning Needs"
5 / 5 papers shown
Title
Can Global XAI Methods Reveal Injected Bias in LLMs? SHAP vs Rule Extraction vs RuleSHAP
Francesco Sovrano
12
0
0
16 May 2025
Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs
Yaorui Shi
Shihan Li
Chang Wu
Zhiyuan Liu
Fan Zhang
Hengxing Cai
An Zhang
Xinbing Wang
ReLM
LRM
26
0
0
16 May 2025
Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey
Da Zheng
Lun Du
Junwei Su
Yuchen Tian
Yuqi Zhu
Jintian Zhang
Lanning Wei
Ningyu Zhang
H. Chen
LRM
61
0
0
06 May 2025
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models
Xiaobao Wu
LRM
72
1
0
05 May 2025
OTC: Optimal Tool Calls via Reinforcement Learning
Hongru Wang
Cheng Qian
Wanjun Zhong
Xiusi Chen
Jiahao Qiu
Shijue Huang
Bowen Jin
Mengdi Wang
Kam-Fai Wong
Heng Ji
OffRL
LRM
36
1
0
21 Apr 2025