2505.17281
Cited By
Search Wisely: Mitigating Sub-optimal Agentic Searches By Reducing Uncertainty
22 May 2025
Peilin Wu
Mian Zhang
Xinlu Zhang
Xinya Du
Zhiyu Zoey Chen
Papers citing
"Search Wisely: Mitigating Sub-optimal Agentic Searches By Reducing Uncertainty"
8 / 8 papers shown
Title
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Bowen Jin
Hansi Zeng
Zhenrui Yue
Jinsung Yoon
Sercan O. Arik
Dong Wang
Hamed Zamani
Jiawei Han
RALM
ReLM
KELM
OffRL
AI4TS
LRM
191
103
0
12 Mar 2025
SMART: Self-Aware Agent for Tool Overuse Mitigation
Cheng Qian
Emre Can Acikgoz
H. Wang
Xiusi Chen
Avirup Sil
Dilek Hakkani-Tur
Gokhan Tur
Heng Ji
LLMAG
KELM
LRM
124
8
0
17 Feb 2025
DeepRAG: Thinking to Retrieve Step by Step for Large Language Models
Xinyan Guan
Jiali Zeng
Fandong Meng
Chunlei Xin
Yaojie Lu
Hongyu Lin
Jia Zheng
Le Sun
Jie Zhou
ReLM
KELM
LRM
85
7
0
03 Feb 2025
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Zhihong Shao
Peiyi Wang
Qihao Zhu
Runxin Xu
Junxiao Song
...
Haowei Zhang
Mingchuan Zhang
Y.K. Li
Y. Wu
Daya Guo
ReLM
LRM
122
1,119
0
05 Feb 2024
Text Embeddings by Weakly-Supervised Contrastive Pre-training
Liang Wang
Nan Yang
Xiaolong Huang
Binxing Jiao
Linjun Yang
Daxin Jiang
Rangan Majumder
Furu Wei
VLM
239
601
0
07 Dec 2022
Measuring and Narrowing the Compositionality Gap in Language Models
Ofir Press
Muru Zhang
Sewon Min
Ludwig Schmidt
Noah A. Smith
M. Lewis
ReLM
KELM
LRM
177
626
0
07 Oct 2022
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
Zhilin Yang
Peng Qi
Saizheng Zhang
Yoshua Bengio
William W. Cohen
Ruslan Salakhutdinov
Christopher D. Manning
RALM
171
2,647
0
25 Sep 2018
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
204
2,654
0
09 May 2017