Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.15334
Cited By
Gorilla: Large Language Model Connected with Massive APIs
24 May 2023
Shishir G. Patil
Tianjun Zhang
Xin Wang
Joseph E. Gonzalez
ELM
CLL
ALM
SyDa
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Gorilla: Large Language Model Connected with Massive APIs"
50 / 413 papers shown
Title
Cost-Efficient Serving of LLM Agents via Test-Time Plan Caching
Qizheng Zhang
Michael Wornow
Kunle Olukotun
21
0
0
17 Jun 2025
NaSh: Guardrails for an LLM-Powered Natural Language Shell
Bimal Raj Gyawali
Saikrishna Achalla
Konstantinos Kallas
Sam Kumar
19
0
0
16 Jun 2025
TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation Tasks
Zhou Chen
Zhiqiang Wei
Yuqi Bai
Xue Xiong
Jianmin Wu
3DV
19
0
0
14 Jun 2025
Interaction, Process, Infrastructure: A Unified Architecture for Human-Agent Collaboration
Yun Wang
Yan Lu
AI4CE
19
0
0
13 Jun 2025
CRITICTOOL: Evaluating Self-Critique Capabilities of Large Language Models in Tool-Calling Error Scenarios
Shiting Huang
Zhen Fang
Zehui Chen
Siyu Yuan
Junjie Ye
Y. Zeng
Lin Yen-Chen
Qi Mao
Feng Zhao
LLMAG
KELM
34
0
0
11 Jun 2025
ORFS-agent: Tool-Using Agents for Chip Design Optimization
Amur Ghose
Andrew B. Kahng
Sayak Kundu
Zhiang Wang
AI4CE
23
0
0
10 Jun 2025
Design Patterns for Securing LLM Agents against Prompt Injections
Luca Beurer-Kellner
Beat Buesser Ana-Maria Creţu
Edoardo Debenedetti
Daniel Dobos
Daniel Fabian
...
Daniel Naeff
Ezinwanne Ozoani
Andrew Paverd
F. Tramèr
Václav Volhejn
LLMAG
SILM
AAML
42
0
0
10 Jun 2025
SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents
Subhrangshu Nandi
Arghya Datta
Nikhil Vichare
Indranil Bhattacharya
Huzefa Raja
...
Aaron Chan
Man Ho Woo
Amar Kandola
Brandon Theresa
Francesco Carbone
LLMAG
22
0
0
09 Jun 2025
hdl2v: A Code Translation Dataset for Enhanced LLM Verilog Generation
Charles Hong
Brendan Roberts
Huijae An
Alex Um
Advay Ratan
Y. Shao
124
0
0
05 Jun 2025
Simple Prompt Injection Attacks Can Leak Personal Data Observed by LLM Agents During Task Execution
Meysam Alizadeh
Zeynab Samei
Daria Stetsenko
Fabrizio Gilardi
SILM
59
0
0
01 Jun 2025
MCP-Zero: Active Tool Discovery for Autonomous LLM Agents
Xiang Fei
Xiawu Zheng
Hao Feng
LLMAG
67
0
0
01 Jun 2025
ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions
Beong-woo Kwak
Minju Kim
Dongha Lim
Hyungjoo Chae
Dongjin Kang
Sunghwan Kim
Dongil Yang
Jinyoung Yeo
LLMAG
RALM
79
0
0
29 May 2025
ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks
Akashah Shabbir
Muhammad Akhtar Munir
Akshay Dudhane
Muhammad Umer Sheikh
M. H. Khan
Paolo Fraccaro
Juan Bernabé-Moreno
Fahad Shahbaz Khan
Salman Khan
LLMAG
ELM
64
0
0
29 May 2025
LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents
Taro Yano
Yoichi Ishibashi
Masafumi Oyamada
LM&Ro
66
1
0
28 May 2025
Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking
Yihan Chen
Benfeng Xu
Xiaorui Wang
Yongdong Zhang
Zhendong Mao
LLMAG
42
0
0
26 May 2025
TTPA: Token-level Tool-use Preference Alignment Training Framework with Fine-grained Evaluation
Chengrui Huang
Shen Gao
Zhengliang Shi
Dongsheng Wang
Shuo Shang
64
0
0
26 May 2025
FunReason: Enhancing Large Language Models' Function Calling via Self-Refinement Multiscale Loss and Automated Data Refinement
Bingguang Hao
Maolin Wang
Zengzhuang Xu
Cunyin Peng
Yicheng Chen
Xiangyu Zhao
Jinjie Gu
Chenyi Zhuang
ReLM
LRM
115
0
0
26 May 2025
VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
Zeyi Huang
Zeyi Huang
Anirudh Sundara Rajan
Zefan Cai
Wen Xiao
Junjie Hu
Yong Jae Lee
75
0
0
26 May 2025
Retrieval-Augmented Generation for Service Discovery: Chunking Strategies and Benchmarking
Robin D. Pesl
Jerin G. Mathew
Massimo Mecella
Marco Aiello
62
1
0
25 May 2025
LiteCUA: Computer as MCP Server for Computer-Use Agent on AIOS
Kai Mei
Xi Zhu
Hang Gao
Shuhang Lin
Yongfeng Zhang
223
0
0
24 May 2025
Gaming Tool Preferences in Agentic LLMs
Kazem Faghih
Wenxiao Wang
Yize Cheng
Siddhant Bharti
Gaurang Sriramanan
S. Balasubramanian
Parsa Hosseini
Soheil Feizi
LLMAG
KELM
125
0
0
23 May 2025
T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning
Amartya Chakraborty
Paresh Dashore
Nadia Bathaee
Anmol Jain
Anirban Das
Shi-Xiong Zhang
Sambit Sahu
Milind Naphade
Genta Indra Winata
LLMAG
120
0
0
22 May 2025
MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models
Xuanqi Gao
Siyi Xie
Juan Zhai
Shqing Ma
Chao Shen
ELM
128
0
0
22 May 2025
ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges
Cheng Qian
Hongyi Du
Hongru Wang
Xiusi Chen
Yuji Zhang
Avirup Sil
Chengxiang Zhai
Kathleen McKeown
Heng Ji
LLMAG
72
2
0
21 May 2025
Procedural Environment Generation for Tool-Use Agents
Michael Sullivan
Mareike Hartmann
Alexander Koller
SyDa
25
0
0
21 May 2025
RRTL: Red Teaming Reasoning Large Language Models in Tool Learning
Yifei Liu
Yu Cui
Haibin Zhang
LRM
127
0
0
21 May 2025
Prolonged Reasoning Is Not All You Need: Certainty-Based Adaptive Routing for Efficient LLM/MLLM Reasoning
Jinghui Lu
Haiyang Yu
Siliang Xu
Shiwei Ran
Guozhi Tang
...
Teng Fu
Hao Feng
Jingqun Tang
Hongru Wang
Can Huang
LRM
116
3
0
21 May 2025
A Challenge to Build Neuro-Symbolic Video Agents
Sahil Shah
Harsh Goel
Sai Shankar Narasimhan
Minkyu Choi
S P Sharan
Oguzhan Akcin
Sandeep Chinchali
AI4TS
78
0
0
20 May 2025
DrugPilot: LLM-based Parameterized Reasoning Agent for Drug Discovery
Kun Li
Zhennan Wu
Shoupeng Wang
Wenbin Hu
LLMAG
LM&MA
63
0
0
20 May 2025
Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges
Hongru Wang
Wenyu Huang
Yufei Wang
Yuanhao Xi
Jianqiao Lu
Huan Zhang
Nan Hu
Zeming Liu
Jeff Z. Pan
Kam-Fai Wong
LLMAG
101
0
0
19 May 2025
Ambiguity Resolution in Text-to-Structured Data Mapping
Zhibo Hu
Chen Wang
Yanfeng Shu
Hye-Young Paik
Liming Zhu
76
0
0
16 May 2025
ShiQ: Bringing back Bellman to LLMs
Pierre Clavier
Nathan Grinsztajn
Raphaël Avalos
Yannis Flet-Berliac
Irem Ergun
...
Eugene Tarassov
Olivier Pietquin
Pierre Harvey Richemond
Florian Strub
Matthieu Geist
OffRL
68
0
0
16 May 2025
TAIJI: MCP-based Multi-Modal Data Analytics on Data Lakes
Chao Zhang
Shaolei Zhang
Quehuan Liu
Sibei Chen
Tong Li
Ju Fan
63
0
0
16 May 2025
RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs
Vibha Belavadi
Tushar Vatsa
Dewang Sultania
Suhas Suresha
Ishita Verma
Chong Chen
Tracy Holloway King
Michael Friedrich
SyDa
129
0
0
15 May 2025
ALOHA: Empowering Multilingual Agent for University Orientation with Hierarchical Retrieval
Mingxu Tao
Bowen Tang
Mingxuan Ma
Yining Zhang
Hourun Li
Feifan Wen
Hao Ma
Jia-Qi Yang
56
0
0
13 May 2025
TRAIL: Trace Reasoning and Agentic Issue Localization
Darshan Deshpande
Varun Gangal
Hersh Mehta
Jitin Krishnan
Anand Kannappan
Rebecca Qian
140
0
0
13 May 2025
Large Language Models for Computer-Aided Design: A Survey
Licheng Zhang
Bach Le
Naveed Akhtar
Siew-Kei Lam
Tuan Ngo
3DV
AI4CE
139
1
0
13 May 2025
ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution
Xiaolin Huang
Weiwen Liu
Xingshan Zeng
Yanhua Huang
Xinlong Hao
...
Yirong Zeng
Chuhan Wu
Yun Wang
Ruiming Tang
Defu Lian
KELM
79
0
0
12 May 2025
ScaleMCP: Dynamic and Auto-Synchronizing Model Context Protocol Tools for LLM Agents
Elias Lumer
Anmol Gulati
Vamse Kumar Subbiah
Pradeep Honaganahalli Basavaraju
James A. Burke
70
0
0
09 May 2025
PyTDC: A multimodal machine learning training, evaluation, and inference platform for biomedical foundation models
Alejandro Velez-Arce
Marinka Zitnik
111
0
0
08 May 2025
VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making
Jake Grigsby
Yuke Zhu
Michael S Ryoo
Juan Carlos Niebles
OffRL
VLM
96
1
0
06 May 2025
RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval-Augmented Generation
Tiantian Gan
Qiyao Sun
35
2
0
06 May 2025
CarbonCall: Sustainability-Aware Function Calling for Large Language Models on Edge Devices
Varatheepan Paramanayakam
Andreas Karatzas
Iraklis Anagnostopoulos
Dimitrios Stamoulis
73
1
0
29 Apr 2025
Prompt Injection Attack to Tool Selection in LLM Agents
Jiawen Shi
Zenghui Yuan
Guiyao Tie
Pan Zhou
Neil Zhenqiang Gong
Lichao Sun
LLMAG
139
4
0
28 Apr 2025
When2Call: When (not) to Call Tools
Hayley Ross
Ameya Sunil Mahabaleshwarkar
Yoshi Suhara
141
1
0
26 Apr 2025
Towards Adaptive Software Agents for Debugging
Yacine Majdoub
Eya Ben Charrada
Haifa Touati
LLMAG
157
0
0
25 Apr 2025
One-Pass to Reason: Token Duplication and Block-Sparse Mask for Efficient Fine-Tuning on Multi-Turn Reasoning
Ritesh Goru
Shanay Mehta
Prateek Jain
LRM
75
0
0
25 Apr 2025
A Survey of AI Agent Protocols
Yue Yang
Huacan Chai
Yangqiu Song
S. Qi
Muning Wen
...
Gaowei Chang
Wen Liu
Ying Wen
Yong Yu
Weinan Zhang
LLMAG
144
11
0
23 Apr 2025
WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks
Ivan Evtimov
Arman Zharmagambetov
Aaron Grattafiori
Chuan Guo
Kamalika Chaudhuri
AAML
118
4
0
22 Apr 2025
Synergizing RAG and Reasoning: A Systematic Review
Yunfan Gao
Yun Xiong
Yijie Zhong
Yuxi Bi
Ming Xue
Haoyu Wang
LRM
AI4CE
462
7
0
22 Apr 2025
1
2
3
4
5
6
7
8
9
Next