Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.15334
Cited By
Gorilla: Large Language Model Connected with Massive APIs
24 May 2023
Shishir G. Patil
Tianjun Zhang
Xin Wang
Joseph E. Gonzalez
ELM
CLL
ALM
SyDa
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Gorilla: Large Language Model Connected with Massive APIs"
50 / 413 papers shown
Title
How Should We Build A Benchmark? Revisiting 274 Code-Related Benchmarks For LLMs
Jialun Cao
Yuk-Kit Chan
Zixuan Ling
Wenxuan Wang
Shuqing Li
...
Pinjia He
Shuai Wang
Zibin Zheng
Michael R. Lyu
Shing-Chi Cheung
ALM
179
2
0
18 Jan 2025
Large Language Models, Knowledge Graphs and Search Engines: A Crossroads for Answering Users' Questions
Aidan Hogan
Xin Luna Dong
Denny Vrandečić
Gerhard Weikum
121
5
0
12 Jan 2025
Multi-Agent Collaboration Mechanisms: A Survey of LLMs
Khanh-Tung Tran
Dung Dao
Minh-Duong Nguyen
Quoc-Viet Pham
Barry O'Sullivan
Hoang D. Nguyen
LLMAG
151
57
0
10 Jan 2025
NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models
Han Han
Tong Zhu
Xiang Zhang
Mengsong Wu
Hao Xiong
Wenliang Chen
49
0
0
08 Jan 2025
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
Junjie Ye
Zhengyin Du
Xuesong Yao
Weijian Lin
Yufei Xu
...
Siyu Yuan
Tao Gui
Qi Zhang
Xuanjing Huang
Jiecao Chen
147
0
0
05 Jan 2025
Optimizing Small Language Models for In-Vehicle Function-Calling
Yahya Sowti Khiabani
Farris Atif
Chieh Hsu
Sven Stahlmann
Tobias Michels
Sebastian Kramer
Benedikt Heidrich
M. Saquib Sarfraz
Julian Merten
Faezeh Tafazzoli
82
1
0
04 Jan 2025
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
Hiroki Furuta
Yutaka Matsuo
Aleksandra Faust
Izzeddin Gur
CLL
221
16
0
03 Jan 2025
Plancraft: an evaluation dataset for planning with LLM agents
Gautier Dagan
Frank Keller
A. Lascarides
LLMAG
60
1
0
31 Dec 2024
The Task Shield: Enforcing Task Alignment to Defend Against Indirect Prompt Injection in LLM Agents
Feiran Jia
Tong Wu
Xin Qin
Anna Squicciarini
LLMAG
AAML
152
7
0
21 Dec 2024
Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning
Ziang Ye
Zizhuo Zhang
Yang Zhang
Jianxin Ma
Junyang Lin
Fuli Feng
LRM
127
0
0
19 Dec 2024
CAD-Assistant: Tool-Augmented VLLMs as Generic CAD Task Solvers
Dimitrios Mallis
Ahmet Serdar Karadeniz
Sebastian Cavada
Danila Rukhovich
Niki Maria Foteinopoulou
K. Cherenkova
Anis Kacem
Djamila Aouada
194
7
0
18 Dec 2024
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
Frank F. Xu
Yufan Song
Boxuan Li
Yuxuan Tang
Kritanjali Jain
...
Wayne Chi
Lawrence Jang
Yiqing Xie
Shuyan Zhou
Graham Neubig
LLMAG
204
42
0
18 Dec 2024
Empowering LLMs to Understand and Generate Complex Vector Graphics
Ximing Xing
Juncheng Hu
Guotao Liang
Jing Zhang
Dong Xu
Qian Yu
192
12
0
15 Dec 2024
GraphTool-Instruction: Revolutionizing Graph Reasoning in LLMs through Decomposed Subtask Instruction
Rongzheng Wang
Shuang Liang
Qizhi Chen
Jiasheng Zhang
Ke Qin
114
0
0
11 Dec 2024
LABIIUM: AI-Enhanced Zero-configuration Measurement Automation System
Emmanuel A. Olowe
Danial Chitnis
129
0
0
07 Dec 2024
Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation
Robin D. Pesl
Jerin G. Mathew
Massimo Mecella
Marco Aiello
106
2
0
29 Nov 2024
Action Engine: An LLM-based Framework for Automatic FaaS Workflow Generation
Akiharu Esashi
Pawissanutt Lertpongrujikorn
M. Salehi
103
0
0
29 Nov 2024
MAG-V: A Multi-Agent Framework for Synthetic Data Generation and Verification
Saptarshi Sengupta
Kristal Curtis
Akshay Mallipeddi
Abhinav Mathur
Joseph Ross
Liang Gou
Liang Gou
LLMAG
SyDa
214
2
0
28 Nov 2024
CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning
Duo Wu
Jiangming Wang
Yuan Meng
Yanning Zhang
Le Sun
Zhi Wang
536
0
0
25 Nov 2024
ToolScan: A Benchmark for Characterizing Errors in Tool-Use LLMs
Shirley Kokane
Ming Zhu
Tulika Awalgaonkar
Jianguo Zhang
Thai Hoang
...
Juan Carlos Niebles
Huan Wang
Shelby Heinecke
Caiming Xiong
Silivo Savarese
LLMAG
182
2
0
20 Nov 2024
LibEvolutionEval: A Benchmark and Study for Version-Specific Code Generation
Sachit Kuhar
W. Ahmad
Zijian Wang
Nihal Jain
Haifeng Qian
Baishakhi Ray
M. K. Ramanathan
Xiaofei Ma
Anoop Deoras
ELM
83
1
0
19 Nov 2024
PTR: Precision-Driven Tool Recommendation for Large Language Models
Hang Gao
Yongfeng Zhang
KELM
75
0
0
14 Nov 2024
From Medprompt to o1: Exploration of Run-Time Strategies for Medical Challenge Problems and Beyond
Harsha Nori
Naoto Usuyama
Nicholas King
S. McKinney
Xavier Fernandes
Sheng Zhang
Eric Horvitz
LRM
LM&MA
ELM
VLM
107
13
0
06 Nov 2024
EcoAct: Economic Agent Determines When to Register What Action
Shaokun Zhang
Jieyu Zhang
Dujian Ding
Mirian Hipolito Garcia
Ankur Mallick
Daniel Madrigal
Menglin Xia
Victor Rühle
Qingyun Wu
Chi Wang
LLMAG
100
4
0
03 Nov 2024
CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research
Sian-Yao Huang
Cheng-Lin Yang
Hongpeng Zhou
Chun-Ying Huang
85
2
0
02 Nov 2024
Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation
Bohan Lyu
Yadi Cao
Duncan Watson-Parris
Leon Bergen
Taylor Berg-Kirkpatrick
Rose Yu
135
5
0
01 Nov 2024
FATH: Authentication-based Test-time Defense against Indirect Prompt Injection Attacks
Jiongxiao Wang
Fangzhou Wu
Wendi Li
Jinsheng Pan
Edward Suh
Zhuoqing Mao
Muhao Chen
Chaowei Xiao
AAML
79
8
0
28 Oct 2024
Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks
Graziano A. Manduzio
Federico A. Galatolo
M. G. Cimino
Enzo Pasquale Scilingo
Lorenzo Cominelli
LRM
40
1
0
24 Oct 2024
PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Zhiwei Liu
Weiran Yao
Jianguo Zhang
Rithesh Murthy
Liangwei Yang
...
Juan Carlos Niebles
Shelby Heinecke
Huan Wang
Silvio Savarese
Caiming Xiong
31
0
0
24 Oct 2024
Beyond Browsing: API-Based Web Agents
Yueqi Song
Frank F. Xu
Shuyan Zhou
Graham Neubig
164
23
0
21 Oct 2024
HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks
Fengji Zhang
Linquan Wu
Huiyu Bai
Guancheng Lin
Xiao Li
Xiao Yu
Yue Wang
Bei Chen
Jacky Keung
MLLM
ELM
LRM
104
0
0
16 Oct 2024
ShapefileGPT: A Multi-Agent Large Language Model Framework for Automated Shapefile Processing
Qingming Lin
Rui Hu
Huaxia Li
Sensen Wu
Yadong Li
Kai Fang
Hailin Feng
Zhenhong Du
Liuchang Xu
LLMAG
AI4CE
90
3
0
16 Oct 2024
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning
Mingyang Chen
Haoze Sun
Tianpeng Li
Fan Yang
Hao Liang
Keer Lu
Tengjiao Wang
Wentao Zhang
Guosheng Dong
Weipeng Chen
LRM
131
6
0
16 Oct 2024
Toolken+: Improving LLM Tool Usage with Reranking and a Reject Option
Konstantin Yakovlev
Sergey I. Nikolenko
A. Bout
53
1
0
15 Oct 2024
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models
Pei Wang
Yanan Wu
Zekun Wang
Qingbin Liu
Xiaoshuai Song
...
Ge Zhang
Hangyu Guo
Zhaoxiang Zhang
Wenbo Su
Bo Zheng
ELM
107
3
0
15 Oct 2024
Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs
Ishan Jindal
Chandana Badrinath
Pranjal Bharti
Lakkidi Vinay
Sachin Dev Sharma
CLL
ALM
85
2
0
14 Oct 2024
Agentic Information Retrieval
Weinan Zhang
Junwei Liao
Ning Li
Kounianhua Du
Jianghao Lin
AIFin
123
7
0
13 Oct 2024
CAMPHOR: Collaborative Agents for Multi-input Planning and High-Order Reasoning On Device
Yicheng Fu
R. Anantha
Jianpeng Cheng
LRM
LLMAG
90
4
0
12 Oct 2024
DAWN: Designing Distributed Agents in a Worldwide Network
Zahra Aminiranjbar
Jianan Tang
Qiudan Wang
Shubha Pant
Mahesh Viswanathan
AI4CE
LLMAG
94
2
0
11 Oct 2024
VoxelPrompt: A Vision-Language Agent for Grounded Medical Image Analysis
Andrew Hoopes
V. Butoi
John Guttag
Adrian V. Dalca
MedIm
LM&MA
102
2
0
10 Oct 2024
Agent S: An Open Agentic Framework that Uses Computers Like a Human
Saaket Agashe
Jiuzhou Han
Shuyu Gan
Jiachen Yang
Ang Li
Xin Eric Wang
LLMAG
LM&Ro
AIFin
108
38
0
10 Oct 2024
AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction
Hongru Wang
Rui Wang
Boyang Xue
Heming Xia
Jingtao Cao
Zeming Liu
Jeff Z. Pan
Kam-Fai Wong
ALM
95
16
0
10 Oct 2024
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
Changle Qu
Sunhao Dai
Xiaochi Wei
Hengyi Cai
Shuaiqiang Wang
D. Yin
Jun Xu
Ji-Rong Wen
155
13
0
10 Oct 2024
AutoFeedback: An LLM-based Framework for Efficient and Accurate API Request Generation
Huanxi Liu
Jiaqi Liao
Dawei Feng
Kele Xu
Huaimin Wang
439
1
0
09 Oct 2024
CursorCore: Assist Programming through Aligning Anything
Hao Jiang
Qi Liu
Rui Li
Shengyu Ye
Shijin Wang
150
1
0
09 Oct 2024
ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities
Zhenchao Jin
Mengchen Liu
Dongdong Chen
Lingting Zhu
Yunsheng Li
Lequan Yu
KELM
38
0
0
08 Oct 2024
Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
Qiqiang Lin
Muning Wen
Qiuying Peng
Guanyu Nie
Junwei Liao
...
Jiamu Zhou
Cheng Cheng
Yin Zhao
Jun Wang
Weinan Zhang
88
21
0
06 Oct 2024
Residual Policy Learning for Perceptive Quadruped Control Using Differentiable Simulation
Jing Yuan Luo
Yunlong Song
Victor Klemm
Fan Shi
Davide Scaramuzza
Marco Hutter
91
5
0
04 Oct 2024
ToolGen: Unified Tool Retrieval and Calling via Generation
Renxi Wang
Xudong Han
Lei Ji
Shu Wang
Timothy Baldwin
Haonan Li
LLMAG
168
9
0
04 Oct 2024
Visual Editing with LLM-based Tool Chaining: An Efficient Distillation Approach for Real-Time Applications
Oren Sultan
Alex Khasin
Guy Shiran
Asnat Greenstein-Messica
Dafna Shahaf
48
0
0
03 Oct 2024
Previous
1
2
3
4
5
6
7
8
9
Next