ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.08244
  4. Cited By
API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs

API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs

14 April 2023
Minghao Li
Yingxiu Zhao
Yu Bowen
Feifan Song
Hangyu Li
Haiyang Yu
Zhoujun Li
Fei Huang
Yongbin Li
    ELM
    RALM
    CLL
ArXivPDFHTML

Papers citing "API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs"

50 / 113 papers shown
Title
OMAC: A Broad Optimization Framework for LLM-Based Multi-Agent Collaboration
OMAC: A Broad Optimization Framework for LLM-Based Multi-Agent Collaboration
Shijun Li
Hilaf Hasson
Joydeep Ghosh
LLMAG
13
0
0
17 May 2025
ScaleMCP: Dynamic and Auto-Synchronizing Model Context Protocol Tools for LLM Agents
ScaleMCP: Dynamic and Auto-Synchronizing Model Context Protocol Tools for LLM Agents
Elias Lumer
Anmol Gulati
Vamse Kumar Subbiah
Pradeep Honaganahalli Basavaraju
James A. Burke
28
0
0
09 May 2025
Prompt Injection Attack to Tool Selection in LLM Agents
Prompt Injection Attack to Tool Selection in LLM Agents
Jiawen Shi
Zenghui Yuan
Guiyao Tie
Pan Zhou
Neil Zhenqiang Gong
Lichao Sun
LLMAG
51
0
0
28 Apr 2025
Nemotron-Research-Tool-N1: Exploring Tool-Using Language Models with Reinforced Reasoning
Nemotron-Research-Tool-N1: Exploring Tool-Using Language Models with Reinforced Reasoning
Shaokun Zhang
Yi Dong
Jieyu Zhang
Jan Kautz
Bryan Catanzaro
Andrew Tao
Qingyun Wu
Zhiding Yu
Guilin Liu
LLMAG
OffRL
KELM
LRM
97
0
0
25 Apr 2025
Param$Δ$ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost
ParamΔΔΔ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost
Sheng Cao
Mingrui Wu
Karthik Prasad
Yuandong Tian
Zechun Liu
MoMe
85
0
0
23 Apr 2025
The Athenian Academy: A Seven-Layer Architecture Model for Multi-Agent Systems
The Athenian Academy: A Seven-Layer Architecture Model for Multi-Agent Systems
Lidong Zhai
Zhijie Qiu
Lvyang Zhang
Jiaqi Li
Yansen Wang
Wen Lu
Xizhong Guo
Ge Sun
AI4CE
34
0
0
17 Apr 2025
ToolRL: Reward is All Tool Learning Needs
ToolRL: Reward is All Tool Learning Needs
Cheng Qian
Emre Can Acikgoz
Qi He
Hongru Wang
Xiusi Chen
Dilek Hakkani-Tur
Gokhan Tur
Heng Ji
OffRL
LRM
38
7
0
16 Apr 2025
Multimodal Agricultural Agent Architecture (MA3): A New Paradigm for Intelligent Agricultural Decision-Making
Multimodal Agricultural Agent Architecture (MA3): A New Paradigm for Intelligent Agricultural Decision-Making
Zhuoning Xu
Jian Xu
Hao Fei
Peijie Wang
Chao Deng
Cheng-Lin Liu
31
0
0
07 Apr 2025
A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions
A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions
Emre Can Acikgoz
Cheng Qian
Hongru Wang
Vardhan Dongre
Xiusi Chen
Heng Ji
Dilek Hakkani-Tur
Gokhan Tur
LM&Ro
ELM
55
1
0
07 Apr 2025
Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions
Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions
Peijie Yu
Yifan Yang
Jiyang Li
Zelong Zhang
Haorui Wang
Xiao Feng
Feng Zhang
LLMAG
119
0
0
03 Apr 2025
ToolACE-R: Tool Learning with Adaptive Self-Refinement
ToolACE-R: Tool Learning with Adaptive Self-Refinement
Xingshan Zeng
Wen Liu
Xiaolin Huang
Zezhong Wang
Lingzhi Wang
...
Yishuo Wang
Lifeng Shang
Xin Jiang
Ruiming Tang
Qiang Liu
CLL
60
0
0
02 Apr 2025
Building Resource-Constrained Language Agents: A Korean Case Study on Chemical Toxicity Information
Building Resource-Constrained Language Agents: A Korean Case Study on Chemical Toxicity Information
Hojun Cho
Donghu Kim
Soyoung Yang
Chan Lee
Hunjoo Lee
Jaegul Choo
59
1
0
22 Mar 2025
Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment
Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment
Gaole Dai
Shiqi Jiang
Ting Cao
Yuanchun Li
Yue Yang
Rui Tan
Mo Li
Lili Qiu
54
2
0
20 Mar 2025
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
Roham Koohestani
Philippe de Bekker
Maliheh Izadi
VLM
52
0
0
07 Mar 2025
Haste Makes Waste: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between Actions
Zirui Wu
Xiao Liu
Jiayi Li
Lingpeng Kong
Yansong Feng
44
1
0
04 Mar 2025
ToolDial: Multi-turn Dialogue Generation Method for Tool-Augmented Language Models
Jeonghoon Shim
Gyuhyeon Seo
Cheongsu Lim
Yohan Jo
49
4
0
01 Mar 2025
Kanana: Compute-efficient Bilingual Language Models
Kanana: Compute-efficient Bilingual Language Models
Kanana LLM Team
Yunju Bak
Hojin Lee
Minho Ryu
Jiyeon Ham
...
Daniel Lee
Minchul Lee
M. Lee
Shinbok Lee
Gaeun Seo
98
1
0
26 Feb 2025
Agent-centric Information Access
Agent-centric Information Access
Evangelos Kanoulas
Panagiotis Eustratiadis
Yongkang Li
Yougang Lyu
Vaishali Pal
Gabrielle Poerwawinata
Jingfen Qiao
Zihan Wang
AIFin
39
0
0
26 Feb 2025
A Cooperative Multi-Agent Framework for Zero-Shot Named Entity Recognition
A Cooperative Multi-Agent Framework for Zero-Shot Named Entity Recognition
Zihan Wang
Ziqi Zhao
Yougang Lyu
Ziyang Chen
Maarten de Rijke
Z. Z. Ren
66
3
0
25 Feb 2025
MutaGReP: Execution-Free Repository-Grounded Plan Search for Code-Use
MutaGReP: Execution-Free Repository-Grounded Plan Search for Code-Use
Zaid Khan
Ali Farhadi
Ranjay Krishna
Luca Weihs
Joey Tianyi Zhou
Tanmay Gupta
44
0
0
21 Feb 2025
Can a Single Model Master Both Multi-turn Conversations and Tool Use? CoALM: A Unified Conversational Agentic Language Model
Can a Single Model Master Both Multi-turn Conversations and Tool Use? CoALM: A Unified Conversational Agentic Language Model
Emre Can Acikgoz
Jeremiah Greer
Akul Datta
Ze Yang
William Zeng
Oussama Elachqar
Emmanouil Koukoumidis
Dilek Hakkani-Tur
Gokhan Tur
LLMAG
108
3
0
20 Feb 2025
Generative AI for Cel-Animation: A Survey
Generative AI for Cel-Animation: A Survey
Yunlong Tang
Junjia Guo
Pinxin Liu
Zhiyuan Wang
Hang Hua
...
Jing Bi
Mingqian Feng
Xuzhao Li
Zeliang Zhang
Chenliang Xu
VGen
96
7
0
08 Jan 2025
NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models
NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models
Han Han
Tong Zhu
Xiang Zhang
Mengsong Wu
Hao Xiong
Wenliang Chen
38
0
0
08 Jan 2025
MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models
Mahir Labib Dihan
Md Tanvir Hassan
Md Tanvir Parvez
Md Hasebul Hasan
Md Almash Alam
Muhammad Aamir Cheema
Mohammed Eunus Ali
Md. Rizwan Parvez
LRM
ELM
43
1
0
03 Jan 2025
HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Device Scenarios
HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Device Scenarios
Jun Wang
Jiamu Zhou
Muning Wen
Xiaoyun Mo
Haifeng Zhang
...
Cheng Jin
Xihuai Wang
Weinan Zhang
Qiuying Peng
Jun Wang
LLMAG
111
0
0
21 Dec 2024
Tree-of-Code: A Tree-Structured Exploring Framework for End-to-End Code
  Generation and Execution in Complex Task Handling
Tree-of-Code: A Tree-Structured Exploring Framework for End-to-End Code Generation and Execution in Complex Task Handling
Ziyi Ni
Yifan Li
Ning Yang
Dou Shen
Pin Lv
Daxiang Dong
LRM
76
0
0
19 Dec 2024
GraphTool-Instruction: Revolutionizing Graph Reasoning in LLMs through
  Decomposed Subtask Instruction
GraphTool-Instruction: Revolutionizing Graph Reasoning in LLMs through Decomposed Subtask Instruction
Rongzheng Wang
Shuang Liang
Qizhi Chen
Jiasheng Zhang
Ke Qin
92
0
0
11 Dec 2024
Advanced System Integration: Analyzing OpenAPI Chunking for
  Retrieval-Augmented Generation
Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation
Robin D. Pesl
Jerin G. Mathew
Massimo Mecella
Marco Aiello
78
1
0
29 Nov 2024
Action Engine: An LLM-based Framework for Automatic FaaS Workflow
  Generation
Action Engine: An LLM-based Framework for Automatic FaaS Workflow Generation
Akiharu Esashi
Pawissanutt Lertpongrujikorn
M. Salehi
82
0
0
29 Nov 2024
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray
  Report Generation Models
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models
Alice Heiman
Xiaoman Zhang
E. Chen
Sung Eun Kim
Pranav Rajpurkar
HILM
MedIm
82
0
0
27 Nov 2024
CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning
CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning
Duo Wu
Yufei Guo
Yuan Meng
Yanning Zhang
Le Sun
Zhi Wang
258
0
0
25 Nov 2024
PTR: Precision-Driven Tool Recommendation for Large Language Models
PTR: Precision-Driven Tool Recommendation for Large Language Models
Hang Gao
Yongfeng Zhang
KELM
46
0
0
14 Nov 2024
EcoAct: Economic Agent Determines When to Register What Action
EcoAct: Economic Agent Determines When to Register What Action
Shaokun Zhang
Jieyu Zhang
Dujian Ding
Mirian Hipolito Garcia
Ankur Mallick
Daniel Madrigal
Menglin Xia
Victor Rühle
Qingyun Wu
Chi Wang
LLMAG
58
4
0
03 Nov 2024
Library Learning Doesn't: The Curious Case of the Single-Use "Library"
Library Learning Doesn't: The Curious Case of the Single-Use "Library"
Ian Berlot-Attwell
Frank Rudzicz
Xujie Si
37
1
0
26 Oct 2024
Improving Small-Scale Large Language Models Function Calling for
  Reasoning Tasks
Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks
Graziano A. Manduzio
Federico A. Galatolo
M. G. Cimino
Enzo Pasquale Scilingo
Lorenzo Cominelli
LRM
29
1
0
24 Oct 2024
Optimizing Chain-of-Thought Reasoning: Tackling Arranging Bottleneck via
  Plan Augmentation
Optimizing Chain-of-Thought Reasoning: Tackling Arranging Bottleneck via Plan Augmentation
Yuli Qiu
Jiashu Yao
Heyan Huang
Yuhang Guo
LRM
31
0
0
22 Oct 2024
A Case for AI Consciousness: Language Agents and Global Workspace Theory
A Case for AI Consciousness: Language Agents and Global Workspace Theory
Simon Goldstein
Cameron Domenico Kirk-Giannini
31
1
0
15 Oct 2024
G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks
G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks
Guibin Zhang
Xinfeng Li
Xiangguo Sun
Guancheng Wan
Miao Yu
Sihang Li
Kun Wang
Dawei Cheng
Dawei Cheng
AAML
AI4CE
54
7
0
15 Oct 2024
AutoFeedback: An LLM-based Framework for Efficient and Accurate API
  Request Generation
AutoFeedback: An LLM-based Framework for Efficient and Accurate API Request Generation
Huanxi Liu
Jiaqi Liao
Dawei Feng
Kele Xu
Huaimin Wang
201
0
0
09 Oct 2024
Learning Evolving Tools for Large Language Models
Learning Evolving Tools for Large Language Models
Guoxin Chen
Zhong Zhang
Xin Cong
Fangda Guo
Yesai Wu
Yankai Lin
Wenzheng Feng
Yasheng Wang
KELM
54
1
0
09 Oct 2024
Hammer: Robust Function-Calling for On-Device Language Models via
  Function Masking
Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
Qiqiang Lin
Muning Wen
Qiuying Peng
Guanyu Nie
Junwei Liao
...
Jiamu Zhou
Cheng Cheng
Yin Zhao
Jun Wang
Weinan Zhang
46
16
0
06 Oct 2024
Cut the Crap: An Economical Communication Pipeline for LLM-based
  Multi-Agent Systems
Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems
Guibin Zhang
Xinfeng Li
Zhixun Li
Sukwon Yun
Guancheng Wan
Kun Wang
Dawei Cheng
Jeffrey Xu Yu
Tianlong Chen
34
9
0
03 Oct 2024
A Survey on Complex Tasks for Goal-Directed Interactive Agents
A Survey on Complex Tasks for Goal-Directed Interactive Agents
Mareike Hartmann
Alexander Koller
LM&Ro
LLMAG
36
1
0
27 Sep 2024
Quality Matters: Evaluating Synthetic Data for Tool-Using LLMs
Quality Matters: Evaluating Synthetic Data for Tool-Using LLMs
Shadi Iskander
Nachshon Cohen
Zohar Karnin
Ori Shapira
Sofia Tolmach
SyDa
29
1
0
24 Sep 2024
SEAL: Suite for Evaluating API-use of LLMs
SEAL: Suite for Evaluating API-use of LLMs
Woojeong Kim
Ashish Jagmohan
Aditya Vempaty
ELM
ALM
LLMAG
37
0
0
23 Sep 2024
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research
  Repositories
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories
Ben Bogin
Kejuan Yang
Shashank Gupta
Kyle Richardson
Erin Bransom
Peter Clark
Ashish Sabharwal
Tushar Khot
ELM
LRM
49
10
0
11 Sep 2024
xLAM: A Family of Large Action Models to Empower AI Agent Systems
xLAM: A Family of Large Action Models to Empower AI Agent Systems
Jianguo Zhang
Tian Lan
Ming Zhu
Zuxin Liu
Thai Hoang
...
Juan Carlos Niebles
Shelby Heinecke
Huan Wang
Silvio Savarese
Caiming Xiong
ALM
41
33
0
05 Sep 2024
Tool-Assisted Agent on SQL Inspection and Refinement in Real-World
  Scenarios
Tool-Assisted Agent on SQL Inspection and Refinement in Real-World Scenarios
Zhongyuan Wang
Richong Zhang
Zhijie Nie
Jaein Kim
52
1
0
30 Aug 2024
HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon
  Agent Tasks with Large Language Model
HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model
Mengkang Hu
Tianxing Chen
Qiguang Chen
Yao Mu
Wenqi Shao
Ping Luo
LM&Ro
LLMAG
RALM
29
4
0
18 Aug 2024
What should I wear to a party in a Greek taverna? Evaluation for
  Conversational Agents in the Fashion Domain
What should I wear to a party in a Greek taverna? Evaluation for Conversational Agents in the Fashion Domain
Antonis Maronikolakis
Ana Peleteiro Ramallo
Weiwei Cheng
Thomas Kober
LLMAG
32
1
0
13 Aug 2024
123
Next