ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.16789
  4. Cited By
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world
  APIs

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

31 July 2023
Yujia Qin
Shi Liang
Yining Ye
Kunlun Zhu
Lan Yan
Ya-Ting Lu
Yankai Lin
Xin Cong
Xiangru Tang
Bill Qian
Sihan Zhao
Lauren Hong
Runchu Tian
Ruobing Xie
Jie Zhou
Mark B. Gerstein
Dahai Li
Zhiyuan Liu
Maosong Sun
    CLL
    ALM
    LLMAG
    ELM
    LM&MA
ArXivPDFHTML

Papers citing "ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs"

50 / 471 papers shown
Title
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
Junjie Ye
Zhengyin Du
Xuesong Yao
Weijian Lin
Yufei Xu
...
Siyu Yuan
Tao Gui
Qi Zhang
Xuanjing Huang
Jiecao Chen
54
0
0
05 Jan 2025
MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models
Mahir Labib Dihan
Md Tanvir Hassan
Md Tanvir Parvez
Md Hasebul Hasan
Md Almash Alam
Muhammad Aamir Cheema
Mohammed Eunus Ali
Md. Rizwan Parvez
LRM
ELM
43
1
0
03 Jan 2025
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
Hiroki Furuta
Yutaka Matsuo
Aleksandra Faust
Izzeddin Gur
CLL
92
14
0
03 Jan 2025
From Generalist to Specialist: A Survey of Large Language Models for Chemistry
From Generalist to Specialist: A Survey of Large Language Models for Chemistry
Yang Han
Ziping Wan
Lu Chen
Kai Yu
Xin Chen
LM&MA
35
1
0
31 Dec 2024
GAIS: A Novel Approach to Instance Selection with Graph Attention
  Networks
GAIS: A Novel Approach to Instance Selection with Graph Attention Networks
Zahiriddin Rustamov
Ayham Zaitouny
Rafat Damseh
Nazar Zaki
41
0
0
26 Dec 2024
LegalAgentBench: Evaluating LLM Agents in Legal Domain
LegalAgentBench: Evaluating LLM Agents in Legal Domain
Hao Li
Junjie Chen
Jingli Yang
Qingyao Ai
Wei Jia
...
Guozhi Yuan
Yiran Hu
Wuyue Wang
Yong-Jin Liu
Minlie Huang
LLMAG
AILaw
ELM
66
11
0
23 Dec 2024
The Task Shield: Enforcing Task Alignment to Defend Against Indirect
  Prompt Injection in LLM Agents
The Task Shield: Enforcing Task Alignment to Defend Against Indirect Prompt Injection in LLM Agents
Feiran Jia
Tong Wu
Xin Qin
Anna Squicciarini
LLMAG
AAML
86
4
0
21 Dec 2024
HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Device Scenarios
HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Device Scenarios
Jun Wang
Jiamu Zhou
Muning Wen
Xiaoyun Mo
Haifeng Zhang
...
Cheng Jin
Xihuai Wang
Weinan Zhang
Qiuying Peng
Jun Wang
LLMAG
101
0
0
21 Dec 2024
Tree-of-Code: A Tree-Structured Exploring Framework for End-to-End Code
  Generation and Execution in Complex Task Handling
Tree-of-Code: A Tree-Structured Exploring Framework for End-to-End Code Generation and Execution in Complex Task Handling
Ziyi Ni
Yifan Li
Ning Yang
Dou Shen
Pin Lv
Daxiang Dong
LRM
74
0
0
19 Dec 2024
Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model
  Fine-tuning
Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning
Ziang Ye
Z. Zhang
Yang Zhang
Jianxin Ma
Junyang Lin
Fuli Feng
LRM
85
0
0
19 Dec 2024
On the Structural Memory of LLM Agents
On the Structural Memory of LLM Agents
Ruihong Zeng
Jinyuan Fang
Siwei Liu
Zaiqiao Meng
LLMAG
KELM
89
4
0
17 Dec 2024
TrendSim: Simulating Trending Topics in Social Media Under Poisoning
  Attacks with LLM-based Multi-agent System
TrendSim: Simulating Trending Topics in Social Media Under Poisoning Attacks with LLM-based Multi-agent System
Zeyu Zhang
Jianxun Lian
Chen Ma
Yaning Qu
Ye Luo
...
X. Chen
Yankai Lin
Le Wu
Xing Xie
Ji-Rong Wen
LLMAG
AAML
70
3
0
14 Dec 2024
GraphTool-Instruction: Revolutionizing Graph Reasoning in LLMs through
  Decomposed Subtask Instruction
GraphTool-Instruction: Revolutionizing Graph Reasoning in LLMs through Decomposed Subtask Instruction
Rongzheng Wang
Shuang Liang
Qizhi Chen
Jiasheng Zhang
Ke Qin
87
0
0
11 Dec 2024
Action Engine: An LLM-based Framework for Automatic FaaS Workflow
  Generation
Action Engine: An LLM-based Framework for Automatic FaaS Workflow Generation
Akiharu Esashi
Pawissanutt Lertpongrujikorn
M. Salehi
77
0
0
29 Nov 2024
MAG-V: A Multi-Agent Framework for Synthetic Data Generation and Verification
MAG-V: A Multi-Agent Framework for Synthetic Data Generation and Verification
Saptarshi Sengupta
Kristal Curtis
Akshay Mallipeddi
Abhinav Mathur
Joseph Ross
Liang Gou
Liang Gou
LLMAG
SyDa
129
1
0
28 Nov 2024
LLM Augmentations to support Analytical Reasoning over Multiple
  Documents
LLM Augmentations to support Analytical Reasoning over Multiple Documents
Raquib Bin Yousuf
Nicholas Defelice
Mandar Sharma
Shengzhe Xu
Naren Ramakrishnan
61
2
0
25 Nov 2024
SRSA: A Cost-Efficient Strategy-Router Search Agent for Real-world
  Human-Machine Interactions
SRSA: A Cost-Efficient Strategy-Router Search Agent for Real-world Human-Machine Interactions
Yaqi Wang
Haipei Xu
LLMAG
74
0
0
21 Nov 2024
Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel
  Planning
Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning
Song Jiang
Da JU
Andrew Cohen
Sasha Mitts
Aaron Foss
Justine T Kao
Xian Li
Yuandong Tian
67
2
0
21 Nov 2024
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Davide Paglieri
Bartłomiej Cupiał
Samuel Coward
Ulyana Piterbarg
Maciej Wolczyk
...
Lerrel Pinto
Rob Fergus
Jakob Foerster
Jack Parker-Holder
Tim Rocktaschel
LLMAG
LRM
113
10
0
20 Nov 2024
PTR: Precision-Driven Tool Recommendation for Large Language Models
PTR: Precision-Driven Tool Recommendation for Large Language Models
Hang Gao
Yongfeng Zhang
KELM
46
0
0
14 Nov 2024
Foundations and Recent Trends in Multimodal Mobile Agents: A Survey
Foundations and Recent Trends in Multimodal Mobile Agents: A Survey
Biao Wu
Yanda Li
Meng Fang
Zirui Song
Zhiwei Zhang
Yunchao Wei
L. Chen
LM&Ro
LLMAG
OffRL
AI4TS
44
4
0
04 Nov 2024
DynaSaur: Large Language Agents Beyond Predefined Actions
DynaSaur: Large Language Agents Beyond Predefined Actions
Dang Nguyen
Viet Dac Lai
Seunghyun Yoon
Ryan Rossi
Handong Zhao
...
Nedim Lipka
Yu-Chiang Frank Wang
Trung H. Bui
Franck Dernoncourt
Dinesh Manocha
LM&Ro
ELM
LLMAG
54
6
0
04 Nov 2024
CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments
CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments
Kung-Hsiang Huang
Akshara Prabhakar
Sidharth Dhawan
Yixin Mao
Huan Wang
Silvio Savarese
Caiming Xiong
Philippe Laban
C. Wu
44
7
0
04 Nov 2024
EcoAct: Economic Agent Determines When to Register What Action
EcoAct: Economic Agent Determines When to Register What Action
Shaokun Zhang
Jieyu Zhang
Dujian Ding
Mirian Hipolito Garcia
Ankur Mallick
Daniel Madrigal
Menglin Xia
Victor Rühle
Qingyun Wu
Chi Wang
LLMAG
50
4
0
03 Nov 2024
Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation
Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation
Bohan Lyu
Yadi Cao
Duncan Watson-Parris
Leon Bergen
Taylor Berg-Kirkpatrick
Rose Yu
61
3
0
01 Nov 2024
Building Multi-Agent Copilot towards Autonomous Agricultural Data
  Management and Analysis
Building Multi-Agent Copilot towards Autonomous Agricultural Data Management and Analysis
Yu Pan
Jianxin Sun
Hongfeng Yu
Joe Luck
Geng Bai
Nipuna Chamara
Yufeng Ge
Tala Awada
43
0
0
31 Oct 2024
FATH: Authentication-based Test-time Defense against Indirect Prompt
  Injection Attacks
FATH: Authentication-based Test-time Defense against Indirect Prompt Injection Attacks
Jiongxiao Wang
Fangzhou Wu
Wendi Li
Jinsheng Pan
Edward Suh
Zhuoqing Mao
Muhao Chen
Chaowei Xiao
AAML
40
6
0
28 Oct 2024
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with
  Annual Updates
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates
Hexuan Deng
Wenxiang Jiao
Xuebo Liu
Min Zhang
Zhaopeng Tu
40
3
0
28 Oct 2024
AutoBench-V: Can Large Vision-Language Models Benchmark Themselves?
AutoBench-V: Can Large Vision-Language Models Benchmark Themselves?
Han Bao
Yue Huang
Yanbo Wang
Jiayi Ye
Xiangqi Wang
Xiuying Chen
Mohamed Elhoseiny
Xuzhi Zhang
Mohamed Elhoseiny
Xiangliang Zhang
47
7
0
28 Oct 2024
AgentSense: Benchmarking Social Intelligence of Language Agents through
  Interactive Scenarios
AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios
Xinyi Mou
Jingcong Liang
Jiayu Lin
Xuzhi Zhang
Xiawei Liu
...
Rong Ye
Lei Chen
Haoyu Kuang
Xuanjing Huang
Zhongyu Wei
31
8
0
25 Oct 2024
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized
  Generalist Computer Assistant
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant
Chengyou Jia
Minnan Luo
Zhuohang Dang
Qiushi Sun
Fangzhi Xu
Junlin Hu
Tianbao Xie
Zhiyong Wu
LLMAG
31
6
0
24 Oct 2024
Improving Model Factuality with Fine-grained Critique-based Evaluator
Improving Model Factuality with Fine-grained Critique-based Evaluator
Yiqing Xie
Wenxuan Zhou
Pradyot Prakash
Di Jin
Yuning Mao
...
Sinong Wang
Han Fang
Carolyn Rose
Daniel Fried
Hejia Zhang
HILM
33
6
0
24 Oct 2024
Beyond Browsing: API-Based Web Agents
Beyond Browsing: API-Based Web Agents
Yueqi Song
Frank F. Xu
Shuyan Zhou
Graham Neubig
58
15
0
21 Oct 2024
Assistive AI for Augmenting Human Decision-making
Assistive AI for Augmenting Human Decision-making
Natabara Máté Gyöngyössy
Bernát Török
Csilla Farkas
Laura Lucaj
Attila Menyhárd
Krisztina Menyhárd-Balázs
András Simonyi
Patrick van der Smagt
Zsolt Ződi
András Lőrincz
39
0
0
18 Oct 2024
A Comparative Study on Reasoning Patterns of OpenAI's o1 Model
A Comparative Study on Reasoning Patterns of OpenAI's o1 Model
Siwei Wu
Zhongyuan Peng
Xinrun Du
Tuney Zheng
Minghao Liu
...
Zhaoxiang Zhang
Wenhao Huang
Ge Zhang
Chenghua Lin
J. H. Liu
ELM
LLMAG
LRM
AI4CE
37
30
0
17 Oct 2024
Agent Skill Acquisition for Large Language Models via CycleQD
Agent Skill Acquisition for Large Language Models via CycleQD
So Kuroki
Taishi Nakamura
Takuya Akiba
Yujin Tang
MoMe
34
0
0
16 Oct 2024
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning
Mingyang Chen
Haoze Sun
Tianpeng Li
Fan Yang
Hao Liang
Keer Lu
Bin Cui
Wentao Zhang
Zenan Zhou
Weipeng Chen
LRM
49
5
0
16 Oct 2024
Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming
Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming
Yilun Hao
Yang Zhang
Chuchu Fan
LLMAG
49
11
0
15 Oct 2024
Toolken+: Improving LLM Tool Usage with Reranking and a Reject Option
Toolken+: Improving LLM Tool Usage with Reranking and a Reject Option
Konstantin Yakovlev
Sergey I. Nikolenko
A. Bout
21
0
0
15 Oct 2024
VidEgoThink: Assessing Egocentric Video Understanding Capabilities for
  Embodied AI
VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI
Sijie Cheng
Kechen Fang
Yangyang Yu
Sicheng Zhou
Yangqiu Song
Ye Tian
Tingguang Li
Lei Han
Yang Liu
51
8
0
15 Oct 2024
Balancing Continuous Pre-Training and Instruction Fine-Tuning:
  Optimizing Instruction-Following in LLMs
Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs
Ishan Jindal
Chandana Badrinath
Pranjal Bharti
Lakkidi Vinay
Sachin Dev Sharma
CLL
ALM
31
2
0
14 Oct 2024
Agentic Information Retrieval
Agentic Information Retrieval
Weinan Zhang
Junwei Liao
Ning Li
Kounianhua Du
Jianghao Lin
AIFin
49
2
0
13 Oct 2024
CAMPHOR: Collaborative Agents for Multi-input Planning and High-Order
  Reasoning On Device
CAMPHOR: Collaborative Agents for Multi-input Planning and High-Order Reasoning On Device
Yicheng Fu
R. Anantha
Jianpeng Cheng
LRM
LLMAG
28
2
0
12 Oct 2024
DAWN: Designing Distributed Agents in a Worldwide Network
DAWN: Designing Distributed Agents in a Worldwide Network
Zahra Aminiranjbar
Jianan Tang
Qiudan Wang
Shubha Pant
Mahesh Viswanathan
LLMAG
AI4CE
28
2
0
11 Oct 2024
QEFT: Quantization for Efficient Fine-Tuning of LLMs
QEFT: Quantization for Efficient Fine-Tuning of LLMs
Changhun Lee
Jun-gyu Jin
Younghyun Cho
Eunhyeok Park
MQ
42
1
0
11 Oct 2024
VoxelPrompt: A Vision-Language Agent for Grounded Medical Image Analysis
VoxelPrompt: A Vision-Language Agent for Grounded Medical Image Analysis
Andrew Hoopes
V. Butoi
John Guttag
Adrian V. Dalca
MedIm
LM&MA
35
1
0
10 Oct 2024
AppBench: Planning of Multiple APIs from Various APPs for Complex User
  Instruction
AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction
Hongru Wang
Rui Wang
Boyang Xue
Heming Xia
Jingtao Cao
Zeming Liu
Jeff Z. Pan
Kam-Fai Wong
ALM
40
8
0
10 Oct 2024
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
Changle Qu
Sunhao Dai
Xiaochi Wei
Hengyi Cai
Shuaiqiang Wang
Dawei Yin
Jun Xu
Ji-Rong Wen
60
9
0
10 Oct 2024
PositionID: LLMs can Control Lengths, Copy and Paste with Explicit
  Positional Awareness
PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness
Zekun Wang
Feiyu Duan
Yibo Zhang
Wangchunshu Zhou
Ke Xu
Wenhao Huang
Jie Fu
LLMAG
26
1
0
09 Oct 2024
AutoFeedback: An LLM-based Framework for Efficient and Accurate API
  Request Generation
AutoFeedback: An LLM-based Framework for Efficient and Accurate API Request Generation
Huanxi Liu
Jiaqi Liao
Dawei Feng
Kele Xu
Huaimin Wang
118
0
0
09 Oct 2024
Previous
123456...8910
Next