ResearchTrend.AI
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

31 July 2023
Yujia Qin
Shi Liang
Yining Ye
Kunlun Zhu
Lan Yan
Ya-Ting Lu
Yankai Lin
Xin Cong
Xiangru Tang
Bill Qian
Sihan Zhao
Lauren Hong
Runchu Tian
Ruobing Xie
Jie Zhou
Mark B. Gerstein
Dahai Li
Zhiyuan Liu
Maosong Sun
    CLL
    ALM
    LLMAG
    ELM
    LM&MA
arXiv: 2307.16789

Papers citing "ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs"

50 / 471 papers shown
Title
Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation
Chun-Yi Kuan
Chih-Kai Yang
Wei-Ping Huang
Ke-Han Lu
Hung-yi Lee
49
7
0
13 Jul 2024
On Mitigating Code LLM Hallucinations with API Documentation
Nihal Jain
Robert Kwiatkowski
Baishakhi Ray
M. K. Ramanathan
Varun Kumar
41
7
0
13 Jul 2024
WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment
Jiefu Ou
Arda Uzunoglu
Benjamin Van Durme
Daniel Khashabi
LM&Ro
VGen
30
3
0
10 Jul 2024
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
Yonghong Tian
Wenqi Shao
Peng Xu
Jiahao Wang
Peng Gao
Kaipeng Zhang
Ping Luo
MQ
46
26
0
10 Jul 2024
Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models
Logan Cross
Violet Xiang
Agam Bhatia
Daniel L. K. Yamins
Nick Haber
LM&Ro
LRM
LLMAG
48
4
0
09 Jul 2024
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
Weize Chen
Ziming You
Ran Li
Yitong Guan
Chen Qian
Chenyang Zhao
Cheng Yang
Ruobing Xie
Zhiyuan Liu
Maosong Sun
LLMAG
45
36
0
09 Jul 2024
Achieving Tool Calling Functionality in LLMs Using Only Prompt Engineering Without Fine-Tuning
Shengtao He
LLMAG
23
1
0
06 Jul 2024
Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning
Eric Pasewark
Kyle Montgomery
Kefei Duan
Dawn Song
Chenguang Wang
LRM
CLL
ReLM
41
1
0
05 Jul 2024
DSLR: Document Refinement with Sentence-Level Re-ranking and Reconstruction to Enhance Retrieval-Augmented Generation
Taeho Hwang
Soyeong Jeong
Sukmin Cho
SeungYoon Han
Jong C. Park
RALM
38
1
0
04 Jul 2024
What Affects the Stability of Tool Learning? An Empirical Study on the Robustness of Tool Learning Frameworks
Chengrui Huang
Zhengliang Shi
Yuntao Wen
Xiuying Chen
Peng Han
Shen Gao
Shuo Shang
44
1
0
03 Jul 2024
WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models
Kangyun Ning
Yisong Su
Xueqiang Lv
Yuanzhe Zhang
Jian Liu
Kang Liu
Jinan Xu
ELM
LLMAG
44
2
0
02 Jul 2024
Concise and Precise Context Compression for Tool-Using Language Models
Yang Xu
Yunlong Feng
Honglin Mu
Yutai Hou
Yitong Li
...
Zhongyang Li
Dandan Tu
Qingfu Zhu
Hao Fei
Wanxiang Che
LLMAG
24
3
0
02 Jul 2024
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
Shihan Deng
Weikai Xu
Hongda Sun
Wei Liu
Tao Tan
...
Ang Li
Jian Luan
Bin Wang
Rui Yan
Shuo Shang
LLMAG
44
7
0
01 Jul 2024
Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
Sujan Dutta
Sayantan Mahinder
R. Anantha
Bortik Bandyopadhyay
ALM
39
4
0
28 Jun 2024
Simulating Financial Market via Large Language Model based Agents
Shen Gao
Yuntao Wen
Minghang Zhu
Jianing Wei
Yuhan Cheng
Qunzi Zhang
Shuo Shang
AIFin
34
12
0
28 Jun 2024
From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis
Chuanqi Cheng
Jian Guan
Wei Wu
Rui Yan
LRM
45
10
0
28 Jun 2024
ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents
Haiyang Shen
Yue Li
Desong Meng
Dongqi Cai
Sheng Qi
Li Zhang
Mengwei Xu
Yun Ma
LLMAG
46
9
0
28 Jun 2024
Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks
Ibrahim Abdelaziz
Kinjal Basu
Mayank Agarwal
Sadhana Kumaravel
Matthew Stallone
...
Merve Unuvar
David D. Cox
Salim Roukos
Luis Lastras
Pavan Kapanipathi
LLMAG
34
21
0
27 Jun 2024
Tools Fail: Detecting Silent Errors in Faulty Tools
Jimin Sun
So Yeon Min
Yingshan Chang
Yonatan Bisk
37
5
0
27 Jun 2024
APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
Zuxin Liu
Thai Hoang
Jianguo Zhang
Ming Zhu
Tian Lan
...
Silvio Savarese
Juan Carlos Niebles
Huan Wang
Shelby Heinecke
Caiming Xiong
53
46
0
26 Jun 2024
Enhancing Tool Retrieval with Iterative Feedback from Large Language Models
Qiancheng Xu
Yongqi Li
Heming Xia
Wenjie Li
KELM
42
4
0
25 Jun 2024
CogMG: Collaborative Augmentation Between Large Language Model and Knowledge Graph
Tong Zhou
Yubo Chen
Kang Liu
Jun Zhao
HILM
RALM
37
1
0
25 Jun 2024
Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models
Tianyi Men
Pengfei Cao
Zhuoran Jin
Yubo Chen
Kang Liu
Jun Zhao
LLMAG
AIFin
28
4
0
23 Jun 2024
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning
Brandon Huang
Chancharik Mitra
Assaf Arbelle
Leonid Karlinsky
Trevor Darrell
Roei Herzig
49
13
0
21 Jun 2024
FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
Ruixuan Xiao
Wentao Ma
Ke Wang
Yuchuan Wu
Junbo Zhao
Haobo Wang
Fei Huang
Yongbin Li
40
8
0
21 Jun 2024
Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs
Junjie Wang
Mingyang Chen
Binbin Hu
Dan Yang
Ziqi Liu
...
Jinjie Gu
Jun Zhou
Jeff Z. Pan
Wen Zhang
Huajun Chen
RALM
43
12
0
20 Jun 2024
Step-Back Profiling: Distilling User History for Personalized Scientific Writing
Xiangru Tang
Xingyao Zhang
Yanjun Shao
Jie Wu
Yilun Zhao
Arman Cohan
Ming Gong
Dongmei Zhang
Mark B. Gerstein
50
2
0
20 Jun 2024
AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents
Edoardo Debenedetti
Jie Zhang
Mislav Balunović
Luca Beurer-Kellner
Marc Fischer
Florian Tramèr
LLMAG
AAML
59
27
1
19 Jun 2024
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts
Honghua Dong
Qidong Su
Yubo Gao
Zhaoyu Li
Yangjun Ruan
Gennady Pekhimenko
Chris J. Maddison
Xujie Si
LLMAG
34
1
0
19 Jun 2024
Vernacular? I Barely Know Her: Challenges with Style Control and Stereotyping
Ankit Aich
Tingting Liu
Salvatore Giorgi
Kelsey Isman
Lyle Ungar
Brenda L. Curtis
48
2
0
18 Jun 2024
Ask-before-Plan: Proactive Language Agents for Real-World Planning
Xuan Zhang
Yang Deng
Zifeng Ren
See-Kiong Ng
Tat-Seng Chua
LLMAG
LM&Ro
28
14
0
18 Jun 2024
Automatic benchmarking of large multimodal models via iterative experiment programming
Alessandro Conti
Enrico Fini
Paolo Rota
Yiming Wang
Massimiliano Mancini
Elisa Ricci
46
0
0
18 Jun 2024
CodeNav: Beyond tool-use to using real-world codebases with LLM agents
Tanmay Gupta
Luca Weihs
Aniruddha Kembhavi
LLMAG
ELM
61
1
0
18 Jun 2024
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
Seungbin Yang
ChaeHun Park
Taehee Kim
Jaegul Choo
46
2
0
18 Jun 2024
MedCalc-Bench: Evaluating Large Language Models for Medical Calculations
Nikhil Khandekar
Qiao Jin
Guangzhi Xiong
Soren Dunn
Serina S Applebaum
...
Amisha D. Dave
Andrew Taylor
Aidong Zhang
Qingyu Chen
Zhiyong Lu
LM&MA
ELM
31
6
0
17 Jun 2024
Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models
Fangzhi Xu
Qiushi Sun
Kanzhi Cheng
Xiaozhong Liu
Yu Qiao
Zhiyong Wu
LLMAG
41
5
0
17 Jun 2024
Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs
Krista Opsahl-Ong
Michael J Ryan
Josh Purtell
David Broman
Christopher Potts
Matei A. Zaharia
Omar Khattab
43
26
0
17 Jun 2024
RoboGolf: Mastering Real-World Minigolf with a Reflective Multi-Modality Vision-Language Model
Hantao Zhou
Tianying Ji
Lukas Sommerhalder
Michael Goerner
Norman Hendrich
Jianwei Zhang
Fuchun Sun
Huazhe Xu
50
0
0
14 Jun 2024
On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey
Lin Long
Rui Wang
Ruixuan Xiao
Junbo Zhao
Xiao Ding
Gang Chen
Haobo Wang
SyDa
59
93
0
14 Jun 2024
Multi-Agent Software Development through Cross-Team Collaboration
Zhuoyun Du
Chen Qian
Wei Liu
Zihao Xie
Yifei Wang
Yufan Dang
Weize Chen
Cheng Yang
LLMAG
46
19
0
13 Jun 2024
An Approach to Build Zero-Shot Slot-Filling System for Industry-Grade Conversational Assistants
G P Shrivatsa Bhargav
S. Neelam
Udit Sharma
S. Ikbal
Dheeraj Sreedhar
...
Dinesh Garg
Kyle Croutwater
Haode Qi
Eric Wayne
J. William Murdock
29
1
0
13 Jun 2024
Scaling Large Language Model-based Multi-Agent Collaboration
Chen Qian
Zihao Xie
YiFei Wang
Wei Liu
Yufan Dang
...
Zhuoyun Du
Weize Chen
Cheng Yang
Zhiyuan Liu
Maosong Sun
AI4CE
LLMAG
LM&Ro
66
46
0
11 Jun 2024
Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees
Sijia Chen
Yibo Wang
Yi-Feng Wu
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
Lijun Zhang
LLMAG
LRM
50
10
0
11 Jun 2024
LLM-dCache: Improving Tool-Augmented LLMs with GPT-Driven Localized Data Caching
Simranjit Singh
Michael Fore
Andreas Karatzas
Chaehong Lee
Yanan Jian
Longfei Shangguan
Fuxun Yu
Iraklis Anagnostopoulos
Dimitrios Stamoulis
RALM
35
2
0
10 Jun 2024
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
Joongwon Kim
Bhargavi Paranjape
Tushar Khot
Hannaneh Hajishirzi
LM&Ro
ELM
LLMAG
LRM
46
9
0
10 Jun 2024
Towards Lifelong Learning of Large Language Models: A Survey
Junhao Zheng
Shengjie Qiu
Chengming Shi
Qianli Ma
KELM
CLL
30
15
0
10 Jun 2024
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Seungone Kim
Juyoung Suk
Ji Yong Cho
Shayne Longpre
Chaeeun Kim
...
Sean Welleck
Graham Neubig
Moontae Lee
Kyungjae Lee
Minjoon Seo
ELM
ALM
LM&MA
105
31
0
09 Jun 2024
MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter
Jitai Hao
Weiwei Sun
Xin Xin
Qi Meng
Zhumin Chen
Pengjie Ren
Zhaochun Ren
MoE
42
2
0
07 Jun 2024
A Survey of Language-Based Communication in Robotics
William Hunt
Sarvapali D. Ramchurn
Mohammad D. Soorati
LM&Ro
65
12
0
06 Jun 2024
Tool-Planner: Task Planning with Clusters across Multiple Tools
Yanming Liu
Xinyue Peng
Jiannan Cao
Xuhong Zhang
Sheng Cheng
Xun Wang
Jianwei Yin
Tianyu Du
LLMAG
37
3
0
06 Jun 2024