ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.15334
  4. Cited By
Gorilla: Large Language Model Connected with Massive APIs

Gorilla: Large Language Model Connected with Massive APIs

24 May 2023
Shishir G. Patil
Tianjun Zhang
Xin Wang
Joseph E. Gonzalez
    ELM
    CLL
    ALM
    SyDa
ArXivPDFHTML

Papers citing "Gorilla: Large Language Model Connected with Massive APIs"

50 / 392 papers shown
Title
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Zehui Chen
Kuikun Liu
Qiuchen Wang
Jiangning Liu
Wenwei Zhang
Kai Chen
Feng Zhao
LLMAG
78
20
0
29 Jul 2024
MathViz-E: A Case-study in Domain-Specialized Tool-Using Agents
MathViz-E: A Case-study in Domain-Specialized Tool-Using Agents
Arya Bulusu
Brandon Man
Ashish Jagmohan
Aditya Vempaty
Jennifer Mari-Wyka
Deepak Akkil
41
2
0
24 Jul 2024
Operationalizing a Threat Model for Red-Teaming Large Language Models
  (LLMs)
Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs)
Apurv Verma
Satyapriya Krishna
Sebastian Gehrmann
Madhavan Seshadri
Anu Pradhan
Tom Ault
Leslie Barrett
David Rabinowitz
John Doucette
Nhathai Phan
59
10
0
20 Jul 2024
On Pre-training of Multimodal Language Models Customized for Chart
  Understanding
On Pre-training of Multimodal Language Models Customized for Chart Understanding
Wan-Cyuan Fan
Yen-Chun Chen
Mengchen Liu
Lu Yuan
Leonid Sigal
48
5
0
19 Jul 2024
Agent-E: From Autonomous Web Navigation to Foundational Design
  Principles in Agentic Systems
Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems
Tamer Abuelsaad
Deepak Akkil
Prasenjit Dey
Ashish Jagmohan
Aditya Vempaty
Ravi Kokku
46
23
0
17 Jul 2024
Speech-Copilot: Leveraging Large Language Models for Speech Processing
  via Task Decomposition, Modularization, and Program Generation
Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation
Chun-Yi Kuan
Chih-Kai Yang
Wei-Ping Huang
Ke-Han Lu
Hung-yi Lee
55
7
0
13 Jul 2024
On Mitigating Code LLM Hallucinations with API Documentation
On Mitigating Code LLM Hallucinations with API Documentation
Nihal Jain
Robert Kwiatkowski
Baishakhi Ray
M. K. Ramanathan
Varun Kumar
41
7
0
13 Jul 2024
Large Language Models as Biomedical Hypothesis Generators: A
  Comprehensive Evaluation
Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
Biqing Qi
Kaiyan Zhang
Kai Tian
Haoxiang Li
Zhang-Ren Chen
Sihang Zeng
Ermo Hua
Hu Jinfang
Bowen Zhou
LM&MA
40
11
0
12 Jul 2024
WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment
WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment
Jiefu Ou
Arda Uzunoglu
Benjamin Van Durme
Daniel Khashabi
LM&Ro
VGen
35
3
0
10 Jul 2024
Optimal Decision Making Through Scenario Simulations Using Large
  Language Models
Optimal Decision Making Through Scenario Simulations Using Large Language Models
Sumedh Rasal
E. Hauer
50
1
0
09 Jul 2024
LLMBox: A Comprehensive Library for Large Language Models
LLMBox: A Comprehensive Library for Large Language Models
Tianyi Tang
Yiwen Hu
Bingqian Li
Wenyang Luo
Zijing Qin
...
Chunxuan Xia
Junyi Li
Kun Zhou
Wayne Xin Zhao
Ji-Rong Wen
50
1
0
08 Jul 2024
What Affects the Stability of Tool Learning? An Empirical Study on the
  Robustness of Tool Learning Frameworks
What Affects the Stability of Tool Learning? An Empirical Study on the Robustness of Tool Learning Frameworks
Chengrui Huang
Zhengliang Shi
Yuntao Wen
Xiuying Chen
Peng Han
Shen Gao
Shuo Shang
47
1
0
03 Jul 2024
WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large
  Language Models
WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models
Kangyun Ning
Yisong Su
Xueqiang Lv
Yuanzhe Zhang
Jian Liu
Kang Liu
Jinan Xu
ELM
LLMAG
48
2
0
02 Jul 2024
Concise and Precise Context Compression for Tool-Using Language Models
Concise and Precise Context Compression for Tool-Using Language Models
Yang Xu
Yunlong Feng
Honglin Mu
Yutai Hou
Yitong Li
...
Zhongyang Li
Dandan Tu
Qingfu Zhu
Hao Fei
Wanxiang Che
LLMAG
24
3
0
02 Jul 2024
Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
Sujan Dutta
Sayantan Mahinder
R. Anantha
Bortik Bandyopadhyay
ALM
39
4
0
28 Jun 2024
BMW Agents -- A Framework For Task Automation Through Multi-Agent
  Collaboration
BMW Agents -- A Framework For Task Automation Through Multi-Agent Collaboration
Noel Crawford
Edward B. Duffy
Iman Evazzade
Torsten Foehr
Gregory Robbins
D. K. Saha
Jiya Varma
Marcin Ziolkowski
LLMAG
52
3
0
28 Jun 2024
ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents
ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents
Haiyang Shen
Yue Li
Desong Meng
Dongqi Cai
Sheng Qi
Li Zhang
Mengwei Xu
Yun Ma
LLMAG
51
10
0
28 Jun 2024
Granite-Function Calling Model: Introducing Function Calling Abilities
  via Multi-task Learning of Granular Tasks
Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks
Ibrahim Abdelaziz
Kinjal Basu
Mayank Agarwal
Yara Rizk
Matthew Stallone
...
Merve Unuvar
David D. Cox
Salim Roukos
Luis Lastras
Pavan Kapanipathi
LLMAG
34
21
0
27 Jun 2024
APIGen: Automated Pipeline for Generating Verifiable and Diverse
  Function-Calling Datasets
APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
Zuxin Liu
Thai Hoang
Jianguo Zhang
Ming Zhu
Tian Lan
...
Silvio Savarese
Juan Carlos Niebles
Huan Wang
Shelby Heinecke
Caiming Xiong
55
46
0
26 Jun 2024
Enhancing Tool Retrieval with Iterative Feedback from Large Language
  Models
Enhancing Tool Retrieval with Iterative Feedback from Large Language Models
Qiancheng Xu
Yongqi Li
Heming Xia
Wenjie Li
KELM
42
4
0
25 Jun 2024
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Terry Yue Zhuo
Minh Chien Vu
Jenny Chim
Han Hu
Wenhao Yu
...
David Lo
Daniel Fried
Xiaoning Du
H. D. Vries
Leandro von Werra
77
140
0
22 Jun 2024
FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for
  LLM-based Agents
FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
Ruixuan Xiao
Wentao Ma
Ke Wang
Yuchuan Wu
Junbo Zhao
Haobo Wang
Fei Huang
Yongbin Li
47
9
0
21 Jun 2024
AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for
  LLM Agents
AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents
Edoardo Debenedetti
Jie Zhang
Mislav Balunović
Luca Beurer-Kellner
Marc Fischer
Florian Tramèr
LLMAG
AAML
59
28
1
19 Jun 2024
APPL: A Prompt Programming Language for Harmonious Integration of
  Programs and Large Language Model Prompts
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts
Honghua Dong
Qidong Su
Yubo Gao
Zhaoyu Li
Yangjun Ruan
Gennady Pekhimenko
Chris J. Maddison
Xujie Si
LLMAG
34
1
0
19 Jun 2024
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
Seungbin Yang
chaeHun Park
Taehee Kim
Jaegul Choo
46
2
0
18 Jun 2024
MASAI: Modular Architecture for Software-engineering AI Agents
MASAI: Modular Architecture for Software-engineering AI Agents
Daman Arora
Atharv Sonwane
Nalin Wadhwa
Abhav Mehrotra
Saiteja Utpala
Ramakrishna Bairi
Aditya Kanade
Nagarajan Natarajan
LLMAG
51
15
0
17 Jun 2024
TorchOpera: A Compound AI System for LLM Safety
TorchOpera: A Compound AI System for LLM Safety
Shanshan Han
Yuhang Yao
Zijian Hu
Dimitris Stripelis
Zhaozhuo Xu
Chaoyang He
LLMAG
44
0
0
16 Jun 2024
Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees
Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees
Sijia Chen
Yibo Wang
Yi-Feng Wu
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
Lijun Zhang
LLMAG
LRM
50
11
0
11 Jun 2024
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
Joongwon Kim
Bhargavi Paranjape
Tushar Khot
Hannaneh Hajishirzi
LM&Ro
ELM
LLMAG
LRM
46
9
0
10 Jun 2024
Towards Lifelong Learning of Large Language Models: A Survey
Towards Lifelong Learning of Large Language Models: A Survey
Junhao Zheng
Shengjie Qiu
Chengming Shi
Qianli Ma
KELM
CLL
30
15
0
10 Jun 2024
AICoderEval: Improving AI Domain Code Generation of Large Language
  Models
AICoderEval: Improving AI Domain Code Generation of Large Language Models
Yinghui Xia
Yuyan Chen
Tianyu Shi
Jun Wang
Jinsong Yang
34
3
0
07 Jun 2024
TACT: Advancing Complex Aggregative Reasoning with Information
  Extraction Tools
TACT: Advancing Complex Aggregative Reasoning with Information Extraction Tools
Avi Caciularu
Alon Jacovi
Eyal Ben-David
Sasha Goldshtein
Tal Schuster
Jonathan Herzig
G. Elidan
Amir Globerson
LMTD
53
3
0
05 Jun 2024
A Survey of Useful LLM Evaluation
A Survey of Useful LLM Evaluation
Ji-Lun Peng
Sijia Cheng
Egil Diau
Yung-Yu Shih
Po-Heng Chen
Yen-Ting Lin
Yun-Nung Chen
LLMAG
ELM
34
12
0
03 Jun 2024
VQA Training Sets are Self-play Environments for Generating Few-shot
  Pools
VQA Training Sets are Self-play Environments for Generating Few-shot Pools
Tautvydas Misiunas
Hassan Mansoor
Jasper Uijlings
Oriana Riva
Victor Carbune
LRM
VLM
35
0
0
30 May 2024
Grade Like a Human: Rethinking Automated Assessment with Large Language
  Models
Grade Like a Human: Rethinking Automated Assessment with Large Language Models
Wenjing Xie
Juxin Niu
Chun Jason Xue
Nan Guan
AI4Ed
44
3
0
30 May 2024
Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning Code LLMs
Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning Code LLMs
Zichao Hu
Junyi Jessy Li
Arjun Guha
Joydeep Biswas
SyDa
ALM
51
1
0
30 May 2024
A Human-Like Reasoning Framework for Multi-Phases Planning Task with
  Large Language Models
A Human-Like Reasoning Framework for Multi-Phases Planning Task with Large Language Models
Chengxing Xie
Difan Zou
LRM
LLMAG
43
4
0
28 May 2024
Tool Learning with Large Language Models: A Survey
Tool Learning with Large Language Models: A Survey
Changle Qu
Sunhao Dai
Xiaochi Wei
Hengyi Cai
Shuaiqiang Wang
Dawei Yin
Jun Xu
Jirong Wen
LLMAG
34
83
0
28 May 2024
TEII: Think, Explain, Interact and Iterate with Large Language Models to
  Solve Cross-lingual Emotion Detection
TEII: Think, Explain, Interact and Iterate with Large Language Models to Solve Cross-lingual Emotion Detection
Long Cheng
Qihao Shao
Christine Zhao
Sheng Bi
Gina-Anne Levow
19
5
0
27 May 2024
Tool Learning in the Wild: Empowering Language Models as Automatic Tool Agents
Tool Learning in the Wild: Empowering Language Models as Automatic Tool Agents
Zhengliang Shi
Shen Gao
Xiuyi Chen
Yue Feng
Lingyong Yan
Haibo Shi
Dawei Yin
Zhumin Chen
Suzan Verberne
LLMAG
47
15
0
26 May 2024
Small Language Models for Application Interactions: A Case Study
Small Language Models for Application Interactions: A Case Study
Beibin Li
Yi Zhang
Sébastien Bubeck
Jeevan Pathuri
Ishai Menache
42
4
0
23 May 2024
A Declarative System for Optimizing AI Workloads
A Declarative System for Optimizing AI Workloads
Chunwei Liu
Matthew Russo
Michael Cafarella
Lei Cao
Peter Baille Chen
Zui Chen
Michael Franklin
Tim Kraska
Samuel Madden
Gerardo Vitagliano
47
21
0
23 May 2024
Agent Planning with World Knowledge Model
Agent Planning with World Knowledge Model
Shuofei Qiao
Runnan Fang
Ningyu Zhang
Yuqi Zhu
Xiang Chen
Shumin Deng
Yong-jia Jiang
Pengjun Xie
Fei Huang
Huajun Chen
LLMAG
LM&Ro
97
15
0
23 May 2024
LLM+Reasoning+Planning for Supporting Incomplete User Queries in Presence of APIs
LLM+Reasoning+Planning for Supporting Incomplete User Queries in Presence of APIs
Sudhir Agarwal
A. Sreepathy
David H. Alonso
Prarit Lamba
LRM
60
1
0
21 May 2024
KG-RAG: Bridging the Gap Between Knowledge and Creativity
KG-RAG: Bridging the Gap Between Knowledge and Creativity
Diego Sanmartin
RALM
47
36
0
20 May 2024
MHPP: Exploring the Capabilities and Limitations of Language Models
  Beyond Basic Code Generation
MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation
Jianbo Dai
Jianqiao Lu
Yunlong Feng
Rongju Ruan
Ming Cheng
Haochen Tan
Zhijiang Guo
ELM
LRM
44
12
0
19 May 2024
Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and
  Detailed Benchmark
Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark
Mengsong Wu
Tong Zhu
Han Han
Chuanyuan Tan
Xiang Zhang
Wenliang Chen
25
17
0
14 May 2024
Smurfs: Leveraging Multiple Proficiency Agents with Context-Efficiency
  for Tool Planning
Smurfs: Leveraging Multiple Proficiency Agents with Context-Efficiency for Tool Planning
Junzhi Chen
Juhao Liang
Benyou Wang
LLMAG
28
3
0
09 May 2024
WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace
  Setting
WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting
Olly Styles
Sam Miller
Patricio Cerda-Mardini
T. Guha
Victor Sanchez
Bertie Vidgen
LLMAG
41
3
0
01 May 2024
Mixture-of-Instructions: Aligning Large Language Models via Mixture Prompting
Mixture-of-Instructions: Aligning Large Language Models via Mixture Prompting
Bowen Xu
Shaoyu Wu
Kai Liu
Lulu Hu
39
2
0
29 Apr 2024
Previous
12345678
Next