ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.15334
  4. Cited By
Gorilla: Large Language Model Connected with Massive APIs

Gorilla: Large Language Model Connected with Massive APIs

24 May 2023
Shishir G. Patil
Tianjun Zhang
Xin Wang
Joseph E. Gonzalez
    ELMCLLALMSyDa
ArXiv (abs)PDFHTML

Papers citing "Gorilla: Large Language Model Connected with Massive APIs"

50 / 413 papers shown
Title
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents
H. Zhang
Jingyuan Huang
Kai Mei
Yifei Yao
Zhenting Wang
Chenlu Zhan
Hongwei Wang
Yongfeng Zhang
AAMLLLMAGELM
214
40
0
03 Oct 2024
Moral Alignment for LLM Agents
Moral Alignment for LLM Agents
Elizaveta Tennant
Stephen Hailes
Mirco Musolesi
132
8
0
02 Oct 2024
A Survey on Complex Tasks for Goal-Directed Interactive Agents
A Survey on Complex Tasks for Goal-Directed Interactive Agents
Mareike Hartmann
Alexander Koller
LM&RoLLMAG
72
1
0
27 Sep 2024
Data Analysis in the Era of Generative AI
Data Analysis in the Era of Generative AI
J. Inala
Chenglong Wang
Steven Drucker
Gonzalo Ramos
Victor C. Dibia
N. Riche
Dave Brown
Dan Marshall
Jianfeng Gao
99
9
0
27 Sep 2024
Quality Matters: Evaluating Synthetic Data for Tool-Using LLMs
Quality Matters: Evaluating Synthetic Data for Tool-Using LLMs
Shadi Iskander
Nachshon Cohen
Zohar Karnin
Ori Shapira
Sofia Tolmach
SyDa
38
3
0
24 Sep 2024
MOSS: Enabling Code-Driven Evolution and Context Management for AI
  Agents
MOSS: Enabling Code-Driven Evolution and Context Management for AI Agents
Ming Zhu
Yi Zhou
97
2
0
24 Sep 2024
LLM With Tools: A Survey
LLM With Tools: A Survey
Zhuocheng Shen
85
14
0
24 Sep 2024
Automated test generation to evaluate tool-augmented LLMs as
  conversational AI agents
Automated test generation to evaluate tool-augmented LLMs as conversational AI agents
Samuel Arcadinho
David Aparicio
Mariana Almeida
131
7
0
24 Sep 2024
SEAL: Suite for Evaluating API-use of LLMs
SEAL: Suite for Evaluating API-use of LLMs
Woojeong Kim
Ashish Jagmohan
Aditya Vempaty
ELMALMLLMAG
93
1
0
23 Sep 2024
ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions
  with Path Planning and Feedback
ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback
Qinzhuo Wu
Wei Liu
Jian Luan
Bin Wang
111
11
0
23 Sep 2024
EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language
  Models
EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models
Hossein Rajabzadeh
A. Jafari
Aman Sharma
Benyamin Jami
Hyock Ju Kwon
Ali Ghodsi
Boxing Chen
Mehdi Rezagholizadeh
63
0
0
22 Sep 2024
LLM-Agent-UMF: LLM-based Agent Unified Modeling Framework for Seamless
  Integration of Multi Active/Passive Core-Agents
LLM-Agent-UMF: LLM-based Agent Unified Modeling Framework for Seamless Integration of Multi Active/Passive Core-Agents
Amine B. Hassouna
Hana Chaari
Ines Belhaj
LLMAG
99
1
0
17 Sep 2024
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research
  Repositories
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories
Ben Bogin
Kejuan Yang
Shashank Gupta
Kyle Richardson
Erin Bransom
Peter Clark
Ashish Sabharwal
Tushar Khot
ELMLRM
87
20
0
11 Sep 2024
xLAM: A Family of Large Action Models to Empower AI Agent Systems
xLAM: A Family of Large Action Models to Empower AI Agent Systems
Jianguo Zhang
Tian Lan
Ming Zhu
Zuxin Liu
Thai Hoang
...
Juan Carlos Niebles
Shelby Heinecke
Huan Wang
Silvio Savarese
Caiming Xiong
ALM
116
46
0
05 Sep 2024
NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API Calls
NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API Calls
Kinjal Basu
Ibrahim Abdelaziz
Kiran Kate
Mayank Agarwal
Maxwell Crouse
...
Sadhana Kumaravel
Saurabh Goyal
Xin Wang
Luis A. Lastras
Pavan Kapanipathi
88
11
0
04 Sep 2024
ToolACE: Winning the Points of LLM Function Calling
ToolACE: Winning the Points of LLM Function Calling
Weiwen Liu
Xiaolin Huang
Xingshan Zeng
Xinlong Hao
Shuai Yu
...
Xin Jiang
Ruiming Tang
Defu Lian
Qun Liu
Enhong Chen
LLMAG
112
48
0
02 Sep 2024
TinyAgent: Function Calling at the Edge
TinyAgent: Function Calling at the Edge
Lutfi Eren Erdogan
Nicholas Lee
Siddharth Jha
Sehoon Kim
Ryan Tabrizi
Suhong Moon
Coleman Hooper
Gopala Anumanchipalli
Kurt Keutzer
Amir Gholami
LLMAG
114
13
0
01 Sep 2024
Learning to Ask: When LLM Agents Meet Unclear Instruction
Learning to Ask: When LLM Agents Meet Unclear Instruction
Wenxuan Wang
Juluan Shi
Chaozheng Wang
Cheryl Lee
Chaozheng Wang
Cheryl Lee
Youliang Yuan
Jen-tse Huang
Wenxiang Jiao
Michael R. Lyu
LLMAG
185
12
0
31 Aug 2024
CodeGraph: Enhancing Graph Reasoning of LLMs with Code
CodeGraph: Enhancing Graph Reasoning of LLMs with Code
Qiaolong Cai
Zhaowei Wang
Shizhe Diao
James Kwok
Yangqiu Song
LRM
119
4
0
25 Aug 2024
Exploring Large Language Models for Feature Selection: A Data-centric
  Perspective
Exploring Large Language Models for Feature Selection: A Data-centric Perspective
Dawei Li
Zhen Tan
Huan Liu
LM&MA
112
11
0
21 Aug 2024
Plan with Code: Comparing approaches for robust NL to DSL generation
Plan with Code: Comparing approaches for robust NL to DSL generation
Nastaran Bassamzadeh
Chhaya Methani
32
1
0
15 Aug 2024
Validation Requirements for AI-based Intervention-Evaluation in Aging
  and Longevity Research and Practice
Validation Requirements for AI-based Intervention-Evaluation in Aging and Longevity Research and Practice
G. Fuellen
Anton Y Kulaga
Sebastian Lobentanzer
Maximilian Unfried
Roberto Avelar
Daniel Palmer
Brian K. Kennedy
60
3
0
11 Aug 2024
ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities
ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities
Jiarui Lu
Thomas Holleis
Yizhe Zhang
Bernhard Aumayer
Feng Nan
...
Shen Ma
Mengyu Li
Guoli Yin
Zirui Wang
Ruoming Pang
LLMAGELM
110
39
0
08 Aug 2024
StructuredRAG: JSON Response Formatting with Large Language Models
StructuredRAG: JSON Response Formatting with Large Language Models
Connor Shorten
Charles Pierse
Thomas Benjamin Smith
Erika Cardenas
Akanksha Sharma
John Trengrove
Bob van Luijt
80
8
0
07 Aug 2024
LLM-Aided Compilation for Tensor Accelerators
LLM-Aided Compilation for Tensor Accelerators
Charles Hong
Sahil Bhatia
Altan Haan
Shengjun Kris Dong
Dima Nikiforov
Alvin Cheung
Y. Shao
74
2
0
06 Aug 2024
Re-Invoke: Tool Invocation Rewriting for Zero-Shot Tool Retrieval
Re-Invoke: Tool Invocation Rewriting for Zero-Shot Tool Retrieval
Yanfei Chen
Jinsung Yoon
Devendra Singh Sachan
Qingze Wang
Vincent Cohen-Addad
M. Bateni
Chen-Yu Lee
Tomas Pfister
85
8
0
03 Aug 2024
Coalitions of Large Language Models Increase the Robustness of AI Agents
Coalitions of Large Language Models Increase the Robustness of AI Agents
Prattyush Mangal
Carol Mak
Theo Kanakis
Timothy Donovan
Dave Braines
Edward Pyzer-Knapp
53
1
0
02 Aug 2024
Tulip Agent -- Enabling LLM-Based Agents to Solve Tasks Using Large Tool
  Libraries
Tulip Agent -- Enabling LLM-Based Agents to Solve Tasks Using Large Tool Libraries
Felix Ocker
Daniel Tanneberg
Julian Eggert
Michael Gienger
LLMAGLM&RoVLM
88
5
0
31 Jul 2024
Domain Adaptable Prescriptive AI Agent for Enterprise
Domain Adaptable Prescriptive AI Agent for Enterprise
Piero Orderique
Wei-Ju Sun
Kristjan Greenewald
78
1
0
29 Jul 2024
Apple Intelligence Foundation Language Models
Apple Intelligence Foundation Language Models
Tom Gunter
Zirui Wang
Chong-Jun Wang
Ruoming Pang
Andy Narayanan
...
Xinwen Liu
Yang Zhao
Yin Xia
Zhile Ren
Zhongzheng Ren
148
40
0
29 Jul 2024
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Zehui Chen
Kuikun Liu
Qiuchen Wang
Jiangning Liu
Wenwei Zhang
Kai Chen
Feng Zhao
LLMAG
137
29
0
29 Jul 2024
MathViz-E: A Case-study in Domain-Specialized Tool-Using Agents
MathViz-E: A Case-study in Domain-Specialized Tool-Using Agents
Arya Bulusu
Brandon Man
Ashish Jagmohan
Aditya Vempaty
Jennifer Mari-Wyka
Deepak Akkil
78
2
0
24 Jul 2024
OpenHands: An Open Platform for AI Software Developers as Generalist Agents
OpenHands: An Open Platform for AI Software Developers as Generalist Agents
Xingyao Wang
Boxuan Li
Yufan Song
Frank F. Xu
Xiangru Tang
...
Junyang Lin
Robert Brennan
Hao Peng
Heng Ji
Graham Neubig
VLM
110
131
0
23 Jul 2024
Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs)
Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs)
Apurv Verma
Satyapriya Krishna
Sebastian Gehrmann
Madhavan Seshadri
Anu Pradhan
Tom Ault
Leslie Barrett
David Rabinowitz
John Doucette
Nhathai Phan
129
15
0
20 Jul 2024
On Pre-training of Multimodal Language Models Customized for Chart
  Understanding
On Pre-training of Multimodal Language Models Customized for Chart Understanding
Wan-Cyuan Fan
Yen-Chun Chen
Mengchen Liu
Lu Yuan
Leonid Sigal
103
7
0
19 Jul 2024
Agent-E: From Autonomous Web Navigation to Foundational Design
  Principles in Agentic Systems
Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems
Tamer Abuelsaad
Deepak Akkil
Prasenjit Dey
Ashish Jagmohan
Aditya Vempaty
Ravi Kokku
100
28
0
17 Jul 2024
Speech-Copilot: Leveraging Large Language Models for Speech Processing
  via Task Decomposition, Modularization, and Program Generation
Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation
Chun-Yi Kuan
Chih-Kai Yang
Wei-Ping Huang
Ke-Han Lu
Hung-yi Lee
113
13
0
13 Jul 2024
On Mitigating Code LLM Hallucinations with API Documentation
On Mitigating Code LLM Hallucinations with API Documentation
Nihal Jain
Robert Kwiatkowski
Baishakhi Ray
M. K. Ramanathan
Varun Kumar
107
8
0
13 Jul 2024
Large Language Models as Biomedical Hypothesis Generators: A
  Comprehensive Evaluation
Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
Biqing Qi
Kaiyan Zhang
Kai Tian
Haoxiang Li
Zhang-Ren Chen
Sihang Zeng
Ermo Hua
Hu Jinfang
Bowen Zhou
LM&MA
127
18
0
12 Jul 2024
WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment
WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment
Jiefu Ou
Arda Uzunoglu
Benjamin Van Durme
Daniel Khashabi
LM&RoVGen
90
3
0
10 Jul 2024
Optimal Decision Making Through Scenario Simulations Using Large
  Language Models
Optimal Decision Making Through Scenario Simulations Using Large Language Models
Sumedh Rasal
E. Hauer
89
1
0
09 Jul 2024
LLMBox: A Comprehensive Library for Large Language Models
LLMBox: A Comprehensive Library for Large Language Models
Tianyi Tang
Yiwen Hu
Bingqian Li
Wenyang Luo
Zijing Qin
...
Chunxuan Xia
Junyi Li
Kun Zhou
Wayne Xin Zhao
Ji-Rong Wen
65
2
0
08 Jul 2024
What Affects the Stability of Tool Learning? An Empirical Study on the
  Robustness of Tool Learning Frameworks
What Affects the Stability of Tool Learning? An Empirical Study on the Robustness of Tool Learning Frameworks
Chengrui Huang
Zhengliang Shi
Yuntao Wen
Xiuying Chen
Peng Han
Shen Gao
Shuo Shang
76
2
0
03 Jul 2024
WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large
  Language Models
WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models
Kangyun Ning
Yisong Su
Xueqiang Lv
Yuanzhe Zhang
Jian Liu
Kang Liu
Jinan Xu
ELMLLMAG
83
3
0
02 Jul 2024
Concise and Precise Context Compression for Tool-Using Language Models
Concise and Precise Context Compression for Tool-Using Language Models
Yang Xu
Yunlong Feng
Honglin Mu
Yutai Hou
Yitong Li
...
Zhongyang Li
Dandan Tu
Qingfu Zhu
Hao Fei
Wanxiang Che
LLMAG
77
3
0
02 Jul 2024
Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
Sujan Dutta
Sayantan Mahinder
R. Anantha
Bortik Bandyopadhyay
ALM
75
7
0
28 Jun 2024
BMW Agents -- A Framework For Task Automation Through Multi-Agent
  Collaboration
BMW Agents -- A Framework For Task Automation Through Multi-Agent Collaboration
Noel Crawford
Edward B. Duffy
Iman Evazzade
Torsten Foehr
Gregory Robbins
D. K. Saha
Jiya Varma
Marcin Ziolkowski
LLMAG
134
3
0
28 Jun 2024
ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents
ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents
Haiyang Shen
Yue Li
Desong Meng
Dongqi Cai
Sheng Qi
Li Zhang
Mengwei Xu
Yudong Han
LLMAG
147
12
0
28 Jun 2024
Granite-Function Calling Model: Introducing Function Calling Abilities
  via Multi-task Learning of Granular Tasks
Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks
Ibrahim Abdelaziz
Kinjal Basu
Mayank Agarwal
Yara Rizk
Matthew Stallone
...
Merve Unuvar
David D. Cox
Salim Roukos
Luis A. Lastras
Pavan Kapanipathi
LLMAG
92
24
0
27 Jun 2024
APIGen: Automated Pipeline for Generating Verifiable and Diverse
  Function-Calling Datasets
APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
Zuxin Liu
Thai Hoang
Jianguo Zhang
Ming Zhu
Tian Lan
...
Silvio Savarese
Juan Carlos Niebles
Huan Wang
Shelby Heinecke
Caiming Xiong
122
62
0
26 Jun 2024
Previous
123456789
Next