ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2501.10132
  4. Cited By

ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

20 January 2025
Lucen Zhong
Zhengxiao Du
Xiaohan Zhang
Haiyi Hu
J. Tang
    LLMAG
ArXiv (abs)PDFHTML

Papers citing "ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario"

6 / 6 papers shown
Title
Invocable APIs derived from NL2SQL datasets for LLM Tool-Calling Evaluation
Invocable APIs derived from NL2SQL datasets for LLM Tool-Calling Evaluation
Benjamin Elder
Anupama Murthi
J. Kang
Ankita Rajaram Naik
Kiran Kate
Kinjal Basu
Danish Contractor
20
0
0
12 Jun 2025
Evolutionary Perspectives on the Evaluation of LLM-Based AI Agents: A Comprehensive Survey
Evolutionary Perspectives on the Evaluation of LLM-Based AI Agents: A Comprehensive Survey
Jiachen Zhu
Menghui Zhu
Renting Rui
Rong Shan
Congmin Zheng
...
Jianghao Lin
Weiwen Liu
Ruiming Tang
Yong Yu
Weinan Zhang
LLMAGELM
40
0
0
06 Jun 2025
ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions
ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions
Beong-woo Kwak
Minju Kim
Dongha Lim
Hyungjoo Chae
Dongjin Kang
Sunghwan Kim
Dongil Yang
Jinyoung Yeo
LLMAGRALM
71
0
0
29 May 2025
Small Models, Big Tasks: An Exploratory Empirical Study on Small Language Models for Function Calling
Small Models, Big Tasks: An Exploratory Empirical Study on Small Language Models for Function Calling
Ishan Kavathekar
Raghav Donakanti
Ponnurangam Kumaraguru
Karthik Vaidhyanathan
143
1
0
27 Apr 2025
Survey on Evaluation of LLM-based Agents
Survey on Evaluation of LLM-based Agents
Asaf Yehudai
Lilach Eden
Alan Li
Guy Uziel
Yilun Zhao
Roy Bar-Haim
Arman Cohan
Michal Shmueli-Scheuer
LLMAGELM
Presented at ResearchTrend Connect | LLMAG on 07 May 2025
197
14
0
20 Mar 2025
Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies
Luyi Jiang
Jiasi Chen
Lu Lu
Xinwei Peng
Lihao Liu
Junjun He
Jie Xu
ELMLM&MA
80
0
0
10 Mar 2025
1