ResearchTrend.AI
REALM-Bench: A Real-World Planning Benchmark for LLMs and Multi-Agent Systems

26 February 2025
Longling Geng
Edward Y. Chang
    LLMAG
arXiv:2502.18836

Papers citing "REALM-Bench: A Real-World Planning Benchmark for LLMs and Multi-Agent Systems"

8 / 8 papers shown
Title
LLM-Powered AI Agent Systems and Their Applications in Industry
Guannan Liang
Qianqian Tong
LLMAG, LM&Ro
61
3
0
22 May 2025
PLANET: A Collection of Benchmarks for Evaluating LLMs' Planning Capabilities
Haoming Li
Zhaoliang Chen
Jonathan Zhang
Fei Liu
LLMAG
126
2
0
21 Apr 2025
MACI: Multi-Agent Collaborative Intelligence for Adaptive Reasoning and Temporal Planning
Edward Y. Chang
LLMAG
107
1
0
28 Jan 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM, VLM, OffRL, AI4TS, LRM
380
1,967
0
22 Jan 2025
TaskBench: Benchmarking Large Language Models for Task Automation
Yongliang Shen
Kaitao Song
Xu Tan
Wenqi Zhang
Kan Ren
Siyu Yuan
Weiming Lu
Dongsheng Li
Yueting Zhuang
103
65
0
30 Nov 2023
TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
Zheng Chu
Jingchang Chen
Qianglong Chen
Weijiang Yu
Haotian Wang
Ming Liu
Bing Qin
LRM, ELM
104
15
0
29 Nov 2023
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society
Guohao Li
Hasan Hammoud
Hani Itani
Dmitrii Khizbullin
Bernard Ghanem
SyDa, ALM
130
513
0
31 Mar 2023
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
859
42,379
0
28 May 2020