2505.15935
MAPS: A Multilingual Benchmark for Global Agent Performance and Security
21 May 2025
Omer Hofman
Oren Rachmil
Shamik Bose
Vikas Pahuja
Jonathan Brokman
Toshiya Shimizu
Trisha Starostina
Kelly Marchisio
Seraphina Goldfarb-Tarrant
Roman Vainshtein
Papers citing "MAPS: A Multilingual Benchmark for Global Agent Performance and Security"
4 / 4 papers shown
Title
Survey on Evaluation of LLM-based Agents
Asaf Yehudai
Lilach Eden
Alan Li
Guy Uziel
Yilun Zhao
Roy Bar-Haim
Arman Cohan
Michal Shmueli-Scheuer
LLMAG
ELM
Presented at ResearchTrend Connect | LLMAG on 07 May 2025
200
14
0
20 Mar 2025
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
Frank F. Xu
Yufan Song
Boxuan Li
Yuxuan Tang
Kritanjali Jain
...
Wayne Chi
Lawrence Jang
Yiqing Xie
Shuyan Zhou
Graham Neubig
LLMAG
197
42
0
18 Dec 2024
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents
H. Zhang
Jingyuan Huang
Kai Mei
Yifei Yao
Zhenting Wang
Chenlu Zhan
Hongwei Wang
Yongfeng Zhang
AAML
LLMAG
ELM
210
40
0
03 Oct 2024
Crosslingual Capabilities and Knowledge Barriers in Multilingual Large Language Models
Lynn Chua
Badih Ghazi
Yangsibo Huang
Pritish Kamath
Ravi Kumar
Pasin Manurangsi
Amer Sinha
Chulin Xie
Chiyuan Zhang
156
2
0
23 Jun 2024