Werewolf Arena: A Case Study in LLM Evaluation via Social Deduction

18 July 2024
Suma Bailis, Jane Friedhoff, Feiyang Chen
arXiv: 2407.13943

Papers citing "Werewolf Arena: A Case Study in LLM Evaluation via Social Deduction"

3 / 3 papers shown
DSGBench: A Diverse Strategic Game Benchmark for Evaluating LLM-based Agents in Complex Decision-Making Environments
Wenjie Tang, Yuan Zhou, Erqiang Xu, Keyan Cheng, Minne Li, Liquan Xiao
ELM
47
1
0
08 Mar 2025
A Survey on Large Language Model-Based Social Agents in Game-Theoretic Scenarios
Xiachong Feng, Longxu Dou, Ella Li, Qinghao Wang, H. Wang, Yu Guo, Chang Ma, Lingpeng Kong
LM&Ro, LM&MA, ELM, LLMAG, AI4CE
70
4
0
05 Dec 2024
Simulating Human Strategic Behavior: Comparing Single and Multi-agent LLMs
Karthik Sreedhar, Lydia B. Chilton
LLMAG
48
12
0
13 Feb 2024