ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.03051
  4. Cited By
How FaR Are Large Language Models From Agents with Theory-of-Mind?

How FaR Are Large Language Models From Agents with Theory-of-Mind?

4 October 2023
Pei Zhou
Aman Madaan
Srividya Pranavi Potharaju
Aditya Gupta
Kevin R. McKee
Ari Holtzman
Jay Pujara
Xiang Ren
Swaroop Mishra
Aida Nematzadeh
Shyam Upadhyay
Manaal Faruqui
    LRM
    AI4CE
ArXivPDFHTML

Papers citing "How FaR Are Large Language Models From Agents with Theory-of-Mind?"

41 / 41 papers shown
Title
Re-evaluating Theory of Mind evaluation in large language models
Re-evaluating Theory of Mind evaluation in large language models
Jennifer Hu
Felix Sosa
T. Ullman
45
0
0
28 Feb 2025
From Perceptions to Decisions: Wildfire Evacuation Decision Prediction with Behavioral Theory-informed LLMs
From Perceptions to Decisions: Wildfire Evacuation Decision Prediction with Behavioral Theory-informed LLMs
Ruxiao Chen
Chenguang Wang
Yuran Sun
Xilei Zhao
Susu Xu
95
1
0
24 Feb 2025
HARBOR: Exploring Persona Dynamics in Multi-Agent Competition
HARBOR: Exploring Persona Dynamics in Multi-Agent Competition
Kenan Jiang
Li Xiong
Fei Liu
61
0
0
17 Feb 2025
Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection
Bo Yang
Jiaxian Guo
Yusuke Iwasawa
Y. Matsuo
AI4CE
41
1
0
28 Jan 2025
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation
  Understanding
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding
Yueqian Wang
Xiaojun Meng
Yijiao Wang
Jianxin Liang
Qun Liu
Dongyan Zhao
36
0
0
23 Dec 2024
Lies, Damned Lies, and Distributional Language Statistics: Persuasion
  and Deception with Large Language Models
Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models
Cameron R. Jones
Benjamin Bergen
67
5
0
22 Dec 2024
Multi-ToM: Evaluating Multilingual Theory of Mind Capabilities in Large
  Language Models
Multi-ToM: Evaluating Multilingual Theory of Mind Capabilities in Large Language Models
Jayanta Sadhu
Ayan Antik Khan
Noshin Nawal
Sanju Basak
Abhik Bhattacharjee
Rifat Shahriyar
74
0
0
24 Nov 2024
Hermes: A Large Language Model Framework on the Journey to Autonomous
  Networks
Hermes: A Large Language Model Framework on the Journey to Autonomous Networks
Fadhel Ayed
Ali Maatouk
Nicola Piovesan
Antonio De Domenico
Merouane Debbah
Zhi-Quan Luo
37
3
0
10 Nov 2024
Belief in the Machine: Investigating Epistemological Blind Spots of
  Language Models
Belief in the Machine: Investigating Epistemological Blind Spots of Language Models
Mirac Suzgun
Tayfun Gur
Federico Bianchi
Daniel E. Ho
Thomas Icard
Dan Jurafsky
James Zou
31
1
0
28 Oct 2024
SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit
  ToM Application in LLMs
SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
Yuling Gu
Oyvind Tafjord
Hyunwoo Kim
Jared Moore
Ronan Le Bras
Peter Clark
Yejin Choi
33
8
0
17 Oct 2024
Agents Thinking Fast and Slow: A Talker-Reasoner Architecture
Agents Thinking Fast and Slow: A Talker-Reasoner Architecture
Konstantina Christakopoulou
Shibl Mourad
Maja Matarić
LLMAG
41
11
0
10 Oct 2024
Auto-Evolve: Enhancing Large Language Model's Performance via
  Self-Reasoning Framework
Auto-Evolve: Enhancing Large Language Model's Performance via Self-Reasoning Framework
Krishna Aswani
Huilin Lu
Pranav Patankar
Priya Dhalwani
Iris Tan
Jayant Ganeshmohan
Simon Lacasse
ReLM
LLMAG
LRM
32
0
0
08 Oct 2024
Recent Advancement of Emotion Cognition in Large Language Models
Recent Advancement of Emotion Cognition in Large Language Models
Yuyan Chen
Yanghua Xiao
OffRL
37
6
0
20 Sep 2024
CHARTOM: A Visual Theory-of-Mind Benchmark for Multimodal Large Language Models
CHARTOM: A Visual Theory-of-Mind Benchmark for Multimodal Large Language Models
S. Bharti
Shiyun Cheng
Jihyun Rho
Martina Rao
Mu Cai
Yong Jae Lee
Martina Rau
Xiaojin Zhu
42
1
0
26 Aug 2024
Perceptions of Linguistic Uncertainty by Language Models and Humans
Perceptions of Linguistic Uncertainty by Language Models and Humans
Catarina G Belém
Markelle Kelly
M. Steyvers
Sameer Singh
Padhraic Smyth
43
3
0
22 Jul 2024
Perceptions to Beliefs: Exploring Precursory Inferences for Theory of
  Mind in Large Language Models
Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models
Chani Jung
Dongkwan Kim
Jiho Jin
Jiseon Kim
Yeon Seonwoo
Yejin Choi
Alice H. Oh
Hyunwoo Kim
LRM
58
7
0
08 Jul 2024
Brittle Minds, Fixable Activations: Understanding Belief Representations in Language Models
Brittle Minds, Fixable Activations: Understanding Belief Representations in Language Models
Matteo Bortoletto
Constantin Ruhdorfer
Lei Shi
Andreas Bulling
AI4MH
LRM
48
4
0
25 Jun 2024
InterIntent: Investigating Social Intelligence of LLMs via Intention
  Understanding in an Interactive Game Context
InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context
Ziyi Liu
Abhishek Anand
Pei Zhou
Jen-tse Huang
Jieyu Zhao
83
6
0
18 Jun 2024
A Notion of Complexity for Theory of Mind via Discrete World Models
A Notion of Complexity for Theory of Mind via Discrete World Models
X. A. Huang
Emanuele La Malfa
Samuele Marro
Andrea Asperti
Anthony Cohn
Michael Wooldridge
45
6
0
16 Jun 2024
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Seungone Kim
Juyoung Suk
Ji Yong Cho
Shayne Longpre
Chaeeun Kim
...
Sean Welleck
Graham Neubig
Moontae Lee
Kyungjae Lee
Minjoon Seo
ELM
ALM
LM&MA
105
31
0
09 Jun 2024
Cognitive Insights and Stable Coalition Matching for Fostering Multi-Agent Cooperation
Cognitive Insights and Stable Coalition Matching for Fostering Multi-Agent Cooperation
Jiaqi Shao
Tianjun Yuan
Tao Lin
Xuanyu Cao
50
0
0
28 May 2024
Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation
  of Non-Literal Intent Resolution in LLMs
Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Non-Literal Intent Resolution in LLMs
Akhila Yerukola
Saujas Vaduguru
Daniel Fried
Maarten Sap
37
1
0
14 May 2024
Language Models Represent Beliefs of Self and Others
Language Models Represent Beliefs of Self and Others
Wentao Zhu
Zhining Zhang
Yizhou Wang
MILM
LRM
50
7
0
28 Feb 2024
KoDialogBench: Evaluating Conversational Understanding of Language
  Models with Korean Dialogue Benchmark
KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark
Seongbo Jang
Seonghyeon Lee
Hwanjo Yu
ELM
29
0
0
27 Feb 2024
ToMBench: Benchmarking Theory of Mind in Large Language Models
ToMBench: Benchmarking Theory of Mind in Large Language Models
Zhuang Chen
Jincenzi Wu
Jinfeng Zhou
Bosi Wen
Guanqun Bi
...
Yaru Cao
Mengting Hu
Yunghwei Lai
Zexuan Xiong
Minlie Huang
40
12
0
23 Feb 2024
Evolving AI Collectives to Enhance Human Diversity and Enable
  Self-Regulation
Evolving AI Collectives to Enhance Human Diversity and Enable Self-Regulation
Shiyang Lai
Yujin Potter
Junsol Kim
Richard Zhuang
Dawn Song
James Evans
52
3
0
19 Feb 2024
Multi-Task Inference: Can Large Language Models Follow Multiple
  Instructions at Once?
Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?
Guijin Son
Sangwon Baek
Sangdae Nam
Ilgyun Jeong
Seungone Kim
ELM
LRM
40
14
0
18 Feb 2024
Toward a Team of AI-made Scientists for Scientific Discovery from Gene
  Expression Data
Toward a Team of AI-made Scientists for Scientific Discovery from Gene Expression Data
Haoyang Liu
Yijiang Li
Jinglin Jian
Yuxuan Cheng
Jianrong Lu
Shuyi Guo
Jinglei Zhu
Mianchen Zhang
Miantong Zhang
Haohan Wang
19
4
0
15 Feb 2024
OpenToM: A Comprehensive Benchmark for Evaluating Theory-of-Mind
  Reasoning Capabilities of Large Language Models
OpenToM: A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning Capabilities of Large Language Models
Hainiu Xu
Runcong Zhao
Lixing Zhu
Bin Liang
Yulan He
84
20
0
08 Feb 2024
Self-Discover: Large Language Models Self-Compose Reasoning Structures
Self-Discover: Large Language Models Self-Compose Reasoning Structures
Pei Zhou
Jay Pujara
Xiang Ren
Xinyun Chen
Heng-Tze Cheng
Quoc V. Le
Ed H. Chi
Denny Zhou
Swaroop Mishra
Huaixiu Steven Zheng
LRM
ReLM
27
48
0
06 Feb 2024
Cooperation on the Fly: Exploring Language Agents for Ad Hoc Teamwork in
  the Avalon Game
Cooperation on the Fly: Exploring Language Agents for Ad Hoc Teamwork in the Avalon Game
Zijing Shi
Meng Fang
Shunfeng Zheng
Shilong Deng
Ling-Hao Chen
Yali Du
36
23
0
29 Dec 2023
CLOVA: A Closed-Loop Visual Assistant with Tool Usage and Update
CLOVA: A Closed-Loop Visual Assistant with Tool Usage and Update
Zhi Gao
Yuntao Du
Xintong Zhang
Xiaojian Ma
Wenjuan Han
Song-Chun Zhu
Qing Li
LLMAG
VLM
31
21
0
18 Dec 2023
Merlin:Empowering Multimodal LLMs with Foresight Minds
Merlin:Empowering Multimodal LLMs with Foresight Minds
En Yu
Liang Zhao
Yana Wei
Jinrong Yang
Dongming Wu
...
Haoran Wei
Tiancai Wang
Zheng Ge
Xiangyu Zhang
Wenbing Tao
LRM
18
25
0
30 Nov 2023
Rephrase and Respond: Let Large Language Models Ask Better Questions for
  Themselves
Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves
Yihe Deng
Weitong Zhang
Zixiang Chen
Quanquan Gu
LRM
26
73
0
07 Nov 2023
Towards A Holistic Landscape of Situated Theory of Mind in Large
  Language Models
Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models
Ziqiao Ma
Jacob Sansom
Run Peng
Joyce Chai
47
17
0
30 Oct 2023
MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for
  Situated Neural Dialogue Generation
MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for Situated Neural Dialogue Generation
Shuwen Qiu
Mingdian Liu
Hengli Li
Song-Chun Zhu
Zilong Zheng
16
0
0
27 Jun 2023
Generative Agents: Interactive Simulacra of Human Behavior
Generative Agents: Interactive Simulacra of Human Behavior
J. Park
Joseph C. O'Brien
Carrie J. Cai
Meredith Ringel Morris
Percy Liang
Michael S. Bernstein
LM&Ro
AI4CE
232
1,754
0
07 Apr 2023
ReAct: Synergizing Reasoning and Acting in Language Models
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
273
2,510
0
06 Oct 2022
Large Language Models are Zero-Shot Reasoners
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
328
4,077
0
24 May 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
398
8,559
0
28 Jan 2022
MindCraft: Theory of Mind Modeling for Situated Dialogue in
  Collaborative Tasks
MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks
Cristian-Paul Bara
Sky CH-Wang
J. Chai
67
61
0
13 Sep 2021
1