ResearchTrend.AI
Improving Factuality and Reasoning in Language Models through Multiagent Debate

23 May 2023
Yilun Du
Shuang Li
Antonio Torralba
J. Tenenbaum
Igor Mordatch
    LLMAG
    LRM

Papers citing "Improving Factuality and Reasoning in Language Models through Multiagent Debate"

50 / 465 papers shown
Title
Reward Design for Justifiable Sequential Decision-Making
A. Sukovic
Goran Radanović
24
0
0
24 Feb 2024
DEEM: Dynamic Experienced Expert Modeling for Stance Detection
Xiaolong Wang
Yile Wang
Sijie Cheng
Peng Li
Yang Liu
39
5
0
23 Feb 2024
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
Zicheng Lin
Zhibin Gou
Tian Liang
Ruilin Luo
Haowei Liu
Yujiu Yang
LRM
42
43
0
22 Feb 2024
BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models
Xueliang Zhao
Xinting Huang
Tingchen Fu
Qintong Li
Shansan Gong
Lemao Liu
Wei Bi
Lingpeng Kong
LRM
37
1
0
21 Feb 2024
AgentScope: A Flexible yet Robust Multi-Agent Platform
Dawei Gao
Zitao Li
Xuchen Pan
Weirui Kuang
Zhijian Ma
...
Chen Cheng
Hongzhu Shi
Yaliang Li
Bolin Ding
Jingren Zhou
LLMAG
32
28
0
21 Feb 2024
Large Language Models for Data Annotation: A Survey
Zhen Tan
Dawei Li
Song Wang
Alimohammad Beigi
Bohan Jiang
Amrita Bhattacharjee
Mansooreh Karami
Wenlin Yao
Lu Cheng
Huan Liu
SyDa
56
50
0
21 Feb 2024
Soft Self-Consistency Improves Language Model Agents
Han Wang
Archiki Prasad
Elias Stengel-Eskin
Mohit Bansal
LLMAG
24
8
0
20 Feb 2024
Defending Jailbreak Prompts via In-Context Adversarial Game
Yujun Zhou
Yufei Han
Haomin Zhuang
Kehan Guo
Zhenwen Liang
Hongyan Bao
Xiangliang Zhang
LLMAG
AAML
42
11
0
20 Feb 2024
Evolving AI Collectives to Enhance Human Diversity and Enable Self-Regulation
Shiyang Lai
Yujin Potter
Junsol Kim
Richard Zhuang
Dawn Song
James Evans
52
3
0
19 Feb 2024
Confidence Matters: Revisiting Intrinsic Self-Correction Capabilities of Large Language Models
Loka Li
Zhenhao Chen
Guan-Hong Chen
Yixuan Zhang
Yusheng Su
Eric P. Xing
Kun Zhang
LRM
44
16
0
19 Feb 2024
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations
Jinhao Duan
Renming Zhang
James Diffenderfer
B. Kailkhura
Lichao Sun
Elias Stengel-Eskin
Mohit Bansal
Tianlong Chen
Kaidi Xu
ELM
LRM
34
58
0
19 Feb 2024
Shall We Team Up: Exploring Spontaneous Cooperation of Competing LLM Agents
Zengqing Wu
Run Peng
Shuyuan Zheng
Qianying Liu
Xu Han
Brian Inhyuk Kwon
Makoto Onizuka
Shaojie Tang
Chuan Xiao
44
10
0
19 Feb 2024
Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One
Tianlin Li
Xiaoyu Zhang
Chao Du
Tianyu Pang
Qian Liu
Qing Guo
Chao Shen
Yang Liu
ALM
45
10
0
19 Feb 2024
LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration
Jun Zhao
Can Zu
Haotian Xu
Yi Lu
Wei He
Yiwen Ding
Tao Gui
Qi Zhang
Xuanjing Huang
RALM
LLMAG
47
22
0
18 Feb 2024
Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements
Ming Li
Jiuhai Chen
Lichang Chen
Dinesh Manocha
71
18
0
16 Feb 2024
Retrieve Only When It Needs: Adaptive Retrieval Augmentation for Hallucination Mitigation in Large Language Models
Hanxing Ding
Liang Pang
Zihao Wei
Huawei Shen
Xueqi Cheng
HILM
RALM
81
16
0
16 Feb 2024
Language Models with Conformal Factuality Guarantees
Christopher Mohri
Tatsunori Hashimoto
HILM
44
33
0
15 Feb 2024
Not Just Novelty: A Longitudinal Study on Utility and Customization of an AI Workflow
Tao Long
Katy Ilonka Gero
Lydia B. Chilton
41
13
0
15 Feb 2024
Toward a Team of AI-made Scientists for Scientific Discovery from Gene Expression Data
Haoyang Liu
Yijiang Li
Jinglin Jian
Yuxuan Cheng
Jianrong Lu
Shuyi Guo
Jinglei Zhu
Mianchen Zhang
Miantong Zhang
Haohan Wang
19
4
0
15 Feb 2024
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Xiangming Gu
Xiaosen Zheng
Tianyu Pang
Chao Du
Qian Liu
Ye Wang
Jing Jiang
Min-Bin Lin
LLMAG
LM&Ro
37
49
0
13 Feb 2024
On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks
Kaya Stechly
Karthik Valmeekam
Subbarao Kambhampati
ReLM
LRM
36
50
0
12 Feb 2024
Large Language Models as Agents in Two-Player Games
Yang Liu
Peng Sun
Hang Li
LLMAG
45
3
0
12 Feb 2024
Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate
Kyungha Kim
Sangyun Lee
Kung-Hsiang Huang
Hou Pong Chan
Manling Li
Chenhui Xu
LRM
57
38
0
12 Feb 2024
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents
Yuxi Wei
Zi Wang
Yifan Lu
Chenxin Xu
Chang-rui Liu
Hao Zhao
Siheng Chen
Yanfeng Wang
VGen
65
59
0
08 Feb 2024
Self-Alignment of Large Language Models via Monopolylogue-based Social Scene Simulation
Xianghe Pang
Shuo Tang
Rui Ye
Yuxin Xiong
Bolun Zhang
Yanfeng Wang
Siheng Chen
122
28
0
08 Feb 2024
LLM Multi-Agent Systems: Challenges and Open Problems
Shanshan Han
Qifan Zhang
Yuhang Yao
Weizhao Jin
Zhaozhuo Xu
LLMAG
50
36
0
05 Feb 2024
Factuality of Large Language Models in the Year 2024
Yuxia Wang
Minghan Wang
Muhammad Arslan Manzoor
Fei Liu
Georgi Georgiev
Rocktim Jyoti Das
Preslav Nakov
LRM
HILM
38
7
0
04 Feb 2024
More Agents Is All You Need
Junyou Li
Qin Zhang
Yangbin Yu
Qiang Fu
Deheng Ye
LLMAG
147
63
0
03 Feb 2024
MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models
Justin Chih-Yao Chen
Swarnadeep Saha
Elias Stengel-Eskin
Mohit Bansal
LRM
LLMAG
32
15
0
02 Feb 2024
Foundation Model Sherpas: Guiding Foundation Models through Knowledge and Reasoning
D. Bhattacharjya
Junkyu Lee
Don Joven Agravante
Balaji Ganesan
Radu Marinescu
LLMAG
38
1
0
02 Feb 2024
Compositional Generative Modeling: A Single Model is Not All You Need
Yilun Du
L. Kaelbling
PINN
GAN
51
20
0
02 Feb 2024
Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning
Jitao Sang
Yuhang Wang
Jing Zhang
Yanxu Zhu
Chao Kong
Junhong Ye
Shuyu Wei
Jinlin Xiao
39
7
0
01 Feb 2024
Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Vidhisha Balachandran
Yulia Tsvetkov
29
78
0
01 Feb 2024
LLM Voting: Human Choices and AI Collective Decision Making
Joshua C. Yang
Damian Dailisan
Marcin Korecki
C. I. Hausladen
Dirk Helbing
34
17
0
31 Jan 2024
WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts
Pardis Sadat Zahraei
Ali Emami
27
6
0
31 Jan 2024
Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks
Wenyue Hua
Jiang Guo
Mingwen Dong
He Zhu
Patrick K. L. Ng
Zhiguo Wang
KELM
76
17
0
31 Jan 2024
Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
Timothy Baldwin
LRM
37
27
0
28 Jan 2024
Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding
Mirac Suzgun
Adam Tauman Kalai
KELM
LRM
LLMAG
ReLM
51
65
0
23 Jan 2024
Automated Fact-Checking of Climate Change Claims with Large Language Models
Markus Leippold
S. Vaghefi
Dominik Stammbach
V. Muccione
J. Bingler
...
Tobias Schimanski
Glen Gostlow
Ting Yu
Juerg Luterbacher
C. Huggel
23
9
0
23 Jan 2024
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
Taicheng Guo
Xiuying Chen
Yaqi Wang
Ruidi Chang
Shichao Pei
Nitesh V. Chawla
Olaf Wiest
Xiangliang Zhang
LLMAG
LM&Ro
AI4CE
LRM
45
252
0
21 Jan 2024
Emergent Dominance Hierarchies in Reinforcement Learning Agents
Ram Rachum
Yonatan Nakar
Bill Tomlinson
Nitay Alon
Reuth Mirsky
30
0
0
21 Jan 2024
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback
Songyang Gao
Qiming Ge
Wei Shen
Shihan Dou
Junjie Ye
...
Yicheng Zou
Zhi Chen
Hang Yan
Qi Zhang
Dahua Lin
57
10
0
21 Jan 2024
Generative AI in EU Law: Liability, Privacy, Intellectual Property, and Cybersecurity
Claudio Novelli
F. Casolari
Philipp Hacker
Giorgio Spedicato
Luciano Floridi
AILaw
SILM
50
44
0
14 Jan 2024
Evolving Code with A Large Language Model
Erik Hemberg
Stephen Moskal
Una-May O’Reilly
30
26
0
13 Jan 2024
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint
Zhipeng Chen
Kun Zhou
Wayne Xin Zhao
Junchen Wan
Fuzheng Zhang
Di Zhang
Ji-Rong Wen
KELM
39
32
0
11 Jan 2024
Combating Adversarial Attacks with Multi-Agent Debate
Steffi Chern
Zhen Fan
Andy Liu
AAML
45
5
0
11 Jan 2024
Designing Heterogeneous LLM Agents for Financial Sentiment Analysis
Frank Xing
AIFin
28
50
0
11 Jan 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
Zujie Wen
Ke Xu
Qi Li
63
56
0
11 Jan 2024
Why Solving Multi-agent Path Finding with Large Language Model has not Succeeded Yet
Weizhe Chen
Sven Koenig
B. Dilkina
LM&Ro
LLMAG
AI4CE
72
16
0
08 Jan 2024
XUAT-Copilot: Multi-Agent Collaborative System for Automated User Acceptance Testing with Large Language Model
Zhitao Wang
Wei Wang
Zirao Li
Long Wang
Can Yi
Xinjie Xu
Luyang Cao
Hanjing Su
Shouzhi Chen
Jun Zhou
ALM
LLMAG
37
7
0
05 Jan 2024