ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.14325
  4. Cited By
Improving Factuality and Reasoning in Language Models through Multiagent
  Debate

Improving Factuality and Reasoning in Language Models through Multiagent Debate

23 May 2023
Yilun Du
Shuang Li
Antonio Torralba
J. Tenenbaum
Igor Mordatch
    LLMAG
    LRM
ArXivPDFHTML

Papers citing "Improving Factuality and Reasoning in Language Models through Multiagent Debate"

50 / 465 papers shown
Title
Teaching Models to Balance Resisting and Accepting Persuasion
Teaching Models to Balance Resisting and Accepting Persuasion
Elias Stengel-Eskin
Peter Hase
Joey Tianyi Zhou
MU
31
4
0
18 Oct 2024
ControlAgent: Automating Control System Design via Novel Integration of
  LLM Agents and Domain Expertise
ControlAgent: Automating Control System Design via Novel Integration of LLM Agents and Domain Expertise
Xingang Guo
Darioush Keivan
U. Syed
Lianhui Qin
Huan Zhang
Geir Dullerud
Peter M. Seiler
Bin Hu
34
5
0
17 Oct 2024
Think Thrice Before You Act: Progressive Thought Refinement in Large
  Language Models
Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models
Chengyu Du
Jinyi Han
Yizhou Ying
Aili Chen
Qianyu He
...
Haoran Guo
Jiaqing Liang
Zulong Chen
Liangyue Li
Yanghua Xiao
KELM
CLL
LRM
30
1
0
17 Oct 2024
Anchored Alignment for Self-Explanations Enhancement
Anchored Alignment for Self-Explanations Enhancement
Luis Felipe Villa-Arenas
Ata Nizamoglu
Qianli Wang
Sebastian Möller
Vera Schmitt
26
0
0
17 Oct 2024
"Let's Argue Both Sides": Argument Generation Can Force Small Models to
  Utilize Previously Inaccessible Reasoning Capabilities
"Let's Argue Both Sides": Argument Generation Can Force Small Models to Utilize Previously Inaccessible Reasoning Capabilities
Kaveh Eskandari Miandoab
Vasanth Sarathy
LRM
ReLM
33
1
0
16 Oct 2024
MedAide: Towards an Omni Medical Aide via Specialized LLM-based
  Multi-Agent Collaboration
MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration
Jinjie Wei
Dingkang Yang
Yanshu Li
Qingyao Xu
Zhaoyu Chen
M. Li
Yue Jiang
Xiaolu Hou
Lihua Zhang
33
1
0
16 Oct 2024
Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm
  Intelligence
Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence
Shangbin Feng
Zifeng Wang
Yike Wang
Sayna Ebrahimi
Hamid Palangi
...
Nathalie Rauschmayr
Yejin Choi
Yulia Tsvetkov
Chen-Yu Lee
Tomas Pfister
MoMe
37
3
0
15 Oct 2024
G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks
G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks
Guibin Zhang
Xinfeng Li
Xiangguo Sun
Guancheng Wan
Miao Yu
Junfeng Fang
Kun Wang
Dawei Cheng
Dawei Cheng
AAML
AI4CE
54
7
0
15 Oct 2024
A Scalable Communication Protocol for Networks of Large Language Models
A Scalable Communication Protocol for Networks of Large Language Models
Samuele Marro
Emanuele La Malfa
Jesse Wright
Ge Li
Nigel Shadbolt
Michael Wooldridge
Philip Torr
GNN
AIFin
43
8
0
14 Oct 2024
Expanding Search Space with Diverse Prompting Agents: An Efficient
  Sampling Approach for LLM Mathematical Reasoning
Expanding Search Space with Diverse Prompting Agents: An Efficient Sampling Approach for LLM Mathematical Reasoning
Gisang Lee
Sangwoo Park
Junyoung Park
Andrew Chung
Sieun Park
Yoonah Park
Byungju Kim
Min-gyu Cho
LRM
27
1
0
13 Oct 2024
Society of Medical Simplifiers
Society of Medical Simplifiers
Chen Lyu
Gabriele Pergola
MedIm
31
0
0
12 Oct 2024
Diversity of Thought Elicits Stronger Reasoning Capabilities in Multi-Agent Debate Frameworks
Diversity of Thought Elicits Stronger Reasoning Capabilities in Multi-Agent Debate Frameworks
Mahmood Hegazy
LLMAG
LRM
AI4CE
40
0
0
10 Oct 2024
Optima: Optimizing Effectiveness and Efficiency for LLM-Based
  Multi-Agent System
Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System
Weize Chen
Jiarui Yuan
Chen Qian
Cheng Yang
Zhiyuan Liu
Maosong Sun
LLMAG
36
4
0
10 Oct 2024
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
Yougang Lyu
Lingyong Yan
Zihan Wang
Dawei Yin
Pengjie Ren
Maarten de Rijke
Z. Z. Ren
63
6
0
10 Oct 2024
ReIFE: Re-evaluating Instruction-Following Evaluation
ReIFE: Re-evaluating Instruction-Following Evaluation
Yixin Liu
Kejian Shi
Alexander R. Fabbri
Yilun Zhao
Peifeng Wang
Chien-Sheng Wu
Shafiq Joty
Arman Cohan
30
6
0
09 Oct 2024
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for
  Enhanced Following of Instructions with Multiple Constraints
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints
Thomas Palmeira Ferraz
Kartik Mehta
Yu-Hsiang Lin
Haw-Shiuan Chang
Shereen Oraby
Sijia Liu
Vivek Subramanian
Tagyoung Chung
Mohit Bansal
Nanyun Peng
56
8
0
09 Oct 2024
Learning How Hard to Think: Input-Adaptive Allocation of LM Computation
Learning How Hard to Think: Input-Adaptive Allocation of LM Computation
Mehul Damani
Idan Shenfeld
Andi Peng
Andreea Bobu
Jacob Andreas
41
16
0
07 Oct 2024
Leveraging Large Language Models for Suicide Detection on Social Media
  with Limited Labels
Leveraging Large Language Models for Suicide Detection on Social Media with Limited Labels
Vy Nguyen
Chau Pham
ALM
AI4MH
37
2
0
06 Oct 2024
MindScope: Exploring cognitive biases in large language models through
  Multi-Agent Systems
MindScope: Exploring cognitive biases in large language models through Multi-Agent Systems
Zhentao Xie
Jiabao Zhao
Yilei Wang
Jinxin Shi
Yanhong Bai
Xingjiao Wu
Liang He
LLMAG
28
0
0
06 Oct 2024
Persona Knowledge-Aligned Prompt Tuning Method for Online Debate
Persona Knowledge-Aligned Prompt Tuning Method for Online Debate
Chunkit Chan
Cheng Jiayang
Xin Liu
Yauwai Yim
Yuxin Jiang
Zheye Deng
Haoran Li
Yangqiu Song
Ginny Wong
Simon See
39
0
0
05 Oct 2024
Are Expert-Level Language Models Expert-Level Annotators?
Are Expert-Level Language Models Expert-Level Annotators?
Yu-Min Tseng
Wei-Lin Chen
Chung-Chi Chen
Hsin-Hsi Chen
ALM
39
1
0
04 Oct 2024
Cut the Crap: An Economical Communication Pipeline for LLM-based
  Multi-Agent Systems
Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems
Guibin Zhang
Xinfeng Li
Zhixun Li
Sukwon Yun
Guancheng Wan
Kun Wang
Dawei Cheng
Jeffrey Xu Yu
Tianlong Chen
34
9
0
03 Oct 2024
Collective Critics for Creative Story Generation
Collective Critics for Creative Story Generation
Minwook Bae
Hyounghun Kim
34
2
0
03 Oct 2024
Zodiac: A Cardiologist-Level LLM Framework for Multi-Agent Diagnostics
Zodiac: A Cardiologist-Level LLM Framework for Multi-Agent Diagnostics
Yuan Zhou
Peng Zhang
Mengya Song
Alice Zheng
Yiwen Lu
Zhiheng Liu
Yong Chen
Zhaohan Xi
LM&MA
38
1
0
02 Oct 2024
Integrative Decoding: Improve Factuality via Implicit Self-consistency
Integrative Decoding: Improve Factuality via Implicit Self-consistency
Yi Cheng
Xiao Liang
Yeyun Gong
Wen Xiao
Song Wang
...
Wenjie Li
Jian Jiao
Qi Chen
Peng Cheng
Wayne Xiong
HILM
59
1
0
02 Oct 2024
TypedThinker: Diversify Large Language Model Reasoning with Typed Thinking
TypedThinker: Diversify Large Language Model Reasoning with Typed Thinking
Danqing Wang
Jianxin Ma
Fei Fang
Lei Li
LLMAG
LRM
190
0
0
02 Oct 2024
Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and
  Reliability
Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and Reliability
Weitong Zhang
Chengqi Zang
Bernhard Kainz
33
0
0
01 Oct 2024
From Facts to Insights: A Study on the Generation and Evaluation of
  Analytical Reports for Deciphering Earnings Calls
From Facts to Insights: A Study on the Generation and Evaluation of Analytical Reports for Deciphering Earnings Calls
Tomas Goldsack
Yang Wang
Chenghua Lin
Chung-Chi Chen
20
2
0
01 Oct 2024
Interactive Speculative Planning: Enhance Agent Efficiency through
  Co-design of System and User Interface
Interactive Speculative Planning: Enhance Agent Efficiency through Co-design of System and User Interface
Wenyue Hua
Mengting Wan
Shashank Vadrevu
Ryan Nadel
Yongfeng Zhang
Chi Wang
LLMAG
39
1
0
30 Sep 2024
Data Analysis in the Era of Generative AI
Data Analysis in the Era of Generative AI
J. Inala
Chenglong Wang
Steven Drucker
Gonzalo Ramos
Victor C. Dibia
N. Riche
Dave Brown
Dan Marshall
Jianfeng Gao
29
8
0
27 Sep 2024
Attention Prompting on Image for Large Vision-Language Models
Attention Prompting on Image for Large Vision-Language Models
Runpeng Yu
Weihao Yu
Xinchao Wang
VLM
43
6
0
25 Sep 2024
Training Language Models to Win Debates with Self-Play Improves Judge
  Accuracy
Training Language Models to Win Debates with Self-Play Improves Judge Accuracy
Samuel Arnesen
David Rein
Julian Michael
ELM
38
3
0
25 Sep 2024
Evaluating and Enhancing Large Language Models for Novelty Assessment in
  Scholarly Publications
Evaluating and Enhancing Large Language Models for Novelty Assessment in Scholarly Publications
Ethan Lin
Zhiyuan Peng
Yi Fang
149
4
0
25 Sep 2024
COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models
COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models
Kehui Liu
Zixin Tang
Dong Wang
Zihan Wang
Bin Zhao
Bin Zhao
38
11
0
23 Sep 2024
GroupDebate: Enhancing the Efficiency of Multi-Agent Debate Using Group
  Discussion
GroupDebate: Enhancing the Efficiency of Multi-Agent Debate Using Group Discussion
Tongxuan Liu
Xingyu Wang
Weizhe Huang
Wenjiang Xu
Yuting Zeng
Lei Jiang
Hailong Yang
Jing Li
LLMAG
44
8
0
21 Sep 2024
MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for
  Reasoning
MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning
Justin Chih-Yao Chen
Archiki Prasad
Swarnadeep Saha
Elias Stengel-Eskin
Joey Tianyi Zhou
LRM
40
8
0
18 Sep 2024
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Zayne Sprague
Fangcong Yin
Juan Diego Rodriguez
Dongwei Jiang
Manya Wadhwa
Prasann Singhal
Xinyu Zhao
Xi Ye
Kyle Mahowald
Greg Durrett
ReLM
LRM
122
88
0
18 Sep 2024
Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent
Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent
Fatemeh Haji
Mazal Bethany
Maryam Tabar
Jason Chiang
Anthony Rios
Peyman Najafirad
LLMAG
LRM
AI4CE
40
4
0
17 Sep 2024
Towards Agentic AI on Particle Accelerators
Towards Agentic AI on Particle Accelerators
Antonin Sulc
Thorsten Hellert
Raimund Kammering
Hayden Houscher
Jason St. John
35
2
0
10 Sep 2024
LLM-based multi-agent poetry generation in non-cooperative environments
LLM-based multi-agent poetry generation in non-cooperative environments
Ran Zhang
Steffen Eger
LLMAG
37
5
0
05 Sep 2024
LoraMap: Harnessing the Power of LoRA Connections
LoraMap: Harnessing the Power of LoRA Connections
Hyeryun Park
Jeongwon Kwak
Dongsuk Jang
Sumin Park
Jinwook Choi
MoMe
33
0
0
29 Aug 2024
Into the Unknown Unknowns: Engaged Human Learning through Participation
  in Language Model Agent Conversations
Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations
Yucheng Jiang
Yijia Shao
Dekun Ma
Sina J. Semnani
Monica S. Lam
LLMAG
40
15
0
27 Aug 2024
The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic
  Preference Optimization Dataset Generation
The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation
Samee Arif
Sualeha Farid
Abdul Hameed Azeemi
Awais Athar
Agha Ali Raza
LLMAG
24
7
0
16 Aug 2024
Automated Design of Agentic Systems
Automated Design of Agentic Systems
Shengran Hu
Cong Lu
Jeff Clune
AI4CE
45
41
0
15 Aug 2024
AutoGen Studio: A No-Code Developer Tool for Building and Debugging
  Multi-Agent Systems
AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems
Victor C. Dibia
Jingya Chen
Gagan Bansal
Suff Syed
Adam Fourney
Erkang Zhu
Chi Wang
Saleema Amershi
LLMAG
35
5
0
09 Aug 2024
Can LLMs Beat Humans in Debating? A Dynamic Multi-agent Framework for
  Competitive Debate
Can LLMs Beat Humans in Debating? A Dynamic Multi-agent Framework for Competitive Debate
Yiqun Zhang
Xiaocui Yang
Shi Feng
Daling Wang
Yifei Zhang
Kaisong Song
LLMAG
40
4
0
08 Aug 2024
Jailbreaking Text-to-Image Models with LLM-Based Agents
Jailbreaking Text-to-Image Models with LLM-Based Agents
Yingkai Dong
Zheng Li
Xiangtao Meng
Ning Yu
Shanqing Guo
LLMAG
45
13
0
01 Aug 2024
Improving Faithfulness of Large Language Models in Summarization via
  Sliding Generation and Self-Consistency
Improving Faithfulness of Large Language Models in Summarization via Sliding Generation and Self-Consistency
Taiji Li
Zhi Li
Yin Zhang
HILM
38
6
0
31 Jul 2024
Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering
Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering
Danfeng Guo
Sumitaka Honji
LRM
79
0
0
31 Jul 2024
CityX: Controllable Procedural Content Generation for Unbounded 3D
  Cities
CityX: Controllable Procedural Content Generation for Unbounded 3D Cities
Shougao Zhang
Mengqi Zhou
Yuxi Wang
Chuanchen Luo
Rongyu Wang
Yiwei Li
Xucheng Yin
Zhaoxiang Zhang
Junran Peng
43
7
0
24 Jul 2024
Previous
12345...8910
Next