2305.14325
Improving Factuality and Reasoning in Language Models through Multiagent Debate
23 May 2023
Yilun Du
Shuang Li
Antonio Torralba
J. Tenenbaum
Igor Mordatch
LLMAG
LRM
Papers citing
"Improving Factuality and Reasoning in Language Models through Multiagent Debate"
50 / 465 papers shown
Title
Building Machines that Learn and Think with People
Katherine M. Collins
Ilia Sucholutsky
Umang Bhatt
Kartik Chandra
Lionel Wong
...
Mark K. Ho
Vikash K. Mansinghka
Adrian Weller
Joshua B. Tenenbaum
Thomas Griffiths
54
30
0
22 Jul 2024
Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs)
Apurv Verma
Satyapriya Krishna
Sebastian Gehrmann
Madhavan Seshadri
Anu Pradhan
Tom Ault
Leslie Barrett
David Rabinowitz
John Doucette
Nhathai Phan
59
10
0
20 Jul 2024
Internal Consistency and Self-Feedback in Large Language Models: A Survey
Xun Liang
Shichao Song
Zifan Zheng
Hanyu Wang
Qingchen Yu
...
Rong-Hua Li
Peng Cheng
Zhonghao Wang
Zhiyu Li
HILM
LRM
70
25
0
19 Jul 2024
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
Weize Chen
Ziming You
Ran Li
Yitong Guan
Chen Qian
Chenyang Zhao
Cheng Yang
Ruobing Xie
Zhiyuan Liu
Maosong Sun
LLMAG
45
36
0
09 Jul 2024
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
Yangyang Yu
Zhiyuan Yao
Haohang Li
Zhiyang Deng
Yupeng Cao
...
Guojun Xiong
Yueru He
Jimin Huang
Dong Li
Qianqian Xie
AIFin
LLMAG
42
14
0
09 Jul 2024
Automated Justification Production for Claim Veracity in Fact Checking: A Survey on Architectures and Approaches
Islam Eldifrawi
Shengrui Wang
Amine Trabelsi
49
8
0
09 Jul 2024
Collective Innovation in Groups of Large Language Models
Eleni Nisioti
Sebastian Risi
Ida Momennejad
Pierre-Yves Oudeyer
Clément Moulin-Frier
LLMAG
29
3
0
07 Jul 2024
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models
Yuzhe Gu
Ziwei Ji
Wenwei Zhang
Chengqi Lyu
Dahua Lin
Kai Chen
HILM
42
5
0
05 Jul 2024
On scalable oversight with weak LLMs judging strong LLMs
Zachary Kenton
Noah Y. Siegel
János Kramár
Jonah Brown-Cohen
Samuel Albanie
...
Rishabh Agarwal
David Lindner
Yunhao Tang
Noah D. Goodman
Rohin Shah
ELM
43
29
0
05 Jul 2024
VDMA: Video Question Answering with Dynamically Generated Multi-Agents
Noriyuki Kugo
Tatsuya Ishibashi
Kosuke Ono
Yuji Sato
41
1
0
04 Jul 2024
MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control
Yeonji Lee
Sangjun Park
Kyunghyun Cho
Jinyeong Bak
42
1
0
03 Jul 2024
Debate-to-Write: A Persona-Driven Multi-Agent Framework for Diverse Argument Generation
Zhe Hu
Hou Pong Chan
Jing Li
Yu Yin
LLMAG
48
0
0
28 Jun 2024
Autonomous Prompt Engineering in Large Language Models
Daan Kepel
Konstantina Valogianni
LLMAG
48
7
0
25 Jun 2024
On the Transformations across Reward Model, Parameter Update, and In-Context Prompt
Deng Cai
Huayang Li
Tingchen Fu
Siheng Li
Weiwen Xu
...
Leyang Cui
Yan Wang
Lemao Liu
Taro Watanabe
Shuming Shi
KELM
30
2
0
24 Jun 2024
Teaching LLMs to Abstain across Languages via Multilingual Feedback
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Orevaoghene Ahia
Shuyue Stella Li
Vidhisha Balachandran
Sunayana Sitaram
Yulia Tsvetkov
75
4
0
22 Jun 2024
MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate
Alfonso Amayuelas
Xianjun Yang
Antonis Antoniades
Wenyue Hua
Liangming Pan
William Wang
AAML
LLMAG
35
10
0
20 Jun 2024
Adversaries Can Misuse Combinations of Safe Models
Erik Jones
Anca Dragan
Jacob Steinhardt
45
7
0
20 Jun 2024
From Single Agent to Multi-Agent: Improving Traffic Signal Control
Maksim Tislenko
Dmitrii Kisilev
33
0
0
19 Jun 2024
FuseGen: PLM Fusion for Data-generation based Zero-shot Learning
Tianyuan Zou
Yang Liu
Peng Li
Jianqing Zhang
Jingjing Liu
Ya-Qin Zhang
36
3
0
18 Jun 2024
Problem-Solving in Language Model Networks
Ciaran Regan
Alexandre Gournail
Mizuki Oka
LRM
LLMAG
KELM
32
1
0
18 Jun 2024
Improving Multi-Agent Debate with Sparse Communication Topology
Yunxuan Li
Yibing Du
Jiageng Zhang
Le Hou
Peter Grabowski
Yeqing Li
Eugene Ie
LLMAG
36
18
0
17 Jun 2024
Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs
Yi Fang
Moxin Li
Wenjie Wang
Hui Lin
Fuli Feng
LRM
65
5
0
17 Jun 2024
KAOS: Large Model Multi-Agent Operating System
Zhao Zhuo
Rongzhen Li
Kai Liu
Huhai Zou
KaiMao Li
Jie Yu
Tianhao Sun
Qingbo Wu
VLM
LLMAG
49
1
0
17 Jun 2024
SLEGO: A Collaborative Data Analytics System with LLM Recommender for Diverse Users
Siu Lung Ng
Hirad Rezaei
F. Rabhi
31
0
0
17 Jun 2024
AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology
Minh Huynh Nguyen
Thang Phan Chau
Phong X. Nguyen
Nghi D. Q. Bui
37
12
0
16 Jun 2024
From Text to Life: On the Reciprocal Relationship between Artificial Life and Large Language Models
Eleni Nisioti
Claire Glanois
Elias Najarro
Andrew Dai
Elliot Meyerson
J. Pedersen
Laetitia Teodorescu
Conor F. Hayes
Shyam Sudhakaran
Sebastian Risi
AI4CE
LM&Ro
51
3
0
14 Jun 2024
Multi-Agent Software Development through Cross-Team Collaboration
Zhuoyun Du
Chen Qian
Wei Liu
Zihao Xie
Yifei Wang
Yufan Dang
Weize Chen
Cheng Yang
LLMAG
46
19
0
13 Jun 2024
StreamBench: Towards Benchmarking Continuous Improvement of Language Agents
Cheng-Kuang Wu
Zhi Rui Tam
Chieh-Yen Lin
Yun-Nung Chen
Hung-yi Lee
LLMAG
42
7
0
13 Jun 2024
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B
Di Zhang
Xiaoshui Huang
Dongzhan Zhou
Yuqiang Li
Wanli Ouyang
LRM
49
55
0
11 Jun 2024
CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation
Renhao Li
Minghuan Tan
Derek F. Wong
Min Yang
LLMAG
23
1
0
11 Jun 2024
Scaling Large Language Model-based Multi-Agent Collaboration
Chen Qian
Zihao Xie
YiFei Wang
Wei Liu
Yufan Dang
...
Zhuoyun Du
Weize Chen
Cheng Yang
Zhiyuan Liu
Maosong Sun
AI4CE
LLMAG
LM&Ro
66
46
0
11 Jun 2024
Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications
Junlin Wang
Tianyi Yang
Roy Xie
Bhuwan Dhingra
SILM
AAML
36
4
0
10 Jun 2024
Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Junlin Wang
Siddhartha Jain
Dejiao Zhang
Baishakhi Ray
Varun Kumar
Ben Athiwaratkun
41
19
0
10 Jun 2024
Mixture-of-Agents Enhances Large Language Model Capabilities
Junlin Wang
Jue Wang
Ben Athiwaratkun
Ce Zhang
James Zou
LLMAG
AIFin
41
101
0
07 Jun 2024
Open-Endedness is Essential for Artificial Superhuman Intelligence
Edward Hughes
Michael Dennis
Jack Parker-Holder
Feryal M. P. Behbahani
Aditi Mavalankar
Yuge Shi
Tom Schaul
Tim Rocktäschel
LRM
40
22
0
06 Jun 2024
A Survey of Language-Based Communication in Robotics
William Hunt
Sarvapali D. Ramchurn
Mohammad D. Soorati
LM&Ro
65
12
0
06 Jun 2024
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework
Xiaoxi Sun
Jinpeng Li
Yan Zhong
Dongyan Zhao
Rui Yan
LLMAG
HILM
29
5
0
05 Jun 2024
Chain of Agents: Large Language Models Collaborating on Long-Context Tasks
Yusen Zhang
Ruoxi Sun
Yanfei Chen
Tomas Pfister
Rui Zhang
Sercan Ö. Arik
RALM
AI4CE
LLMAG
54
30
0
04 Jun 2024
AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways
Zehang Deng
Yongjian Guo
Changzhou Han
Wanlun Ma
Junwu Xiong
Sheng Wen
Yang Xiang
49
24
0
04 Jun 2024
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
Ryo Kamoi
Yusen Zhang
Nan Zhang
Jiawei Han
Rui Zhang
LRM
50
57
0
03 Jun 2024
Brainstorming Brings Power to Large Language Models of Knowledge Reasoning
Zining Qin
Chenhao Wang
Huiling Qin
Weijia Jia
LRM
45
1
0
02 Jun 2024
Harnessing Business and Media Insights with Large Language Models
Yujia Bao
Ankit Parag Shah
Neeru Narang
Jonathan Rivers
Rajeev Maksey
...
Gyuhak Kim
Dengpan Yin
Don Hejna
Mo Nomeli
Wei Wei
AIFin
59
3
0
02 Jun 2024
ANAH: Analytical Annotation of Hallucinations in Large Language Models
Ziwei Ji
Yuzhe Gu
Wenwei Zhang
Chengqi Lyu
Dahua Lin
Kai-xiang Chen
HILM
56
2
0
30 May 2024
Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions
Ruochen Zhao
Wenxuan Zhang
Yew Ken Chia
Deli Zhao
Lidong Bing
41
9
0
30 May 2024
Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions
Zhe Hu
Tuo Liang
Jing Li
Yiren Lu
Yunlai Zhou
Yiran Qiao
Jing Ma
Yu Yin
55
4
0
29 May 2024
Adaptive In-conversation Team Building for Language Model Agents
Linxin Song
Jiale Liu
Jieyu Zhang
Shaokun Zhang
Ao Luo
Shijian Wang
Qingyun Wu
Chi Wang
LLMAG
71
10
0
29 May 2024
Tool Learning in the Wild: Empowering Language Models as Automatic Tool Agents
Zhengliang Shi
Shen Gao
Xiuyi Chen
Yue Feng
Lingyong Yan
Haibo Shi
Dawei Yin
Zhumin Chen
Suzan Verberne
LLMAG
47
6
0
26 May 2024
Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models
Abhishek Kumar
Robert D Morabito
Sanzhar Umbet
Jad Kabbara
Ali Emami
53
5
0
25 May 2024
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
Minghao Wu
Jiahao Xu
Yulin Yuan
Gholamreza Haffari
Longyue Wang
Weihua Luo
Kaifu Zhang
LLMAG
119
22
0
20 May 2024
Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Xiaoming Shi
Zeming Liu
Li Du
Yuxuan Wang
Hongru Wang
Yuhang Guo
Tong Ruan
Jie Xu
Shaoting Zhang
LM&MA
ELM
43
2
0
17 May 2024