ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.05162
  4. Cited By
A Review of Cooperation in Multi-agent Learning

A Review of Cooperation in Multi-agent Learning

8 December 2023
Yali Du
Joel Z Leibo
Usman Islam
Richard Willis
P. Sunehag
ArXiv (abs)PDFHTML

Papers citing "A Review of Cooperation in Multi-agent Learning"

50 / 54 papers shown
Title
Quantifying the Self-Interest Level of Markov Social Dilemmas
Quantifying the Self-Interest Level of Markov Social Dilemmas
Richard Willis
Yali Du
Joel Z Leibo
Michael Luck
119
0
0
27 Jan 2025
Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive Games
Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive Games
Fanqi Kong
Yizhe Huang
Song-Chun Zhu
Siyuan Qi
Xue Feng
86
2
0
10 Oct 2024
Enabling Multi-Robot Collaboration from Single-Human Guidance
Enabling Multi-Robot Collaboration from Single-Human Guidance
Zhengran Ji
Lingyu Zhang
Paul Sajda
Boyuan Chen
66
2
0
30 Sep 2024
Generative agent-based modeling with actions grounded in physical,
  social, or digital space using Concordia
Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia
A. Vezhnevets
J. Agapiou
Avia Aharon
Ron Ziv
Jayd Matyas
Edgar A. Duénez-Guzmán
William A. Cunningham
Simon Osindero
Danny Karmon
Joel Z Leibo
LLMAGLM&RoAI4CE
99
50
0
06 Dec 2023
STAS: Spatial-Temporal Return Decomposition for Multi-agent
  Reinforcement Learning
STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning
Sirui Chen
Zhaowei Zhang
Yaodong Yang
Yali Du
74
5
0
15 Apr 2023
Diversity Through Exclusion (DTE): Niche Identification for
  Reinforcement Learning through Value-Decomposition
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
P. Sunehag
A. Vezhnevets
Edgar A. Duénez-Guzmán
Igor Mordach
Joel Z Leibo
55
2
0
02 Feb 2023
Learning to Participate through Trading of Reward Shares
Learning to Participate through Trading of Reward Shares
Michael Kölle
Tim Matheis
Philipp Altmann
Kyrill Schmid
69
8
0
18 Jan 2023
PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI
  Coordination
PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination
Xingzhou Lou
Jiaxian Guo
Junge Zhang
Jun Wang
Kaiqi Huang
Yali Du
48
29
0
16 Jan 2023
Melting Pot 2.0
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duénez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
89
34
0
24 Nov 2022
Towards a Standardised Performance Evaluation Protocol for Cooperative
  MARL
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
82
50
0
21 Sep 2022
Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning
Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning
Michael Bradley Johanson
Edward Hughes
Finbarr Timbers
Joel Z Leibo
70
23
0
13 May 2022
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
J. Kuba
Ruiqing Chen
Munning Wen
Ying Wen
Fanglei Sun
Jun Wang
Yaodong Yang
117
245
0
23 Sep 2021
A learning agent that acquires social norms from public sanctions in
  decentralized multi-agent settings
A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings
Eugene Vinitsky
Raphael Köster
J. Agapiou
Edgar A. Duénez-Guzmán
A. Vezhnevets
Joel Z Leibo
62
41
0
16 Jun 2021
SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning
SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning
Jianhong Wang
Yuan Zhang
Yunjie Gu
Tae-Kyun Kim
OffRLFAtt
72
23
0
31 May 2021
Emergent Prosociality in Multi-Agent Games Through Gifting
Emergent Prosociality in Multi-Agent Games Through Gifting
Woodrow Z. Wang
M. Beliaev
Erdem Biyik
Daniel A. Lazar
Ramtin Pedarsani
Dorsa Sadigh
AI4CE
74
25
0
13 May 2021
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
Chao Yu
Akash Velu
Eugene Vinitsky
Jiaxuan Gao
Yu Wang
Alexandre M. Bayen
Yi Wu
OffRL
163
1,278
0
02 Mar 2021
Emergent Reciprocity and Team Formation from Randomized Uncertain Social
  Preferences
Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences
Bowen Baker
LRM
63
37
0
10 Nov 2020
Model-free conventions in multi-agent reinforcement learning with
  heterogeneous preferences
Model-free conventions in multi-agent reinforcement learning with heterogeneous preferences
Raphael Köster
Kevin R. McKee
Richard Everett
Laura Weidinger
William S. Isaac
Edward Hughes
Edgar A. Duénez-Guzmán
T. Graepel
M. Botvinick
Joel Z Leibo
85
23
0
18 Oct 2020
QPLEX: Duplex Dueling Multi-Agent Q-Learning
QPLEX: Duplex Dueling Multi-Agent Q-Learning
Jianhao Wang
Zhizhou Ren
Terry Liu
Yang Yu
Chongjie Zhang
OffRL
110
457
0
03 Aug 2020
Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning
Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning
Arrasy Rahman
Niklas Höpner
Filippos Christianos
Stefano V. Albrecht
70
58
0
18 Jun 2020
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in
  Cooperative Tasks
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
OffRL
98
233
0
14 Jun 2020
Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning
Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
133
168
0
12 Jun 2020
Scalable Multi-Agent Reinforcement Learning for Networked Systems with
  Average Reward
Scalable Multi-Agent Reinforcement Learning for Networked Systems with Average Reward
Guannan Qu
Yiheng Lin
Adam Wierman
Na Li
81
70
0
11 Jun 2020
"Other-Play" for Zero-Shot Coordination
"Other-Play" for Zero-Shot Coordination
Hengyuan Hu
Adam Lerer
A. Peysakhovich
Jakob N. Foerster
VLMOffRL
182
225
0
06 Mar 2020
Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games
Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games
Edward Hughes
Thomas W. Anthony
Tom Eccles
Joel Z Leibo
David Balduzzi
Yoram Bachrach
78
21
0
27 Feb 2020
Qatten: A General Framework for Cooperative Multiagent Reinforcement
  Learning
Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning
Yaodong Yang
Jianye Hao
B. Liao
Kun Shao
Guangyong Chen
Wulong Liu
Hongyao Tang
OffRL
77
187
0
10 Feb 2020
Decentralized Multi-Agent Reinforcement Learning with Networked Agents:
  Recent Advances
Decentralized Multi-Agent Reinforcement Learning with Networked Agents: Recent Advances
Kai Zhang
Zhuoran Yang
Tamer Basar
65
68
0
09 Dec 2019
On the Utility of Learning about Humans for Human-AI Coordination
On the Utility of Learning about Humans for Human-AI Coordination
Micah Carroll
Rohin Shah
Mark K. Ho
Thomas Griffiths
Sanjit A. Seshia
Pieter Abbeel
Anca Dragan
HAI
71
403
0
13 Oct 2019
Deep Coordination Graphs
Deep Coordination Graphs
Wendelin Bohmer
Vitaly Kurin
Shimon Whiteson
GNN
85
180
0
27 Sep 2019
Google Research Football: A Novel Reinforcement Learning Environment
Google Research Football: A Novel Reinforcement Learning Environment
Karol Kurach
Anton Raichuk
Piotr Stańczyk
Michal Zajac
Olivier Bachem
...
C. Riquelme
Damien Vincent
Marcin Michalski
Olivier Bousquet
Sylvain Gelly
159
407
0
25 Jul 2019
QTRAN: Learning to Factorize with Transformation for Cooperative
  Multi-Agent Reinforcement Learning
QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning
Kyunghwan Son
Daewoo Kim
Wan Ju Kang
D. Hostallero
Yung Yi
OffRL
71
809
0
14 May 2019
Emergent Coordination Through Competition
Emergent Coordination Through Competition
Siqi Liu
Guy Lever
J. Merel
S. Tunyasuvunakool
N. Heess
T. Graepel
112
150
0
19 Feb 2019
Learning to Schedule Communication in Multi-agent Reinforcement Learning
Learning to Schedule Communication in Multi-agent Reinforcement Learning
Daewoo Kim
Sang-chul Moon
D. Hostallero
Wan Ju Kang
Taeyoung Lee
Kyunghwan Son
Yung Yi
74
208
0
05 Feb 2019
The Hanabi Challenge: A New Frontier for AI Research
The Hanabi Challenge: A New Frontier for AI Research
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
...
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
76
355
0
01 Feb 2019
Learning when to Communicate at Scale in Multiagent Cooperative and
  Competitive Tasks
Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
Amanpreet Singh
Tushar Jain
Sainbayar Sukhbaatar
129
244
0
23 Dec 2018
Stable Opponent Shaping in Differentiable Games
Stable Opponent Shaping in Differentiable Games
Alistair Letcher
Jakob N. Foerster
David Balduzzi
Tim Rocktaschel
Shimon Whiteson
128
110
0
20 Nov 2018
TarMAC: Targeted Multi-Agent Communication
TarMAC: Targeted Multi-Agent Communication
Abhishek Das
Théophile Gervet
Joshua Romoff
Dhruv Batra
Devi Parikh
Michael G. Rabbat
Joelle Pineau
104
387
0
26 Oct 2018
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal
Fei Sha
74
755
0
05 Oct 2018
Human-level performance in first-person multiplayer games with
  population-based deep reinforcement learning
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
...
Joel Z Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
OffRL
119
728
0
03 Jul 2018
Learning Attentional Communication for Multi-Agent Cooperation
Learning Attentional Communication for Multi-Agent Cooperation
Jiechuan Jiang
Zongqing Lu
78
488
0
20 May 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent
  Reinforcement Learning
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
166
1,677
0
30 Mar 2018
Fully Decentralized Multi-Agent Reinforcement Learning with Networked
  Agents
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
Kai Zhang
Zhuoran Yang
Han Liu
Tong Zhang
Tamer Basar
110
591
0
23 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
317
8,432
0
04 Jan 2018
Population Based Training of Neural Networks
Population Based Training of Neural Networks
Max Jaderberg
Valentin Dalibard
Simon Osindero
Wojciech M. Czarnecki
Jeff Donahue
...
Tim Green
Iain Dunning
Karen Simonyan
Chrisantha Fernando
Koray Kavukcuoglu
93
744
0
27 Nov 2017
StarCraft II: A New Challenge for Reinforcement Learning
StarCraft II: A New Challenge for Reinforcement Learning
Oriol Vinyals
T. Ewalds
Sergey Bartunov
Petko Georgiev
A. Vezhnevets
...
Anthony Brunasso
David Lawrence
Anders Ekermo
J. Repp
Rodney Tsing
94
874
0
16 Aug 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
583
19,315
0
20 Jul 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
164
4,520
0
07 Jun 2017
Counterfactual Multi-Agent Policy Gradients
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
156
2,090
0
24 May 2017
Stabilising Experience Replay for Deep Multi-Agent Reinforcement
  Learning
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
Nantas Nardelli
Gregory Farquhar
Triantafyllos Afouras
Philip Torr
Pushmeet Kohli
Shimon Whiteson
OffRL
198
599
0
28 Feb 2017
Learning Multiagent Communication with Backpropagation
Learning Multiagent Communication with Backpropagation
Sainbayar Sukhbaatar
Arthur Szlam
Rob Fergus
234
1,150
0
25 May 2016
12
Next