Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.00304
Cited By
v1
v2 (latest)
Divergence-Regularized Multi-Agent Actor-Critic
1 October 2021
Kefan Su
Zongqing Lu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Divergence-Regularized Multi-Agent Actor-Critic"
28 / 28 papers shown
Title
Divergence-Augmented Policy Optimization
Qing Wang
Yingru Li
Jiechao Xiong
Tong Zhang
OffRL
164
16
0
28 Jan 2025
Networked Communication for Decentralised Agents in Mean-Field Games
Patrick Benjamin
Alessandro Abate
FedML
151
2
0
05 Jun 2023
Regularized Softmax Deep Multi-Agent
Q
Q
Q
-Learning
L. Pan
Tabish Rashid
Bei Peng
Longbo Huang
Shimon Whiteson
101
36
0
22 Mar 2021
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
Chao Yu
Akash Velu
Eugene Vinitsky
Jiaxuan Gao
Yu Wang
Alexandre M. Bayen
Yi Wu
OffRL
163
1,278
0
02 Mar 2021
DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning
Wei-Fang Sun
Cheng-Kuang Lee
Chun-Yi Lee
OffRL
57
49
0
16 Feb 2021
QPLEX: Duplex Dueling Multi-Agent Q-Learning
Jianhao Wang
Zhizhou Ren
Terry Liu
Yang Yu
Chongjie Zhang
OffRL
110
457
0
03 Aug 2020
Off-Policy Multi-Agent Decomposed Policy Gradients
Yihan Wang
Beining Han
Tonghan Wang
Heng Dong
Chongjie Zhang
93
181
0
24 Jul 2020
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
122
817
0
19 Mar 2020
FACMAC: Factored Multi-Agent Centralised Policy Gradients
Bei Peng
Tabish Rashid
Christian Schroeder de Witt
Pierre-Alexandre Kamienny
Philip Torr
Wendelin Bohmer
Shimon Whiteson
64
263
0
14 Mar 2020
Stable Policy Optimization via Off-Policy Divergence Regularization
Ahmed Touati
Amy Zhang
Joelle Pineau
Pascal Vincent
OffRL
115
17
0
09 Mar 2020
Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning
Yaodong Yang
Jianye Hao
B. Liao
Kun Shao
Guangyong Chen
Wulong Liu
Hongyao Tang
OffRL
77
187
0
10 Feb 2020
If MaxEnt RL is the Answer, What is the Question?
Benjamin Eysenbach
Sergey Levine
66
59
0
04 Oct 2019
Learning Transferable Cooperative Behavior in Multi-Agent Teams
Akshat Agarwal
Sumit Kumar
Katia Sycara
AI4CE
68
119
0
04 Jun 2019
QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning
Kyunghwan Son
Daewoo Kim
Wan Ju Kang
D. Hostallero
Yung Yi
OffRL
71
809
0
14 May 2019
The StarCraft Multi-Agent Challenge
Mikayel Samvelyan
Tabish Rashid
Christian Schroeder de Witt
Gregory Farquhar
Nantas Nardelli
Tim G. J. Rudner
Chia-Man Hung
Philip Torr
Jakob N. Foerster
Shimon Whiteson
98
958
0
11 Feb 2019
Graph Convolutional Reinforcement Learning
Jiechuan Jiang
Chen Dun
Tiejun Huang
Zongqing Lu
GNN
82
338
0
22 Oct 2018
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal
Fei Sha
74
755
0
05 Oct 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
166
1,677
0
30 Mar 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
247
1,609
0
05 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
317
8,432
0
04 Jan 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
580
19,315
0
20 Jul 2017
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
82
107
0
06 Jul 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
164
4,520
0
07 Jun 2017
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
156
2,090
0
24 May 2017
A unified view of entropy-regularized Markov decision processes
Gergely Neu
Anders Jonsson
Vicencc Gómez
104
264
0
22 May 2017
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
174
476
0
28 Feb 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
118
1,350
0
27 Feb 2017
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
283
6,807
0
19 Feb 2015
1