Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.11485
Cited By
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
30 March 2018
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
Re-assign community
ArXiv
PDF
HTML
Papers citing
"QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning"
50 / 750 papers shown
Title
Safe and Efficient Manoeuvring for Emergency Vehicles in Autonomous Traffic using Multi-Agent Proximal Policy Optimisation
L. Parada
Eduardo Candela
Luís Marques
Panagiotis Angeloudis
14
11
0
31 Oct 2022
Curiosity-Driven Multi-Agent Exploration with Mixed Objectives
Roben Delos Reyes
Kyunghwan Son
Jinhwan Jung
Wan Ju Kang
Yung Yi
34
4
0
29 Oct 2022
Non-Linear Coordination Graphs
Yipeng Kang
Tonghan Wang
Xiao-Ren Wu
Qianlan Yang
Chongjie Zhang
37
9
0
26 Oct 2022
Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning
Ziluo Ding
Wanpeng Zhang
Junpeng Yue
Xiangjun Wang
Tiejun Huang
Zongqing Lu
LLMAG
AI4CE
28
4
0
25 Oct 2022
The Design and Realization of Multi-agent Obstacle Avoidance based on Reinforcement Learning
Enyu Zhao
Chanjuan Liu
Houfu Su
Yang Liu
AI4CE
31
0
0
24 Oct 2022
Classifying Ambiguous Identities in Hidden-Role Stochastic Games with Multi-Agent Reinforcement Learning
Shijie Han
Siyuan Li
Bo An
Wei Zhao
P. Liu
35
0
0
24 Oct 2022
Solving Continuous Control via Q-learning
Tim Seyde
Peter Werner
Wilko Schwarting
Igor Gilitschenski
Martin Riedmiller
Daniela Rus
Markus Wulfmeier
OffRL
LRM
37
22
0
22 Oct 2022
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
27
1
0
18 Oct 2022
Learning Control Admissibility Models with Graph Neural Networks for Multi-Agent Navigation
Chenning Yu
Hong-Den Yu
Sicun Gao
42
17
0
17 Oct 2022
PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning
Yiqun Chen
Hangyu Mao
Jiaxin Mao
Shiguang Wu
Tianle Zhang
Bin Zhang
Bin Wang
Hong Chang
OffRL
41
7
0
17 Oct 2022
Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning
Jifeng Hu
Yanchao Sun
Hechang Chen
Sili Huang
Haiyin Piao
Yi-Ju Chang
Lichao Sun
23
5
0
14 Oct 2022
Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations
N. Vadori
Leo Ardon
Sumitra Ganesh
Thomas Spooner
Selim Amrouni
Jared Vann
Mengda Xu
Zeyu Zheng
T. Balch
Manuela Veloso
18
16
0
13 Oct 2022
Multi-agent Dynamic Algorithm Configuration
Ke Xue
Jiacheng Xu
Lei Yuan
Mingxing Li
Chao Qian
Zongzhang Zhang
Yang Yu
37
29
0
13 Oct 2022
Personalized Federated Hypernetworks for Privacy Preservation in Multi-Task Reinforcement Learning
Doseok Jang
La-mei Yan
Lucas Spangher
C. Spanos
33
1
0
13 Oct 2022
Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning
Pedro P. Santos
Diogo S. Carvalho
Miguel Vasco
Alberto Sardinha
Pedro A. Santos
Ana Paiva
Francisco S. Melo
21
1
0
12 Oct 2022
A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning
Arrasy Rahman
Ignacio Carlucho
Niklas Höpner
Stefano V. Albrecht
48
8
0
11 Oct 2022
Human-AI Coordination via Human-Regularized Search and Learning
Hengyuan Hu
David J. Wu
Adam Lerer
Jakob N. Foerster
Noam Brown
19
7
0
11 Oct 2022
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library
Siyi Hu
Yifan Zhong
Minquan Gao
Weixun Wang
Hao Dong
Xiaodan Liang
Zhihui Li
Xiaojun Chang
Yaodong Yang
23
15
0
11 Oct 2022
Learning Explicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning via Polarization Policy Gradient
Wubing Chen
Wenbin Li
Xiao Liu
Shangdong Yang
Yang Gao
48
5
0
10 Oct 2022
ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward
Zixian Ma
Rose E. Wang
Li Fei-Fei
Michael S. Bernstein
Ranjay Krishna
29
16
0
09 Oct 2022
Multi-agent Deep Covering Skill Discovery
Jiayu Chen
Marina Haliem
Tian-Shing Lan
Vaneet Aggarwal
24
0
0
07 Oct 2022
Spatial-Temporal-Aware Safe Multi-Agent Reinforcement Learning of Connected Autonomous Vehicles in Challenging Scenarios
Zhili Zhang
Songyang Han
Jiangwei Wang
Fei Miao
38
19
0
05 Oct 2022
Stateful active facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning
Dianbo Liu
Vedant Shah
Oussama Boussif
Cristian Meo
Anirudh Goyal
Tianmin Shu
Michael C. Mozer
N. Heess
Yoshua Bengio
24
7
0
04 Oct 2022
Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning
Filippos Christianos
Georgios Papoudakis
Stefano V. Albrecht
35
4
0
28 Sep 2022
Multi-Agent Sequential Decision-Making via Communication
Ziluo Ding
Kefan Su
Wei-xing Hong
Liwen Zhu
Tiejun Huang
Zongqing Lu
18
1
0
26 Sep 2022
More Centralized Training, Still Decentralized Execution: Multi-Agent Conditional Policy Factorization
Jiangxing Wang
Deheng Ye
Zongqing Lu
OffRL
47
18
0
26 Sep 2022
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
44
50
0
21 Sep 2022
Rethinking Individual Global Max in Cooperative Multi-Agent Reinforcement Learning
Yi-Te Hong
Yaochu Jin
Yang Tang
19
22
0
20 Sep 2022
MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning
Kefan Su
Siyuan Zhou
Jiechuan Jiang
Chuang Gan
Xiangjun Wang
Zongqing Lu
OffRL
36
6
0
17 Sep 2022
Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL)
Washim Uddin Mondal
Vaneet Aggarwal
S. Ukkusuri
37
4
0
15 Sep 2022
MIXRTs: Toward Interpretable Multi-Agent Reinforcement Learning via Mixing Recurrent Soft Decision Trees
Zichuan Liu
Zichuan Liu
Zhi Wang
Yuanyang Zhu
Chunlin Chen
63
5
0
15 Sep 2022
Graphon Mean-Field Control for Cooperative Multi-Agent Reinforcement Learning
Yuanquan Hu
Xiaoli Wei
Jun Yan
Heng-Wei Zhang
42
8
0
11 Sep 2022
A Survey on Large-Population Systems and Scalable Multi-Agent Reinforcement Learning
Kai Cui
Anam Tahir
Gizem Ekinci
Ahmed Elshamanhory
Yannick Eich
Mengguang Li
Heinz Koeppl
AI4CE
89
15
0
08 Sep 2022
On the Near-Optimality of Local Policies in Large Cooperative Multi-Agent Reinforcement Learning
Washim Uddin Mondal
Vaneet Aggarwal
S. Ukkusuri
28
5
0
07 Sep 2022
Learning Practical Communication Strategies in Cooperative Multi-Agent Reinforcement Learning
Diyi Hu
Chi Zhang
Viktor Prasanna
Krishnamachari
24
2
0
02 Sep 2022
Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction
Taher Jafferjee
Juliusz Ziomek
Tianpei Yang
Zipeng Dai
Jianhong Wang
Matthew E. Taylor
Kun Shao
Jun Wang
D. Mguni
40
0
0
02 Sep 2022
Decentralized Coordination in Partially Observable Queueing Networks
Jiekai Jia
Anam Tahir
Heinz Koeppl
39
1
0
29 Aug 2022
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action Space
Yining Chen
Ke Wang
Guang-hua Song
Xiaohong Jiang
28
3
0
23 Aug 2022
Exploring Task-oriented Communication in Multi-agent System: A Deep Reinforcement Learning Approach
Guojun He
29
0
0
22 Aug 2022
A Policy Resonance Approach to Solve the Problem of Responsibility Diffusion in Multiagent Reinforcement Learning
Qing Fu
Tenghai Qiu
Jianqiang Yi
Zhiqiang Pu
Xiaolin Ai
Wanmai Yuan
24
0
0
16 Aug 2022
Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning in StarCraft
Muhammad Junaid Khan
Syed Hammad Ahmed
G. Sukthankar
26
15
0
15 Aug 2022
Ad Hoc Teamwork in the Presence of Adversaries
Ted Fujimoto
Samrat Chatterjee
A. Ganguly
35
2
0
09 Aug 2022
Multi-agent reinforcement learning for intent-based service assurance in cellular networks
S. K. Perepu
†. JeanP.Martins
Ricardo Souza
Kaushik Dey
11
2
0
07 Aug 2022
Transferable Multi-Agent Reinforcement Learning with Dynamic Participating Agents
Xuting Tang
Jia Xu
Shusen Wang
33
1
0
04 Aug 2022
AACC: Asymmetric Actor-Critic in Contextual Reinforcement Learning
Wangyang Yue
Yuan Zhou
Xiaochuan Zhang
Yuchen Hua
Zhiyuan Wang
Guang Kou
OffRL
9
3
0
03 Aug 2022
Cooperative Actor-Critic via TD Error Aggregation
Martin Figura
Yixuan Lin
Ji Liu
V. Gupta
28
1
0
25 Jul 2022
A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games
Zihan Ding
DiJia Su
Qinghua Liu
Chi Jin
33
3
0
18 Jul 2022
Stochastic Market Games
Kyrill Schmid
Lenz Belzner
Robert Muller
Johannes Tochtermann
Claudia Linnhoff-Popien
14
5
0
15 Jul 2022
K-level Reasoning for Zero-Shot Coordination in Hanabi
Brandon Cui
Hengyuan Hu
Luis Pineda
Jakob N. Foerster
OffRL
LRM
25
33
0
14 Jul 2022
Scalable Model-based Policy Optimization for Decentralized Networked Systems
Yali Du
Chengdong Ma
Yuchen Liu
Runji Lin
Hao Dong
Jun Wang
Yaodong Yang
34
8
0
13 Jul 2022
Previous
1
2
3
...
6
7
8
...
13
14
15
Next