1705.08926
Cited By
Counterfactual Multi-Agent Policy Gradients
24 May 2017
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
Papers citing
"Counterfactual Multi-Agent Policy Gradients"
50 / 52 papers shown
Title
Bi-level Mean Field: Dynamic Grouping for Large-Scale MARL
Yuxuan Zheng
Yihe Zhou
Feiyang Xu
Mingli Song
Shunyu Liu
OffRL
56
0
0
10 May 2025
MARFT: Multi-Agent Reinforcement Fine-Tuning
Junwei Liao
Muning Wen
Jun Wang
Weinan Zhang
OffRL
117
5
0
21 Apr 2025
QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
Zhouyang Jiang
Bin Zhang
Airong Wei
Zhiwei Xu
OffRL
131
0
0
17 Apr 2025
An Efficient Approach for Cooperative Multi-Agent Learning Problems
Ángel Aso-Mollar
Eva Onaindia
68
0
0
07 Apr 2025
Reinforcement Learning of Multi-robot Task Allocation for Multi-object Transportation with Infeasible Tasks
Yuma Shida
Tomohiko Jimbo
Tadashi Odashima
Takamitsu Matsubara
74
1
0
20 Feb 2025
Causal Mean Field Multi-Agent Reinforcement Learning
Hao Ma
Zhiqiang Pu
Yi Pan
Boyin Liu
Junlong Gao
Zhenyu Guo
148
0
0
20 Feb 2025
Single-Agent Planning in a Multi-Agent System: A Unified Framework for Type-Based Planners
Fengming Zhu
Fangzhen Lin
124
1
0
13 Feb 2025
Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning
Beining Zhang
Aditya Kapoor
Mingfei Sun
228
0
0
08 Feb 2025
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network
Jijia Liu
Feng Gao
Q. Liao
Chao Yu
Yu Wang
OffRL
138
0
0
01 Feb 2025
PIMAEX: Multi-Agent Exploration through Peer Incentivization
Michael Kolle
Johannes Tochtermann
Julian Schonberger
Gerhard Stenzel
Philipp Altmann
Claudia Linnhoff-Popien
85
0
0
03 Jan 2025
Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game Approach
Johan Peralez
Aurélien Delage
Olivier Buffet
J. Dibangoye
82
1
0
03 Jan 2025
MADiff: Offline Multi-agent Learning with Diffusion Models
Zhengbang Zhu
Minghuan Liu
Liyuan Mao
Bingyi Kang
Minkai Xu
Yong Yu
Stefano Ermon
Weinan Zhang
DiffM
OffRL
154
40
0
03 Jan 2025
OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control
Rohit Bokade
Xiaoning Jin
OffRL
172
0
0
10 Nov 2024
Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive Games
Fanqi Kong
Yizhe Huang
Song-Chun Zhu
Siyuan Qi
Xue Feng
58
2
0
10 Oct 2024
What If We Had Used a Different App? Reliable Counterfactual KPI Analysis in Wireless Systems
Qiushuo Hou
Sangwoo Park
Matteo Zecchin
Yunlong Cai
Guanding Yu
Osvaldo Simeone
436
1
0
30 Sep 2024
Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach
Johan Peralez
Aurélien Delage
Jacopo Castellini
Rafael F. Cunha
J. Dibangoye
83
0
0
23 Aug 2024
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
SyDa
SSL
OnRL
100
9
0
02 Aug 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
120
26
0
05 Jul 2024
The Overcooked Generalisation Challenge
Constantin Ruhdorfer
Matteo Bortoletto
Anna Penzkofer
Andreas Bulling
99
4
0
25 Jun 2024
Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents
John L. Zhou
Weizhe Hong
Jonathan C. Kao
94
0
0
03 Jun 2024
eQMARL: Entangled Quantum Multi-Agent Reinforcement Learning for Distributed Cooperation over Quantum Channels
Alexander C. DeRieux
Walid Saad
100
1
0
24 May 2024
POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning
Chang Huang
Junqiao Zhao
Shatong Zhu
Hongtu Zhou
Chen Ye
T. Feng
Changjun Jiang
109
0
0
13 May 2024
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?
Yihe Zhou
Shunyu Liu
Yunpeng Qing
Kaixuan Chen
Tongya Zheng
Jie Song
Mingli Song
58
20
0
27 May 2023
Explainability in Deep Reinforcement Learning
Alexandre Heuillet
Fabien Couthouis
Natalia Díaz Rodríguez
XAI
182
282
0
15 Aug 2020
Agent Modelling under Partial Observability for Deep Reinforcement Learning
Georgios Papoudakis
Filippos Christianos
Stefano V. Albrecht
67
65
0
16 Jun 2020
The Emergence of Individuality
Jiechuan Jiang
Zongqing Lu
52
40
0
10 Jun 2020
Reward Design in Cooperative Multi-agent Reinforcement Learning for Packet Routing
Hangyu Mao
Zhibo Gong
Zhen Xiao
65
16
0
05 Mar 2020
Efficient Multi-robot Exploration via Multi-head Attention-based Cooperation Strategy
Shuqi Liu
Zhaoxia Wu
45
2
0
05 Nov 2019
Deep Decentralized Reinforcement Learning for Cooperative Control
Florian Köpf
Samuel Tesfazgi
M. Flad
Sören Hohmann
76
2
0
29 Oct 2019
The Multi-Agent Reinforcement Learning in MalmÖ (MARLÖ) Competition
Diego Perez-Liebana
Katja Hofmann
Sharada Mohanty
Noburu Kuno
André Kramer
Sam Devlin
Raluca D. Gaina
Daniel Ionita
LRM
79
35
0
23 Jan 2019
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
162
4,509
0
07 Jun 2017
Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games
Peng Peng
Ying Wen
Yaodong Yang
Quan Yuan
Zhenkun Tang
Haitao Long
Jun Wang
65
335
0
29 Mar 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
Abhishek Das
Satwik Kottur
J. M. F. Moura
Stefan Lee
Dhruv Batra
OffRL
124
425
0
20 Mar 2017
Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability
Shayegan Omidshafiei
Jason Pazis
Chris Amato
Jonathan P. How
J. Vian
132
498
0
17 Mar 2017
Emergence of Grounded Compositional Language in Multi-Agent Populations
Igor Mordatch
Pieter Abbeel
LLMAG
139
704
0
15 Mar 2017
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
Nantas Nardelli
Gregory Farquhar
Triantafyllos Afouras
Philip Torr
Pushmeet Kohli
Shimon Whiteson
OffRL
189
599
0
28 Feb 2017
Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Joel Z Leibo
V. Zambaldi
Marc Lanctot
J. Marecki
T. Graepel
78
611
0
10 Feb 2017
Multi-Agent Cooperation and the Emergence of (Natural) Language
Angeliki Lazaridou
A. Peysakhovich
Marco Baroni
LLMAG
115
434
0
21 Dec 2016
Learning to Play Guess Who? and Inventing a Grounded Language as a Consequence
Emilio Jorge
Mikael Kågebäck
Fredrik D. Johansson
E. Gustavsson
70
67
0
10 Nov 2016
Sample Efficient Actor-Critic with Experience Replay
Ziyu Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
102
762
0
03 Nov 2016
TorchCraft: a Library for Machine Learning Research on Real-Time Strategy Games
Gabriel Synnaeve
Nantas Nardelli
Alex Auvolat
Soumith Chintala
Timothée Lacroix
Zeming Lin
Florian Richoux
Nicolas Usunier
GNN
60
105
0
01 Nov 2016
Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks
Nicolas Usunier
Gabriel Synnaeve
Zeming Lin
Soumith Chintala
72
138
0
10 Sep 2016
Learning Multiagent Communication with Backpropagation
Sainbayar Sukhbaatar
Arthur Szlam
Rob Fergus
227
1,150
0
25 May 2016
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
Yannis Assael
Nando de Freitas
Shimon Whiteson
155
1,614
0
21 May 2016
Multiagent Cooperation and Competition with Deep Reinforcement Learning
Ardi Tampuu
Tambet Matiisen
Dorian Kodelja
Ilya Kuzovkin
Kristjan Korjus
Juhan Aru
Jaan Aru
Raul Vicente
96
865
0
27 Nov 2015
Deep Recurrent Q-Learning for Partially Observable MDPs
Matthew J. Hausknecht
Peter Stone
108
1,685
0
23 Jul 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
107
3,434
0
08 Jun 2015
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
Junyoung Chung
Çağlar Gülçehre
Kyunghyun Cho
Yoshua Bengio
601
12,734
0
11 Dec 2014
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
Kyunghyun Cho
B. V. Merrienboer
Dzmitry Bahdanau
Yoshua Bengio
AI4CE
AIMat
257
6,784
0
03 Sep 2014
Empirically Evaluating Multiagent Learning Algorithms
Erik Zawadzki
A. Lipson
Kevin Leyton-Brown
85
28
0
31 Jan 2014