ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.08926
  4. Cited By
Counterfactual Multi-Agent Policy Gradients
v1v2 (latest)

Counterfactual Multi-Agent Policy Gradients

24 May 2017
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
ArXiv (abs)PDFHTML

Papers citing "Counterfactual Multi-Agent Policy Gradients"

50 / 52 papers shown
Title
Bi-level Mean Field: Dynamic Grouping for Large-Scale MARL
Bi-level Mean Field: Dynamic Grouping for Large-Scale MARL
Yuxuan Zheng
Yihe Zhou
Feiyang Xu
Mingli Song
Shunyu Liu
OffRL
56
0
0
10 May 2025
MARFT: Multi-Agent Reinforcement Fine-Tuning
MARFT: Multi-Agent Reinforcement Fine-Tuning
Junwei Liao
Muning Wen
Jun Wang
Weinan Zhang
OffRL
117
5
0
21 Apr 2025
QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
Zhouyang Jiang
Bin Zhang
Airong Wei
Zhiwei Xu
OffRL
131
0
0
17 Apr 2025
An Efficient Approach for Cooperative Multi-Agent Learning Problems
An Efficient Approach for Cooperative Multi-Agent Learning Problems
Ángel Aso-Mollar
Eva Onaindia
68
0
0
07 Apr 2025
Reinforcement Learning of Multi-robot Task Allocation for Multi-object Transportation with Infeasible Tasks
Reinforcement Learning of Multi-robot Task Allocation for Multi-object Transportation with Infeasible Tasks
Yuma Shida
Tomohiko Jimbo
Tadashi Odashima
Takamitsu Matsubara
74
1
0
20 Feb 2025
Causal Mean Field Multi-Agent Reinforcement Learning
Causal Mean Field Multi-Agent Reinforcement Learning
Hao Ma
Zhiqiang Pu
Yi Pan
Boyin Liu
Junlong Gao
Zhenyu Guo
148
0
0
20 Feb 2025
Single-Agent Planning in a Multi-Agent System: A Unified Framework for Type-Based Planners
Single-Agent Planning in a Multi-Agent System: A Unified Framework for Type-Based Planners
Fengming Zhu
Fangzhen Lin
124
1
0
13 Feb 2025
Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning
Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning
Beining Zhang
Aditya Kapoor
Mingfei Sun
228
0
0
08 Feb 2025
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network
Jijia Liu
Feng Gao
Q. Liao
Chao Yu
Yu Wang
OffRL
138
0
0
01 Feb 2025
PIMAEX: Multi-Agent Exploration through Peer Incentivization
Michael Kolle
Johannes Tochtermann
Julian Schonberger
Gerhard Stenzel
Philipp Altmann
Claudia Linnhoff-Popien
85
0
0
03 Jan 2025
Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game Approach
Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game Approach
Johan Peralez
Aurélien Delage
Olivier Buffet
J. Dibangoye
82
1
0
03 Jan 2025
MADiff: Offline Multi-agent Learning with Diffusion Models
MADiff: Offline Multi-agent Learning with Diffusion Models
Zhengbang Zhu
Minghuan Liu
Liyuan Mao
Bingyi Kang
Minkai Xu
Yong Yu
Stefano Ermon
Weinan Zhang
DiffMOffRL
154
40
0
03 Jan 2025
OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control
OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control
Rohit Bokade
Xiaoning Jin
OffRL
172
0
0
10 Nov 2024
Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive Games
Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive Games
Fanqi Kong
Yizhe Huang
Song-Chun Zhu
Siyuan Qi
Xue Feng
58
2
0
10 Oct 2024
What If We Had Used a Different App? Reliable Counterfactual KPI Analysis in Wireless Systems
What If We Had Used a Different App? Reliable Counterfactual KPI Analysis in Wireless Systems
Qiushuo Hou
Sangwoo Park
Matteo Zecchin
Yunlong Cai
Guanding Yu
Osvaldo Simeone
436
1
0
30 Sep 2024
Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach
Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach
Johan Peralez
Aurélien Delage
Jacopo Castellini
Rafael F. Cunha
J. Dibangoye
83
0
0
23 Aug 2024
A Survey on Self-play Methods in Reinforcement Learning
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDaSSLOnRL
100
9
0
02 Aug 2024
Simplifying Deep Temporal Difference Learning
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
120
26
0
05 Jul 2024
The Overcooked Generalisation Challenge
The Overcooked Generalisation Challenge
Constantin Ruhdorfer
Matteo Bortoletto
Anna Penzkofer
Andreas Bulling
99
4
0
25 Jun 2024
Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents
Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents
John L. Zhou
Weizhe Hong
Jonathan C. Kao
94
0
0
03 Jun 2024
eQMARL: Entangled Quantum Multi-Agent Reinforcement Learning for Distributed Cooperation over Quantum Channels
eQMARL: Entangled Quantum Multi-Agent Reinforcement Learning for Distributed Cooperation over Quantum Channels
Alexander C. DeRieux
Walid Saad
100
1
0
24 May 2024
POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning
POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning
Chang Huang
Junqiao Zhao
Shatong Zhu
Hongtu Zhou
Chen Ye
T. Feng
Changjun Jiang
109
0
0
13 May 2024
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?
Yihe Zhou
Shunyu Liu
Yunpeng Qing
Kaixuan Chen
Tongya Zheng
Jie Song
Mingli Song
58
20
0
27 May 2023
Explainability in Deep Reinforcement Learning
Explainability in Deep Reinforcement Learning
Alexandre Heuillet
Fabien Couthouis
Natalia Díaz Rodríguez
XAI
182
282
0
15 Aug 2020
Agent Modelling under Partial Observability for Deep Reinforcement
  Learning
Agent Modelling under Partial Observability for Deep Reinforcement Learning
Georgios Papoudakis
Filippos Christianos
Stefano V. Albrecht
67
65
0
16 Jun 2020
The Emergence of Individuality
The Emergence of Individuality
Jiechuan Jiang
Zongqing Lu
52
40
0
10 Jun 2020
Reward Design in Cooperative Multi-agent Reinforcement Learning for
  Packet Routing
Reward Design in Cooperative Multi-agent Reinforcement Learning for Packet Routing
Hangyu Mao
Zhibo Gong
Zhen Xiao
65
16
0
05 Mar 2020
Efficient Multi-robot Exploration via Multi-head Attention-based
  Cooperation Strategy
Efficient Multi-robot Exploration via Multi-head Attention-based Cooperation Strategy
Shuqi Liu
Zhaoxia Wu
45
2
0
05 Nov 2019
Deep Decentralized Reinforcement Learning for Cooperative Control
Deep Decentralized Reinforcement Learning for Cooperative Control
Florian Köpf
Samuel Tesfazgi
M. Flad
Sören Hohmann
76
2
0
29 Oct 2019
The Multi-Agent Reinforcement Learning in MalmÖ (MARLÖ) Competition
The Multi-Agent Reinforcement Learning in MalmÖ (MARLÖ) Competition
Diego Perez-Liebana
Katja Hofmann
Sharada Mohanty
Noburu Kuno
André Kramer
Sam Devlin
Raluca D. Gaina
Daniel Ionita
LRM
79
35
0
23 Jan 2019
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
162
4,509
0
07 Jun 2017
Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level
  Coordination in Learning to Play StarCraft Combat Games
Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games
Peng Peng
Ying Wen
Yaodong Yang
Quan Yuan
Zhenkun Tang
Haitao Long
Jun Wang
65
335
0
29 Mar 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement
  Learning
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
Abhishek Das
Satwik Kottur
J. M. F. Moura
Stefan Lee
Dhruv Batra
OffRL
124
425
0
20 Mar 2017
Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under
  Partial Observability
Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability
Shayegan Omidshafiei
Jason Pazis
Chris Amato
Jonathan P. How
J. Vian
132
498
0
17 Mar 2017
Emergence of Grounded Compositional Language in Multi-Agent Populations
Emergence of Grounded Compositional Language in Multi-Agent Populations
Igor Mordatch
Pieter Abbeel
LLMAG
139
704
0
15 Mar 2017
Stabilising Experience Replay for Deep Multi-Agent Reinforcement
  Learning
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
Nantas Nardelli
Gregory Farquhar
Triantafyllos Afouras
Philip Torr
Pushmeet Kohli
Shimon Whiteson
OffRL
189
599
0
28 Feb 2017
Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Joel Z Leibo
V. Zambaldi
Marc Lanctot
J. Marecki
T. Graepel
78
611
0
10 Feb 2017
Multi-Agent Cooperation and the Emergence of (Natural) Language
Multi-Agent Cooperation and the Emergence of (Natural) Language
Angeliki Lazaridou
A. Peysakhovich
Marco Baroni
LLMAG
115
434
0
21 Dec 2016
Learning to Play Guess Who? and Inventing a Grounded Language as a
  Consequence
Learning to Play Guess Who? and Inventing a Grounded Language as a Consequence
Emilio Jorge
Mikael Kågebäck
Fredrik D. Johansson
E. Gustavsson
70
67
0
10 Nov 2016
Sample Efficient Actor-Critic with Experience Replay
Sample Efficient Actor-Critic with Experience Replay
Ziyun Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
102
762
0
03 Nov 2016
TorchCraft: a Library for Machine Learning Research on Real-Time
  Strategy Games
TorchCraft: a Library for Machine Learning Research on Real-Time Strategy Games
Gabriel Synnaeve
Nantas Nardelli
Alex Auvolat
Soumith Chintala
Timothée Lacroix
Zeming Lin
Florian Richoux
Nicolas Usunier
GNN
60
105
0
01 Nov 2016
Episodic Exploration for Deep Deterministic Policies: An Application to
  StarCraft Micromanagement Tasks
Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks
Nicolas Usunier
Gabriel Synnaeve
Zeming Lin
Soumith Chintala
72
138
0
10 Sep 2016
Learning Multiagent Communication with Backpropagation
Learning Multiagent Communication with Backpropagation
Sainbayar Sukhbaatar
Arthur Szlam
Rob Fergus
227
1,150
0
25 May 2016
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
Yannis Assael
Nando de Freitas
Shimon Whiteson
155
1,614
0
21 May 2016
Multiagent Cooperation and Competition with Deep Reinforcement Learning
Multiagent Cooperation and Competition with Deep Reinforcement Learning
Ardi Tampuu
Tambet Matiisen
Dorian Kodelja
Ilya Kuzovkin
Kristjan Korjus
Juhan Aru
Jaan Aru
Raul Vicente
96
865
0
27 Nov 2015
Deep Recurrent Q-Learning for Partially Observable MDPs
Deep Recurrent Q-Learning for Partially Observable MDPs
Matthew J. Hausknecht
Peter Stone
108
1,685
0
23 Jul 2015
High-Dimensional Continuous Control Using Generalized Advantage
  Estimation
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
107
3,434
0
08 Jun 2015
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence
  Modeling
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
Junyoung Chung
Çağlar Gülçehre
Kyunghyun Cho
Yoshua Bengio
601
12,734
0
11 Dec 2014
On the Properties of Neural Machine Translation: Encoder-Decoder
  Approaches
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
Kyunghyun Cho
B. V. Merrienboer
Dzmitry Bahdanau
Yoshua Bengio
AI4CEAIMat
257
6,784
0
03 Sep 2014
Empirically Evaluating Multiagent Learning Algorithms
Empirically Evaluating Multiagent Learning Algorithms
Erik Zawadzki
A. Lipson
Kevin Leyton-Brown
85
28
0
31 Jan 2014
12
Next