ResearchTrend.AI
arXiv: 2301.01649

Attention-Based Recurrence for Multi-Agent Reinforcement Learning under Stochastic Partial Observability

4 January 2023
Thomy Phan
Fabian Ritz
Philipp Altmann
Maximilian Zorn
Jonas Nusslein
Michael Kolle
Thomas Gabor
Claudia Linnhoff-Popien

Papers citing "Attention-Based Recurrence for Multi-Agent Reinforcement Learning under Stochastic Partial Observability"

17 / 17 papers shown
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?
Yihe Zhou
Shunyu Liu
Yunpeng Qing
Kaixuan Chen
Tongya Zheng
Jie Song
Mingli Song
69
20
0
27 May 2023
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Muning Wen
J. Kuba
Runji Lin
Weinan Zhang
Ying Wen
Jun Wang
Yaodong Yang
96
188
0
30 May 2022
A Deeper Understanding of State-Based Critics in Multi-Agent Reinforcement Learning
Xueguang Lyu
Andrea Baisero
Yuchen Xiao
Chris Amato
OffRL
72
16
0
03 Jan 2022
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
Chao Yu
Akash Velu
Eugene Vinitsky
Jiaxuan Gao
Yu Wang
Alexandre M. Bayen
Yi Wu
OffRL
161
1,272
0
02 Mar 2021
QPLEX: Duplex Dueling Multi-Agent Q-Learning
Jianhao Wang
Zhizhou Ren
Terry Liu
Yang Yu
Chongjie Zhang
OffRL
110
457
0
03 Aug 2020
Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Gregory Farquhar
Bei Peng
Shimon Whiteson
127
356
0
18 Jun 2020
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
122
817
0
19 Mar 2020
QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning
Kyunghwan Son
Daewoo Kim
Wan Ju Kang
D. Hostallero
Yung Yi
OffRL
69
809
0
14 May 2019
The StarCraft Multi-Agent Challenge
Mikayel Samvelyan
Tabish Rashid
Christian Schroeder de Witt
Gregory Farquhar
Nantas Nardelli
Tim G. J. Rudner
Chia-Man Hung
Philip Torr
Jakob N. Foerster
Shimon Whiteson
98
958
0
11 Feb 2019
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal
Fei Sha
72
755
0
05 Oct 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
166
1,676
0
30 Mar 2018
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
162
4,520
0
07 Jun 2017
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
156
2,090
0
24 May 2017
Deep Recurrent Q-Learning for Partially Observable MDPs
Matthew J. Hausknecht
Peter Stone
114
1,685
0
23 Jul 2015
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
Kyunghyun Cho
B. V. Merrienboer
Dzmitry Bahdanau
Yoshua Bengio
AI4CE, AIMat
265
6,791
0
03 Sep 2014
MAA*: A Heuristic Search Algorithm for Solving Decentralized POMDPs
Daniel Szer
François Charpillet
S. Zilberstein
77
218
0
04 Jul 2012
Optimal and Approximate Q-value Functions for Decentralized POMDPs
F. Oliehoek
M. Spaan
N. Vlassis
OffRL
116
503
0
31 Oct 2011