Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

24 November 2019
Kaiqing Zhang
Zhuoran Yang
Tamer Basar

Papers citing "Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms"

50 / 169 papers shown
Title
Artificial Collective Intelligence Engineering: a Survey of Concepts and Perspectives
Roberto Casadei
AI4CE
18
15
0
11 Apr 2023
The challenge of redundancy on multi-agent value factorisation
Siddarth S. Singh
Benjamin Rosman
36
1
0
28 Mar 2023
Boundary-aware Supervoxel-level Iteratively Refined Interactive 3D Image Segmentation with Multi-agent Reinforcement Learning
Chaofan Ma
Qisen Xu
Xiangfeng Wang
Bo Jin
Xiaoyun Zhang
Yanfeng Wang
Ya-Qin Zhang
32
22
0
19 Mar 2023
A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum Markov Games
Anna Winnicki
R. Srikant
34
1
0
17 Mar 2023
Decentralized Multi-Agent Reinforcement Learning for Continuous-Space Stochastic Games
Awni Altabaa
Bora Yongacoglu
S. Yüksel
28
3
0
16 Mar 2023
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play
Wei Xi
Yongxin Zhang
Changnan Xiao
Xuefeng Huang
Shihong Deng
Haowei Liang
Jie Chen
Peng Sun
OffRL
42
8
0
07 Mar 2023
Distributed Learning Meets 6G: A Communication and Computing Perspective
Shashank Jere
Yifei Song
Yuhao Yi
Lingjia Liu
23
10
0
02 Mar 2023
Graph Attention Multi-Agent Fleet Autonomy for Advanced Air Mobility
Malintha Fernando
Ransalu Senanayake
Heeyoul Choi
Martin Swany
37
4
0
14 Feb 2023
A Theory of Mind Approach as Test-Time Mitigation Against Emergent Adversarial Communication
Nancirose Piazza
Vahid Behzadan
AAML
27
7
0
14 Feb 2023
Universal Agent Mixtures and the Geometry of Intelligence
S. Alexander
David Quarel
Len Du
Marcus Hutter
18
1
0
13 Feb 2023
Graph Learning Based Decision Support for Multi-Aircraft Take-Off and Landing at Urban Air Mobility Vertiports
Prajit K. Kumar
Jhoel Witter
Steve Paul
Karthik Dantu
Souma Chowdhury
14
3
0
12 Feb 2023
Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
Volodymyr Tkachuk
Seyed Alireza Bakhtiari
Johannes Kirschner
Matej Jusup
Ilija Bogunovic
Csaba Szepesvári
24
4
0
08 Feb 2023
Generalization of Deep Reinforcement Learning for Jammer-Resilient Frequency and Power Allocation
Swatantra Kafle
Jithin Jagannath
Zackary Kane
Noor Biswas
P. Kumar
Anu Jagannath
23
1
0
04 Feb 2023
Deep Reinforcement Learning for Traffic Light Control in Intelligent Transportation Systems
Xiao-Yang Liu
Ming Zhu
S. Borst
A. Elwalid
38
8
0
04 Feb 2023
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback
Yunchang Yang
Hangshi Zhong
Tianhao Wu
B. Liu
Liwei Wang
S. Du
OffRL
27
8
0
03 Feb 2023
Best Possible Q-Learning
Jiechuan Jiang
Zongqing Lu
OffRL
20
5
0
02 Feb 2023
A Deep Reinforcement Learning Framework for Optimizing Congestion Control in Data Centers
Shiva Ketabi
Hongkai Chen
Haiwei Dong
Y. Ganjali
16
0
0
29 Jan 2023
Online Learning in Stackelberg Games with an Omniscient Follower
Geng Zhao
Banghua Zhu
Jiantao Jiao
Michael I. Jordan
35
14
0
27 Jan 2023
Effect of Swarm Density on Collective Tracking Performance
H. L. Kwa
J. Philippot
Roland Bouffanais
14
9
0
25 Jan 2023
Heterogeneous Multi-Robot Reinforcement Learning
Matteo Bettini
Ajay Shankar
Amanda Prorok
22
40
0
17 Jan 2023
Approximate Information States for Worst-Case Control and Learning in Uncertain Systems
Aditya Dave
N. Venkatesh
Andreas A. Malikopoulos
29
7
0
12 Jan 2023
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Chao Yu
Xinyi Yang
Jiaxuan Gao
Jiayu Chen
Yunfei Li
...
Yunfei Xiang
Rui Huang
Huazhong Yang
Yi Wu
Yu Wang
33
35
0
09 Jan 2023
Asynchronous Hybrid Reinforcement Learning for Latency and Reliability Optimization in the Metaverse over Wireless Communications
Wen-li Yu
Terence Jie Chua
Jun Zhao
OffRL
16
20
0
30 Dec 2022
Enhancing Cyber Resilience of Networked Microgrids using Vertical Federated Reinforcement Learning
Sayak Mukherjee
Ramij-Raja Hossain
Yuan Liu
W. Du
Veronica Adetola
Sheik M. Mohiuddin
Qiuhua Huang
Tianzhixi Yin
Ankit Singhal
23
4
0
17 Dec 2022
Multi-Agent Dynamic Pricing in a Blockchain Protocol Using Gaussian Bandits
Alexis Asseman
Tomasz Kornuta
Aniruth Patel
Matt Deible
Sam Green
11
0
0
13 Dec 2022
DACOM: Learning Delay-Aware Communication for Multi-Agent Reinforcement Learning
Tingting Yuan
Hwei-Ming Chung
Jie Yuan
Xiaoming Fu
24
13
0
03 Dec 2022
A Hierarchical Approach for Strategic Motion Planning in Autonomous Racing
Rudolf Reiter
Jasper Hoffmann
Joschka Boedecker
Moritz Diehl
24
13
0
03 Dec 2022
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning
Yizhou Zhang
Guannan Qu
Pan Xu
Yiheng Lin
Zaiwei Chen
Adam Wierman
34
25
0
30 Nov 2022
A survey on multi-player bandits
Etienne Boursier
Vianney Perchet
24
12
0
29 Nov 2022
CLAS: Coordinating Multi-Robot Manipulation with Central Latent Action Spaces
Elie Aljalbout
Maximilian Karl
Patrick van der Smagt
23
5
0
28 Nov 2022
A Reinforcement Learning Approach for Process Parameter Optimization in Additive Manufacturing
Susheel Dharmadhikari
Nandana Menon
A. Basak
OffRL
AI4CE
16
27
0
17 Nov 2022
Parallel Automatic History Matching Algorithm Using Reinforcement Learning
Omar S. Alolayan
Abdullah O. Alomar
John R. Williams
23
6
0
14 Nov 2022
Decentralized Policy Optimization
Kefan Su
Zongqing Lu
11
8
0
06 Nov 2022
A Survey on Reinforcement Learning in Aviation Applications
Pouria Razzaghi
Amin Tabrizian
Wei Guo
Shulu Chen
Abenezer Taye
Ellis E. Thompson
Alexis Bregeon
Ali Baheri
Peng Wei
OffRL
23
52
0
03 Nov 2022
SOCIALMAPF: Optimal and Efficient Multi-Agent Path Finding with Strategic Agents for Social Navigation
Rohan Chandra
Rahul Maligi
Arya Anantula
Joydeep Biswas
31
20
0
15 Oct 2022
Multi-agent Dynamic Algorithm Configuration
Ke Xue
Jiacheng Xu
Lei Yuan
M. Li
Chao Qian
Zongzhang Zhang
Yang Yu
37
29
0
13 Oct 2022
Spatial-Temporal-Aware Safe Multi-Agent Reinforcement Learning of Connected Autonomous Vehicles in Challenging Scenarios
Zhili Zhang
Songyang Han
Jiangwei Wang
Fei Miao
35
19
0
05 Oct 2022
A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games
Wei Xiong
Han Zhong
Chengshuai Shi
Cong Shen
Tong Zhang
66
18
0
04 Oct 2022
$O(T^{-1})$ Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov Games
Yuepeng Yang
Cong Ma
37
14
0
26 Sep 2022
MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning
Kefan Su
Siyuan Zhou
Jiechuan Jiang
Chuang Gan
Xiangjun Wang
Zongqing Lu
OffRL
33
6
0
17 Sep 2022
Scalable Task-Driven Robotic Swarm Control via Collision Avoidance and Learning Mean-Field Control
Kai Cui
Mengguang Li
Christian Fabian
Heinz Koeppl
AI4CE
37
5
0
15 Sep 2022
A New Approach to Training Multiple Cooperative Agents for Autonomous Driving
Ruiyang Yang
Siheng Li
Beihong Jin
21
0
0
05 Sep 2022
Minimax-Optimal Multi-Agent RL in Markov Games With a Generative Model
Gen Li
Yuejie Chi
Yuting Wei
Yuxin Chen
32
18
0
22 Aug 2022
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games
Fivos Kalogiannis
Ioannis Anagnostides
Ioannis Panageas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Vaggos Chatziafratis
S. Stavroulakis
33
13
0
03 Aug 2022
Regret Minimization and Convergence to Equilibria in General-sum Markov Games
Liad Erez
Tal Lancewicki
Uri Sherman
Tomer Koren
Yishay Mansour
40
25
0
28 Jul 2022
Cooperative Actor-Critic via TD Error Aggregation
Martin Figura
Yixuan Lin
Ji Liu
V. Gupta
23
1
0
25 Jul 2022
Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions
Shuang Qiu
Xiaohan Wei
Jieping Ye
Zhaoran Wang
Zhuoran Yang
OffRL
27
11
0
25 Jul 2022
Scalable Model-based Policy Optimization for Decentralized Networked Systems
Yali Du
Chengdong Ma
Yuchen Liu
Runji Lin
Hao Dong
Jun Wang
Yaodong Yang
31
8
0
13 Jul 2022
Learning-based Autonomous Channel Access in the Presence of Hidden Terminals
Yulin Shao
Yucheng Cai
Taotao Wang
Ziyang Guo
Peng Liu
Jiajun Luo
Deniz Gunduz
23
7
0
07 Jul 2022
Hierarchical Dynamic Routing in Complex Networks via Topologically-decoupled and Cooperative Reinforcement Learning Agents
Shiyuan Hu
Shihan Xiao
17
1
0
02 Jul 2022