Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

7 June 2017
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
arXiv:1706.02275

Papers citing "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

50 / 1,627 papers shown
Collective Conditioned Reflex: A Bio-Inspired Fast Emergency Reaction Mechanism for Designing Safe Multi-Robot Systems
Bowei He
Zhen Zhao
Wenhao Luo
R. Liu
94
4
0
24 Feb 2022
Comparative analysis of machine learning methods for active flow control
F. Pino
Lorenzo Schena
Jean Rabault
M. A. Mendez
107
44
0
23 Feb 2022
A Decentralized Communication Framework based on Dual-Level Recurrence for Multi-Agent Reinforcement Learning
Jingchen Li
Haobin Shi
Kao-Shing Hwang
34
3
0
22 Feb 2022
Multi-Agent Reinforcement Learning for Network Selection and Resource Allocation in Heterogeneous multi-RAT Networks
Mhd Saria Allahham
A. Abdellatif
N. Mhaisen
Amr M. Mohamed
A. Erbad
Mohsen Guizani
81
32
0
21 Feb 2022
Cooperative Artificial Intelligence
T. Baumann
34
0
0
20 Feb 2022
PooL: Pheromone-inspired Communication Framework for Large Scale Multi-Agent Reinforcement Learning
Zixuan Cao
Mengzhi Shi
Zhanbo Zhao
Xiujun Ma
58
1
0
20 Feb 2022
Shaping Advice in Deep Reinforcement Learning
Baicen Xiao
Bhaskar Ramasubramanian
Radha Poovendran
46
0
0
19 Feb 2022
Communication-Efficient Actor-Critic Methods for Homogeneous Markov Games
Dingyang Chen
Yile Li
Qi Zhang
OffRL
125
11
0
18 Feb 2022
Disentangling Successor Features for Coordination in Multi-agent Reinforcement Learning
Seungchan Kim
Neale Van Stralen
Girish Chowdhary
Huy T. Tran
41
0
0
15 Feb 2022
Learning to Mitigate AI Collusion on Economic Platforms
Gianluca Brero
N. Lepore
Eric Mibuari
David C. Parkes
73
15
0
15 Feb 2022
Motivating Physical Activity via Competitive Human-Robot Interaction
Boling Yang
Golnaz Habibi
Patrick E. Lancaster
Byron Boots
Joshua R. Smith
62
8
0
14 Feb 2022
Individual-Level Inverse Reinforcement Learning for Mean Field Games
Yang Chen
Libo Zhang
Jiamou Liu
Shuyue Hu
AI4CE
82
9
0
13 Feb 2022
Strategy Synthesis for Zero-Sum Neuro-Symbolic Concurrent Stochastic Games
Rui Yan
G. Santos
G. Norman
David Parker
Marta Kwiatkowska
54
9
0
13 Feb 2022
Group-Agent Reinforcement Learning
Kaiyue Wu
Xiaoming Zeng
OOD
OffRL
41
3
0
10 Feb 2022
Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization
Jian Zhao
Yue Zhang
Xu Hu
Weixun Wang
Wen-gang Zhou
Jianye Hao
Jiangcheng Zhu
Houqiang Li
60
4
0
09 Feb 2022
Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence
Dongsheng Ding
Chen-Yu Wei
Jianchao Tan
M. Jovanović
88
69
0
08 Feb 2022
Attacking c-MARL More Effectively: A Data Driven Approach
Nhan H. Pham
Lam M. Nguyen
Jie Chen
Hoang Thanh Lam
Subhro Das
Tsui-Wei Weng
AAML
99
2
0
07 Feb 2022
A Survey on Safety-Critical Driving Scenario Generation -- A Methodological Perspective
Wenhao Ding
Chejian Xu
Mansur Arief
Hao-ming Lin
Yue Liu
Ding Zhao
119
165
0
04 Feb 2022
Adaptive Discrete Communication Bottlenecks with Dynamic Vector Quantization
Dianbo Liu
Alex Lamb
Xu Ji
Pascal Junior Tikeng Notsawo
Michael C. Mozer
Yoshua Bengio
Kenji Kawaguchi
63
16
0
02 Feb 2022
Federated Reinforcement Learning for Collective Navigation of Robotic Swarms
Seongin Na
Tomáš Rouček
Jiří Ulrich
Jan Pikman
T. Krajník
Barry Lennox
F. Arvin
74
34
0
02 Feb 2022
Robustness and Adaptability of Reinforcement Learning based Cooperative Autonomous Driving in Mixed-autonomy Traffic
Rodolfo Valiente
Behrad Toghi
Ramtin Pedarsani
Y. P. Fallah
129
58
0
02 Feb 2022
Trust Region Bounds for Decentralized PPO Under Non-stationarity
Mingfei Sun
Sam Devlin
Jacob Beck
Katja Hofmann
Shimon Whiteson
140
11
0
31 Jan 2022
Learning Collective Action under Risk Diversity
Ramona Merhej
Fernando P. Santos
Francisco S. Melo
Mohamed Chetouani
Francisco C. Santos
32
1
0
30 Jan 2022
Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination
Keane Lucas
R. Allen
103
26
0
28 Jan 2022
Overcoming Exploration: Deep Reinforcement Learning for Continuous Control in Cluttered Environments from Temporal Logic Specifications
Mingyu Cai
Erfan Aasi
C. Belta
C. Vasile
105
25
0
28 Jan 2022
Competition over data: how does data purchase affect users?
Yongchan Kwon
Antonio A. Ginart
James Zou
55
5
0
26 Jan 2022
Multi-Agent Adversarial Attacks for Multi-Channel Communications
Juncheng Dong
Suya Wu
Mohammadreza Soltani
Vahid Tarokh
AAML
70
3
0
22 Jan 2022
Reinforcement Learning Your Way: Agent Characterization through Policy Regularization
Charl Maree
C. Omlin
59
8
0
21 Jan 2022
Interpretable Learned Emergent Communication for Human-Agent Teams
Seth Karten
Mycal Tucker
Huao Li
Siva Kailas
Michael Lewis
Katia Sycara
AI4CE
80
10
0
19 Jan 2022
K-nearest Multi-agent Deep Reinforcement Learning for Collaborative Tasks with a Variable Number of Agents
H. Khorasgani
Haiyan Wang
Hsiu-Khuern Tang
Chetan Gupta
40
3
0
18 Jan 2022
GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning
Jingqing Ruan
Yali Du
Xuantang Xiong
Dengpeng Xing
Xiyun Li
Linghui Meng
Haifeng Zhang
Jun Wang
Bo Xu
89
30
0
17 Jan 2022
Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement Learning
Baicen Xiao
Bhaskar Ramasubramanian
Radha Poovendran
106
6
0
12 Jan 2022
Offsetting Unequal Competition through RL-assisted Incentive Schemes
Paramita Koley
Aurghya Maiti
Sourangshu Bhattacharya
Niloy Ganguly
64
1
0
05 Jan 2022
Conditional Imitation Learning for Multi-Agent Games
Andy Shih
Stefano Ermon
Dorsa Sadigh
88
11
0
05 Jan 2022
Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients
Hanhan Zhou
Tian-Shing Lan
Vaneet Aggarwal
108
32
0
04 Jan 2022
Analyzing Micro-Founded General Equilibrium Models with Many Agents using Deep Reinforcement Learning
Michael J. Curry
Alexander R. Trott
Soham R. Phade
Yunru Bai
Stephan Zheng
90
5
0
03 Jan 2022
A Deeper Understanding of State-Based Critics in Multi-Agent Reinforcement Learning
Xueguang Lyu
Andrea Baisero
Yuchen Xiao
Chris Amato
OffRL
85
16
0
03 Jan 2022
3DPG: Distributed Deep Deterministic Policy Gradient Algorithms for Networked Multi-Agent Systems
Adrian Redder
Arunselvan Ramaswamy
Holger Karl
OffRL
43
2
0
03 Jan 2022
The Introspective Agent: Interdependence of Strategy, Physiology, and Sensing for Embodied Agents
Sarah M Pratt
Luca Weihs
Ali Farhadi
53
2
0
02 Jan 2022
Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation
Mohammad Salimibeni
Arash Mohammadi
Parvin Malekzadeh
Konstantinos N. Plataniotis
55
5
0
30 Dec 2021
Multiagent Model-based Credit Assignment for Continuous Control
Dongge Han
Chris Xiaoxuan Lu
Tomasz P. Michalak
Michael Wooldridge
63
6
0
27 Dec 2021
Learning Cooperative Multi-Agent Policies with Partial Reward Decoupling
B. Freed
Aditya Kapoor
Ian Abraham
J. Schneider
Howie Choset
78
5
0
23 Dec 2021
Local Advantage Networks for Cooperative Multi-Agent Reinforcement Learning
Raphael Avalos
Mathieu Reymond
Ann Nowé
D. Roijers
OffRL
67
6
0
23 Dec 2021
Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination
Rui Zhao
Jinming Song
Yufeng Yuan
Haifeng Hu
Yang Gao
Yi Wu
Zhongqian Sun
Yang Wei
95
69
0
22 Dec 2021
Centralizing State-Values in Dueling Networks for Multi-Robot Reinforcement Learning Mapless Navigation
Enrico Marchesini
Alessandro Farinelli
56
18
0
16 Dec 2021
Learning to Share in Multi-Agent Reinforcement Learning
Yuxuan Yi
G. Li
Yaowei Wang
Zongqing Lu
88
13
0
16 Dec 2021
Learning to Guide and to Be Guided in the Architect-Builder Problem
Paul Barde
Tristan Karch
Derek Nowrouzezahrai
Clément Moulin-Frier
C. Pal
Pierre-Yves Oudeyer
68
5
0
14 Dec 2021
Meta-CPR: Generalize to Unseen Large Number of Agents with Communication Pattern Recognition Module
Wei-Cheng Tseng
Wei Wei
Da-Cheng Juan
Min Sun
92
2
0
14 Dec 2021
Multi-agent Soft Actor-Critic Based Hybrid Motion Planner for Mobile Robots
Zicheng He
Lu Dong
Chunwei Song
Changyin Sun
74
28
0
13 Dec 2021
Federated Reinforcement Learning at the Edge
Konstantinos Gatsis
FedML
68
5
0
11 Dec 2021