Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium

17 February 2020

Papers citing "Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium"

50 / 92 papers shown

Title
The Lagrangian Method for Solving Constrained Markov Games Soham Das Santiago Paternain Luiz F. O. Chamon Ceyhun Eksin 47 0 0 13 Mar 2025
Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games Tong Yang Bo Dai Lin Xiao Yuejie Chi OffRL 61 2 0 13 Feb 2025
Model Selection for Average Reward RL with Application to Utility Maximization in Repeated Games Alireza Masoumian James R. Wright 57 1 0 09 Nov 2024
Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient Algorithms Thanh Nguyen-Tang Raman Arora 74 1 0 01 Nov 2024
The Bandit Whisperer: Communication Learning for Restless Bandits Yunfan Zhao Tonghan Wang Dheeraj M. Nagaraj Aparna Taneja Milind Tambe 49 5 0 11 Aug 2024
Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies Alex DeWeese Guannan Qu 32 2 0 10 Jun 2024
Taming Equilibrium Bias in Risk-Sensitive Multi-Agent Reinforcement Learning Yingjie Fei Ruitu Xu 33 0 0 04 May 2024
Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning Qiaosheng Zhang Chenjia Bai Shuyue Hu Zhen Wang Xuelong Li 39 1 0 30 Apr 2024
Differentially Private Reinforcement Learning with Self-Play Dan Qiao Yu-Xiang Wang 36 0 0 11 Apr 2024
RL in Markov Games with Independent Function Approximation: Improved Sample Complexity Bound under the Local Access Model Junyi Fan Yuxuan Han Jialin Zeng Jian-Feng Cai Yang Wang Yang Xiang Jiheng Zhang 32 1 0 18 Mar 2024
Corruption-Robust Offline Two-Player Zero-Sum Markov Games Andi Nika Debmalya Mandal Adish Singla Goran Radanović OffRL 34 2 0 04 Mar 2024
Refined Sample Complexity for Markov Games with Independent Linear Function Approximation Yan Dai Qiwen Cui S. S. Du 44 1 0 11 Feb 2024
Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints Dan Qiao Yu-Xiang Wang OffRL 24 3 0 02 Feb 2024
Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization Yunfan Zhao Nikhil Behari Edward Hughes Edwin Zhang Dheeraj M. Nagaraj K. Tuyls Aparna Taneja Milind Tambe 21 7 0 23 Oct 2023
Sample-Efficient Multi-Agent RL: An Optimization Perspective Nuoya Xiong Zhihan Liu Zhaoran Wang Zhuoran Yang 36 1 0 10 Oct 2023
Local and adaptive mirror descents in extensive-form games Côme Fiegel Pierre Ménard Tadashi Kozuno Rémi Munos Vianney Perchet Michal Valko 19 2 0 01 Sep 2023
Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games Songtao Feng Ming Yin Yu-Xiang Wang J. Yang Yitao Liang 34 0 0 17 Aug 2023
Efficient Adversarial Attacks on Online Multi-agent Reinforcement Learning Guanlin Liu Lifeng Lai AAML 38 6 0 15 Jul 2023
Multi-Player Zero-Sum Markov Games with Networked Separable Interactions Chanwoo Park Kaipeng Zhang Asuman Ozdaglar 30 8 0 13 Jul 2023
Theoretical Hardness and Tractability of POMDPs in RL with Partial Online State Information Ming Shi Yingbin Liang Ness B. Shroff 29 2 0 14 Jun 2023
Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning Dongsheng Ding Xiaohan Wei Zhuoran Yang Zhaoran Wang Mihailo R. Jovanović OffRL 32 11 0 31 May 2023
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration Zhihan Liu Miao Lu Wei Xiong Han Zhong Haotian Hu Shenao Zhang Sirui Zheng Zhuoran Yang Zhaoran Wang OffRL 32 22 0 29 May 2023
On the Statistical Efficiency of Mean Field Reinforcement Learning with General Function Approximation Jiawei Huang Batuhan Yardim Niao He 39 10 0 18 May 2023
A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum Markov Games Anna Winnicki R. Srikant 34 1 0 17 Mar 2023
Learning Strategic Value and Cooperation in Multi-Player Stochastic Games through Side Payments Alan Kuhnle J. Richley Darleen Perez-Lavin 31 1 0 09 Mar 2023
Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games with Bandit Feedback Yang Cai Haipeng Luo Chen-Yu Wei Weiqiang Zheng 26 17 0 05 Mar 2023
A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games Zaiwei Chen Kaipeng Zhang Eric Mazumdar Asuman Ozdaglar Adam Wierman 45 6 0 03 Mar 2023
Can We Find Nash Equilibria at a Linear Rate in Markov Games? Zhuoqing Song Jason D. Lee Zhuoran Yang 29 8 0 03 Mar 2023
Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation Yuanhao Wang Qinghua Liu Yunru Bai Chi Jin 24 28 0 13 Feb 2023
Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation Qiwen Cui Kaipeng Zhang S. Du 28 23 0 07 Feb 2023
Population-size-Aware Policy Optimization for Mean-Field Games Pengdeng Li Xinrun Wang Shuxin Li Hau Chan Bo An 21 2 0 07 Feb 2023
Offline Learning in Markov Games with General Function Approximation Yuheng Zhang Yunru Bai Nan Jiang OffRL 18 8 0 06 Feb 2023
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback Yunchang Yang Hangshi Zhong Tianhao Wu B. Liu Liwei Wang S. Du OffRL 27 8 0 03 Feb 2023
Decentralized model-free reinforcement learning in stochastic games with average-reward objective Romain Cravic Nicolas Gast B. Gaujal 29 2 0 13 Jan 2023
Adapting to game trees in zero-sum imperfect information games Côme Fiegel Pierre Ménard Tadashi Kozuno Rémi Munos Vianney Perchet Michal Valko 24 9 0 23 Dec 2022
Provably Efficient Model-free RL in Leader-Follower MDP with Linear Function Approximation A. Ghosh 17 1 0 28 Nov 2022
On the convergence of policy gradient methods to Nash equilibria in general stochastic games Angeliki Giannou Kyriakos Lotidis P. Mertikopoulos Emmanouil-Vasileios Vlatakis-Gkaragkounis 29 17 0 17 Oct 2022
A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games Wei Xiong Han Zhong Chengshuai Shi Cong Shen Tong Zhang 63 18 0 04 Oct 2022
Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games Shicong Cen Yuejie Chi S. Du Lin Xiao 51 35 0 03 Oct 2022
$O(T^{-1})$ Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov Games Yuepeng Yang Cong Ma 37 14 0 26 Sep 2022
Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments Mengxin Yu Zhuoran Yang Jianqing Fan OffRL 15 8 0 23 Aug 2022
Minimax-Optimal Multi-Agent RL in Markov Games With a Generative Model Gen Li Yuejie Chi Yuting Wei Yuxin Chen 32 18 0 22 Aug 2022
Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium C. J. Li Dongruo Zhou Quanquan Gu Michael I. Jordan 21 2 0 10 Aug 2022
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning Shuang Qiu Lingxiao Wang Chenjia Bai Zhuoran Yang Zhaoran Wang SSL OffRL 26 32 0 29 Jul 2022
Regret Minimization and Convergence to Equilibria in General-sum Markov Games Liad Erez Tal Lancewicki Uri Sherman Tomer Koren Yishay Mansour 40 25 0 28 Jul 2022
Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions Shuang Qiu Xiaohan Wei Jieping Ye Zhaoran Wang Zhuoran Yang OffRL 27 11 0 25 Jul 2022
A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games Zihan Ding DiJia Su Qinghua Liu Chi Jin 33 3 0 18 Jul 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games Stephen Marcus McAleer JB Lanier Kevin A. Wang Pierre Baldi Roy Fox T. Sandholm 27 18 0 13 Jul 2022
Provably Efficient Model-Free Constrained RL with Linear Function Approximation A. Ghosh Xingyu Zhou Ness B. Shroff 64 23 0 23 Jun 2022
ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret Stephen Marcus McAleer Gabriele Farina Marc Lanctot T. Sandholm 30 24 0 08 Jun 2022