Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.14211
Cited By
Regret Minimization and Convergence to Equilibria in General-sum Markov Games
28 July 2022
Liad Erez
Tal Lancewicki
Uri Sherman
Tomer Koren
Yishay Mansour
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Regret Minimization and Convergence to Equilibria in General-sum Markov Games"
35 / 35 papers shown
Title
Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games
Tong Yang
Bo Dai
Lin Xiao
Yuejie Chi
OffRL
90
2
0
13 Feb 2025
Hardness of Independent Learning and Sparse Equilibrium Computation in Markov Games
Dylan J. Foster
Noah Golowich
Sham Kakade
67
10
0
22 Mar 2023
Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence
S. Pattathil
Kai Zhang
Asuman Ozdaglar
58
14
0
23 Oct 2022
On the convergence of policy gradient methods to Nash equilibria in general stochastic games
Angeliki Giannou
Kyriakos Lotidis
P. Mertikopoulos
Emmanouil-Vasileios Vlatakis-Gkaragkounis
77
18
0
17 Oct 2022
Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games
Shicong Cen
Yuejie Chi
S. Du
Lin Xiao
86
38
0
03 Oct 2022
O
(
T
−
1
)
O(T^{-1})
O
(
T
−
1
)
Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov Games
Yuepeng Yang
Cong Ma
57
14
0
26 Sep 2022
Policy Optimization for Markov Games: Unified Framework and Faster Convergence
Runyu Zhang
Qinghua Liu
Haiquan Wang
Caiming Xiong
Na Li
Yu Bai
49
26
0
06 Jun 2022
Uncoupled Learning Dynamics with
O
(
log
T
)
O(\log T)
O
(
lo
g
T
)
Swap Regret in Multiplayer Games
Ioannis Anagnostides
Gabriele Farina
Christian Kroer
Chung-Wei Lee
Haipeng Luo
Tuomas Sandholm
39
29
0
25 Apr 2022
The Complexity of Markov Equilibrium in Stochastic Games
C. Daskalakis
Noah Golowich
Kai Zhang
46
59
0
08 Apr 2022
Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits
Qinghua Liu
Yuanhao Wang
Chi Jin
AAML
42
15
0
14 Mar 2022
Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games
Gabriele Farina
Chung-Wei Lee
Haipeng Luo
Christian Kroer
29
32
0
01 Feb 2022
V-Learning -- A Simple, Efficient, Decentralized Algorithm for Multiagent RL
Chi Jin
Qinghua Liu
Yuanhao Wang
Tiancheng Yu
OffRL
47
132
0
27 Oct 2021
On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning
Weichao Mao
Lin F. Yang
Kai Zhang
Tamer Bacsar
59
57
0
12 Oct 2021
Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games
Weichao Mao
Tamer Basar
52
67
0
12 Oct 2021
When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?
Ziang Song
Song Mei
Yu Bai
86
68
0
08 Oct 2021
Near-Optimal No-Regret Learning in General Games
C. Daskalakis
Maxwell Fishelson
Noah Golowich
49
104
0
16 Aug 2021
Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization
Shicong Cen
Yuting Wei
Yuejie Chi
78
78
0
31 May 2021
Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games
Yulai Zhao
Yuandong Tian
Jason D. Lee
S. Du
OffRL
58
18
0
17 Feb 2021
Last-iterate Convergence of Decentralized Optimistic Gradient Descent/Ascent in Infinite-horizon Competitive Markov Games
Chen-Yu Wei
Chung-Wei Lee
Mengxiao Zhang
Haipeng Luo
42
82
0
08 Feb 2021
Independent Policy Gradient Methods for Competitive Reinforcement Learning
C. Daskalakis
Dylan J. Foster
Noah Golowich
169
161
0
11 Jan 2021
A Sharp Analysis of Model-based Reinforcement Learning with Self-Play
Qinghua Liu
Tiancheng Yu
Yu Bai
Chi Jin
62
122
0
04 Oct 2020
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity
Kai Zhang
Sham Kakade
Tamer Bacsar
Lin F. Yang
86
123
0
15 Jul 2020
Near-Optimal Reinforcement Learning with Self-Play
Yunru Bai
Chi Jin
Tiancheng Yu
121
132
0
22 Jun 2020
Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition
Tiancheng Jin
Haipeng Luo
38
57
0
10 Jun 2020
Optimistic Policy Optimization with Bandit Feedback
Yonathan Efroni
Lior Shani
Aviv A. Rosenberg
Shie Mannor
41
90
0
19 Feb 2020
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
Qiaomin Xie
Yudong Chen
Zhaoran Wang
Zhuoran Yang
109
125
0
17 Feb 2020
A Closer Look at Small-loss Bounds for Bandits with Graph Feedback
Chung-Wei Lee
Haipeng Luo
Mengxiao Zhang
38
24
0
02 Feb 2020
Provably Efficient Exploration in Policy Optimization
Qi Cai
Zhuoran Yang
Chi Jin
Zhaoran Wang
44
278
0
12 Dec 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
153
1,207
0
24 Nov 2019
Solving Discounted Stochastic Two-Player Games with Near-Optimal Time and Sample Complexity
Aaron Sidford
Mengdi Wang
Lin F. Yang
Yinyu Ye
56
70
0
29 Aug 2019
More Adaptive Algorithms for Adversarial Bandits
Chen-Yu Wei
Haipeng Luo
99
182
0
10 Jan 2018
Online Reinforcement Learning in Stochastic Games
Chen-Yu Wei
Yi-Te Hong
Chi-Jen Lu
OffRL
43
120
0
02 Dec 2017
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning
Christoph Dann
Tor Lattimore
Emma Brunskill
69
308
0
22 Mar 2017
Fast Convergence of Regularized Learning in Games
Vasilis Syrgkanis
Alekh Agarwal
Haipeng Luo
Robert Schapire
60
254
0
02 Jul 2015
Optimization, Learning, and Games with Predictable Sequences
Alexander Rakhlin
Karthik Sridharan
76
379
0
08 Nov 2013
1