Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.01587
Cited By
Scalable and Independent Learning of Nash Equilibrium Policies in
n
n
n
-Player Stochastic Games with Unknown Independent Chains
4 December 2023
Tiancheng Qin
S. Rasoul Etesami
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Scalable and Independent Learning of Nash Equilibrium Policies in $n$-Player Stochastic Games with Unknown Independent Chains"
12 / 12 papers shown
Title
Policy Gradient Methods Find the Nash Equilibrium in N-player General-sum Linear-quadratic Games
B. Hambly
Renyuan Xu
Huining Yang
61
28
0
27 Jul 2021
Decentralized Q-Learning in Zero-sum Markov Games
M. O. Sayin
Kai Zhang
David S. Leslie
Tamer Basar
Asuman Ozdaglar
55
83
0
04 Jun 2021
Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games
Stefanos Leonardos
W. Overman
Ioannis Panageas
Georgios Piliouras
73
123
0
03 Jun 2021
Independent Policy Gradient Methods for Competitive Reinforcement Learning
C. Daskalakis
Dylan J. Foster
Noah Golowich
217
162
0
11 Jan 2021
Fictitious play in zero-sum stochastic games
M. O. Sayin
F. Parise
Asuman Ozdaglar
22
48
0
08 Oct 2020
Efficiently Solving MDPs with Stochastic Mirror Descent
Yujia Jin
Aaron Sidford
54
71
0
28 Aug 2020
Learning Adversarial MDPs with Bandit Feedback and Unknown Transition
Chi Jin
Tiancheng Jin
Haipeng Luo
S. Sra
Tiancheng Yu
75
105
0
03 Dec 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
196
1,218
0
24 Nov 2019
Learning Parametric Closed-Loop Policies for Markov Potential Games
Sergio Valcarcel Macua
Javier Zazo
S. Zazo
64
46
0
03 Feb 2018
Primal-Dual
π
π
π
Learning: Sample Complexity and Sublinear Run Time for Ergodic Markov Decision Problems
Mengdi Wang
147
70
0
17 Oct 2017
Learning in games with continuous action sets and unknown payoff functions
P. Mertikopoulos
Zhengyuan Zhou
59
266
0
25 Aug 2016
Explore no more: Improved high-probability regret bounds for non-stochastic bandits
Gergely Neu
389
185
0
10 Jun 2015
1