ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.01587
  4. Cited By
Scalable and Independent Learning of Nash Equilibrium Policies in
  $n$-Player Stochastic Games with Unknown Independent Chains

Scalable and Independent Learning of Nash Equilibrium Policies in nnn-Player Stochastic Games with Unknown Independent Chains

4 December 2023
Tiancheng Qin
S. Rasoul Etesami
ArXiv (abs)PDFHTML

Papers citing "Scalable and Independent Learning of Nash Equilibrium Policies in $n$-Player Stochastic Games with Unknown Independent Chains"

12 / 12 papers shown
Title
Policy Gradient Methods Find the Nash Equilibrium in N-player
  General-sum Linear-quadratic Games
Policy Gradient Methods Find the Nash Equilibrium in N-player General-sum Linear-quadratic Games
B. Hambly
Renyuan Xu
Huining Yang
61
28
0
27 Jul 2021
Decentralized Q-Learning in Zero-sum Markov Games
Decentralized Q-Learning in Zero-sum Markov Games
M. O. Sayin
Kai Zhang
David S. Leslie
Tamer Basar
Asuman Ozdaglar
55
83
0
04 Jun 2021
Global Convergence of Multi-Agent Policy Gradient in Markov Potential
  Games
Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games
Stefanos Leonardos
W. Overman
Ioannis Panageas
Georgios Piliouras
73
123
0
03 Jun 2021
Independent Policy Gradient Methods for Competitive Reinforcement
  Learning
Independent Policy Gradient Methods for Competitive Reinforcement Learning
C. Daskalakis
Dylan J. Foster
Noah Golowich
217
162
0
11 Jan 2021
Fictitious play in zero-sum stochastic games
Fictitious play in zero-sum stochastic games
M. O. Sayin
F. Parise
Asuman Ozdaglar
22
48
0
08 Oct 2020
Efficiently Solving MDPs with Stochastic Mirror Descent
Efficiently Solving MDPs with Stochastic Mirror Descent
Yujia Jin
Aaron Sidford
54
71
0
28 Aug 2020
Learning Adversarial MDPs with Bandit Feedback and Unknown Transition
Learning Adversarial MDPs with Bandit Feedback and Unknown Transition
Chi Jin
Tiancheng Jin
Haipeng Luo
S. Sra
Tiancheng Yu
75
105
0
03 Dec 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and
  Algorithms
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
196
1,218
0
24 Nov 2019
Learning Parametric Closed-Loop Policies for Markov Potential Games
Learning Parametric Closed-Loop Policies for Markov Potential Games
Sergio Valcarcel Macua
Javier Zazo
S. Zazo
64
46
0
03 Feb 2018
Primal-Dual $π$ Learning: Sample Complexity and Sublinear Run Time for
  Ergodic Markov Decision Problems
Primal-Dual πππ Learning: Sample Complexity and Sublinear Run Time for Ergodic Markov Decision Problems
Mengdi Wang
147
70
0
17 Oct 2017
Learning in games with continuous action sets and unknown payoff
  functions
Learning in games with continuous action sets and unknown payoff functions
P. Mertikopoulos
Zhengyuan Zhou
59
266
0
25 Aug 2016
Explore no more: Improved high-probability regret bounds for
  non-stochastic bandits
Explore no more: Improved high-probability regret bounds for non-stochastic bandits
Gergely Neu
389
185
0
10 Jun 2015
1