Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.12812
Cited By
Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence
23 October 2022
S. Pattathil
Kaipeng Zhang
Asuman Ozdaglar
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence"
5 / 5 papers shown
Title
Faster WIND: Accelerating Iterative Best-of-
N
N
N
Distillation for LLM Alignment
Tong Yang
Jincheng Mei
H. Dai
Zixin Wen
Shicong Cen
Dale Schuurmans
Yuejie Chi
Bo Dai
45
4
0
20 Feb 2025
Multi-Player Zero-Sum Markov Games with Networked Separable Interactions
Chanwoo Park
Kaipeng Zhang
Asuman Ozdaglar
30
8
0
13 Jul 2023
Regret Minimization and Convergence to Equilibria in General-sum Markov Games
Liad Erez
Tal Lancewicki
Uri Sherman
Tomer Koren
Yishay Mansour
40
25
0
28 Jul 2022
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes
Guanghui Lan
89
136
0
30 Jan 2021
Independent Policy Gradient Methods for Competitive Reinforcement Learning
C. Daskalakis
Dylan J. Foster
Noah Golowich
62
159
0
11 Jan 2021
1