Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games

19 August 2022
Rongjun Qin
Fan Luo
Hong Qian
Yang Yu

Papers citing "Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games"

9 / 9 papers shown
Policy Optimization for Markov Games: Unified Framework and Faster Convergence
Runyu Zhang
Qinghua Liu
Haiquan Wang
Caiming Xiong
Na Li
Yu Bai
49
26
0
06 Jun 2022
Last-iterate Convergence in Extensive-Form Games
Chung-Wei Lee
Christian Kroer
Haipeng Luo
137
40
0
27 Jun 2021
Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization
Shicong Cen
Yuting Wei
Yuejie Chi
83
78
0
31 May 2021
Introduction to Online Convex Optimization
Elad Hazan
OffRL
146
1,927
0
07 Sep 2019
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
133
2,418
0
13 Dec 2018
Emergence of Grounded Compositional Language in Multi-Agent Populations
Igor Mordatch
Pieter Abbeel
LLMAG
113
701
0
15 Mar 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
95
1,339
0
27 Feb 2017
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
131
3,101
0
10 Jun 2016
Deep Reinforcement Learning from Self-Play in Imperfect-Information Games
Johannes Heinrich
David Silver
SSL
48
400
0
03 Mar 2016