ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1901.10314
  4. Cited By
Trust Region-Guided Proximal Policy Optimization

Trust Region-Guided Proximal Policy Optimization

29 January 2019
Yuhui Wang
Hao He
Xiaoyang Tan
Yaozhong Gan
    OffRL
ArXivPDFHTML

Papers citing "Trust Region-Guided Proximal Policy Optimization"

11 / 11 papers shown
Title
Distributed Stochastic Gradient Descent with Staleness: A Stochastic Delay Differential Equation Based Framework
Distributed Stochastic Gradient Descent with Staleness: A Stochastic Delay Differential Equation Based Framework
Siyuan Yu
Wei Chen
H. V. Poor
32
0
0
17 Jun 2024
Joint action loss for proximal policy optimization
Joint action loss for proximal policy optimization
Xiulei Song
Yi-Fan Jin
Greg Slabaugh
Simon Lucas
21
0
0
26 Jan 2023
Entropy Augmented Reinforcement Learning
Entropy Augmented Reinforcement Learning
Jianfei Ma
36
0
0
19 Aug 2022
Generalized Policy Improvement Algorithms with Theoretically Supported
  Sample Reuse
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
32
2
0
28 Jun 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still
  Insufficient according to an Off-Policy Measure
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
41
8
0
20 May 2022
Learning to Constrain Policy Optimization with Virtual Trust Region
Learning to Constrain Policy Optimization with Virtual Trust Region
Hung Le
Thommen Karimpanal George
Majid Abdolshah
D. Nguyen
Kien Do
Sunil R. Gupta
Svetha Venkatesh
30
3
0
20 Apr 2022
Spatiotemporal Costmap Inference for MPC via Deep Inverse Reinforcement
  Learning
Spatiotemporal Costmap Inference for MPC via Deep Inverse Reinforcement Learning
Keuntaek Lee
David Isele
Evangelos A. Theodorou
S. Bae
32
24
0
17 Jan 2022
Generalized Proximal Policy Optimization with Sample Reuse
Generalized Proximal Policy Optimization with Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
37
47
0
29 Oct 2021
Decaying Clipping Range in Proximal Policy Optimization
Decaying Clipping Range in Proximal Policy Optimization
Mónika Farsang
Luca Szegletes
OffRL
8
4
0
20 Feb 2021
Differentiable Trust Region Layers for Deep Reinforcement Learning
Differentiable Trust Region Layers for Deep Reinforcement Learning
Fabian Otto
P. Becker
Ngo Anh Vien
Hanna Ziesche
Gerhard Neumann
OffRL
41
19
0
22 Jan 2021
Queueing Network Controls via Deep Reinforcement Learning
Queueing Network Controls via Deep Reinforcement Learning
J. Dai
Mark O. Gluzman
OffRL
32
50
0
31 Jul 2020
1