Trust Region-Guided Proximal Policy Optimization

29 January 2019

Papers citing "Trust Region-Guided Proximal Policy Optimization"

Distributed Stochastic Gradient Descent with Staleness: A Stochastic Delay Differential Equation Based Framework | Siyuan Yu, Wei Chen, H. V. Poor | 32 0 0 | 17 Jun 2024
Joint action loss for proximal policy optimization | Xiulei Song, Yi-Fan Jin, Greg Slabaugh, Simon Lucas | 21 0 0 | 26 Jan 2023
Entropy Augmented Reinforcement Learning | Jianfei Ma | 36 0 0 | 19 Aug 2022
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse | James Queeney, I. Paschalidis, Christos G. Cassandras | OffRL | 32 2 0 | 28 Jun 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure | Xing Chen, Dongcui Diao, Hechang Chen, Hengshuai Yao, Haiyin Piao, Zhixiao Sun, Zhiwei Yang, Randy Goebel, Bei Jiang, Yi-Ju Chang | OffRL | 41 8 0 | 20 May 2022
Learning to Constrain Policy Optimization with Virtual Trust Region | Hung Le, Thommen Karimpanal George, Majid Abdolshah, D. Nguyen, Kien Do, Sunil R. Gupta, Svetha Venkatesh | 30 3 0 | 20 Apr 2022
Spatiotemporal Costmap Inference for MPC via Deep Inverse Reinforcement Learning | Keuntaek Lee, David Isele, Evangelos A. Theodorou, S. Bae | 32 24 0 | 17 Jan 2022
Generalized Proximal Policy Optimization with Sample Reuse | James Queeney, I. Paschalidis, Christos G. Cassandras | OffRL | 37 47 0 | 29 Oct 2021
Decaying Clipping Range in Proximal Policy Optimization | Mónika Farsang, Luca Szegletes | OffRL | 8 4 0 | 20 Feb 2021
Differentiable Trust Region Layers for Deep Reinforcement Learning | Fabian Otto, P. Becker, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann | OffRL | 41 19 0 | 22 Jan 2021
Queueing Network Controls via Deep Reinforcement Learning | J. Dai, Mark O. Gluzman | OffRL | 32 50 0 | 31 Jul 2020