ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1903.07940
  4. Cited By
Truly Proximal Policy Optimization

Truly Proximal Policy Optimization

19 March 2019
Yuhui Wang
Hao He
Chao Wen
Xiaoyang Tan
ArXivPDFHTML

Papers citing "Truly Proximal Policy Optimization"

17 / 17 papers shown
Title
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization
Gao Tianci
Dmitriev D. Dmitry
Konstantin A. Neusypin
Yang Bo
Rao Shengren
OffRL
35
1
0
02 Sep 2024
Machine Learning Meets Advanced Robotic Manipulation
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
29
17
0
22 Sep 2023
Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent
  Policy Optimization
Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy Optimization
Mohammad Mehdi Nasiri
M. Rezghi
43
0
0
13 Aug 2023
Joint action loss for proximal policy optimization
Joint action loss for proximal policy optimization
Xiulei Song
Yi-Fan Jin
Greg Slabaugh
Simon Lucas
21
0
0
26 Jan 2023
Entropy Augmented Reinforcement Learning
Entropy Augmented Reinforcement Learning
Jianfei Ma
36
0
0
19 Aug 2022
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to
  Cooperative MARL
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
J. Kuba
Xidong Feng
Shiyao Ding
Hao Dong
Jun Wang
Yaodong Yang
26
16
0
02 Aug 2022
Generalized Policy Improvement Algorithms with Theoretically Supported
  Sample Reuse
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
32
2
0
28 Jun 2022
Good Time to Ask: A Learning Framework for Asking for Help in Embodied
  Visual Navigation
Good Time to Ask: A Learning Framework for Asking for Help in Embodied Visual Navigation
Jenny Zhang
Samson Yu
Jiafei Duan
Cheston Tan
43
4
0
20 Jun 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still
  Insufficient according to an Off-Policy Measure
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
43
8
0
20 May 2022
Proximal Policy Optimization Learning based Control of Congested Freeway
  Traffic
Proximal Policy Optimization Learning based Control of Congested Freeway Traffic
Shurong Mo
Nailong Wu
Jie Qi
Anqi Pan
Zhiguang Feng
Huaicheng Yan
Yueying Wang
17
0
0
12 Apr 2022
Autonomous Drone Swarm Navigation and Multi-target Tracking in 3D
  Environments with Dynamic Obstacles
Autonomous Drone Swarm Navigation and Multi-target Tracking in 3D Environments with Dynamic Obstacles
Suleman Qamar
Dr. Saddam Hussain Khan
Muhammad Arif Arshad
Maryam Qamar
Asifullah Khan
29
16
0
13 Feb 2022
You May Not Need Ratio Clipping in PPO
You May Not Need Ratio Clipping in PPO
Mingfei Sun
Vitaly Kurin
Guoqing Liu
Sam Devlin
Tao Qin
Katja Hofmann
Shimon Whiteson
18
15
0
31 Jan 2022
Mirror Learning: A Unifying Framework of Policy Optimisation
Mirror Learning: A Unifying Framework of Policy Optimisation
J. Kuba
Christian Schroeder de Witt
Jakob N. Foerster
29
24
0
07 Jan 2022
Generalized Proximal Policy Optimization with Sample Reuse
Generalized Proximal Policy Optimization with Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
42
47
0
29 Oct 2021
Offline Reinforcement Learning with Soft Behavior Regularization
Offline Reinforcement Learning with Soft Behavior Regularization
Haoran Xu
Xianyuan Zhan
Jianxiong Li
Honglei Yin
OffRL
31
31
0
14 Oct 2021
Mirror Descent Policy Optimization
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
25
83
0
20 May 2020
Stable Policy Optimization via Off-Policy Divergence Regularization
Stable Policy Optimization via Off-Policy Divergence Regularization
Ahmed Touati
Amy Zhang
Joelle Pineau
Pascal Vincent
OffRL
36
17
0
09 Mar 2020
1