ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.05477
  4. Cited By
Trust Region Policy Optimization
v1v2v3v4v5 (latest)

Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
ArXiv (abs)PDFHTML

Papers citing "Trust Region Policy Optimization"

50 / 2,009 papers shown
Title
Federated Reinforcement Learning: Techniques, Applications, and Open
  Challenges
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges
Jiaju Qi
Qihao Zhou
Lei Lei
Kan Zheng
FedML
113
161
0
26 Aug 2021
MimicBot: Combining Imitation and Reinforcement Learning to win in Bot
  Bowl
MimicBot: Combining Imitation and Reinforcement Learning to win in Bot Bowl
Nicola Pezzotti
65
1
0
21 Aug 2021
Provably Efficient Generative Adversarial Imitation Learning for Online
  and Offline Setting with Linear Function Approximation
Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation
Zhihan Liu
Yufeng Zhang
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
OffRL
67
6
0
19 Aug 2021
Settling the Variance of Multi-Agent Policy Gradients
Settling the Variance of Multi-Agent Policy Gradients
J. Kuba
Muning Wen
Yaodong Yang
Linghui Meng
Shangding Gu
Haifeng Zhang
D. Mguni
Jun Wang
81
67
0
19 Aug 2021
Optimal Actor-Critic Policy with Optimized Training Datasets
Optimal Actor-Critic Policy with Optimized Training Datasets
C. Banerjee
Zhiyong Chen
N. Noman
M. Zamani
OffRL
66
7
0
16 Aug 2021
Safe Learning in Robotics: From Learning-Based Control to Safe
  Reinforcement Learning
Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning
Lukas Brunke
Melissa Greeff
Adam W. Hall
Zhaocong Yuan
Siqi Zhou
Jacopo Panerati
Angela P. Schoellig
OffRL
77
638
0
13 Aug 2021
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos
Yuzhe Qin
Yueh-hua Wu
Shaowei Liu
Hanwen Jiang
Ruihan Yang
Yang Fu
Xiaolong Wang
261
201
0
12 Aug 2021
A Survey on Deep Reinforcement Learning for Data Processing and
  Analytics
A Survey on Deep Reinforcement Learning for Data Processing and Analytics
Qingpeng Cai
Can Cui
Yiyuan Xiong
Wei Wang
Zhongle Xie
Meihui Zhang
OffRL
67
32
0
10 Aug 2021
Deep Reinforcement Learning for Demand Driven Services in Logistics and
  Transportation Systems: A Survey
Deep Reinforcement Learning for Demand Driven Services in Logistics and Transportation Systems: A Survey
Zefang Zong
Tao Feng
Tong Xia
Depeng Jin
Yong Li
62
3
0
10 Aug 2021
VeRLPy: Python Library for Verification of Digital Designs with
  Reinforcement Learning
VeRLPy: Python Library for Verification of Digital Designs with Reinforcement Learning
Aebel Joe Shibu
S. Sadhana
N. Shilpa
Pratyush Kumar
AAML
45
6
0
09 Aug 2021
Active Reinforcement Learning over MDPs
Qi Yang
Peng Yang
K. Tang
103
0
0
05 Aug 2021
A Pragmatic Look at Deep Imitation Learning
A Pragmatic Look at Deep Imitation Learning
Kai Arulkumaran
D. Lillrank
59
9
0
04 Aug 2021
Learning Barrier Certificates: Towards Safe Reinforcement Learning with
  Zero Training-time Violations
Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations
Yuping Luo
Tengyu Ma
OffRL
101
44
0
04 Aug 2021
Offline Decentralized Multi-Agent Reinforcement Learning
Offline Decentralized Multi-Agent Reinforcement Learning
Jiechuan Jiang
Zongqing Lu
OffRL
91
39
0
04 Aug 2021
Deep Reinforcement Learning Based Networked Control with Network Delays
  for Signal Temporal Logic Specifications
Deep Reinforcement Learning Based Networked Control with Network Delays for Signal Temporal Logic Specifications
Junya Ikemoto
T. Ushio
76
3
0
03 Aug 2021
Variational Actor-Critic Algorithms
Variational Actor-Critic Algorithms
Yuhua Zhu
Lexing Ying
OffRL
59
0
0
03 Aug 2021
Physics-informed Dyna-Style Model-Based Deep Reinforcement Learning for
  Dynamic Control
Physics-informed Dyna-Style Model-Based Deep Reinforcement Learning for Dynamic Control
Xin-Yang Liu
Jian-Xun Wang
AI4CE
106
42
0
31 Jul 2021
Adaptive Approach Phase Guidance for a Hypersonic Glider via
  Reinforcement Meta Learning
Adaptive Approach Phase Guidance for a Hypersonic Glider via Reinforcement Meta Learning
B. Gaudet
K. Drozd
Ryan Meltzer
R. Furfaro
37
17
0
30 Jul 2021
Policy Gradient Methods Find the Nash Equilibrium in N-player
  General-sum Linear-quadratic Games
Policy Gradient Methods Find the Nash Equilibrium in N-player General-sum Linear-quadratic Games
B. Hambly
Renyuan Xu
Huining Yang
98
29
0
27 Jul 2021
A general sample complexity analysis of vanilla policy gradient
A general sample complexity analysis of vanilla policy gradient
Rui Yuan
Robert Mansel Gower
A. Lazaric
138
64
0
23 Jul 2021
A reinforcement learning approach to resource allocation in genomic
  selection
A reinforcement learning approach to resource allocation in genomic selection
Saba Moeinizade
Guiping Hu
Lizhi Wang
60
15
0
22 Jul 2021
Bayesian Controller Fusion: Leveraging Control Priors in Deep
  Reinforcement Learning for Robotics
Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics
Krishan Rana
Vibhavari Dasagi
Jesse Haviland
Ben Talbot
Michael Milford
Niko Sünderhauf
BDLOffRL
76
35
0
21 Jul 2021
Proximal Policy Optimization for Tracking Control Exploiting Future
  Reference Information
Proximal Policy Optimization for Tracking Control Exploiting Future Reference Information
Jana Mayer
Johannes Westermann
Juan Pedro Gutiérrez H. Muriedas
Uwe Mettin
A. Lampe
OffRL
35
0
0
20 Jul 2021
An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
João Carvalho
Davide Tateo
Fabio Muratore
Jan Peters
OffRL
56
7
0
20 Jul 2021
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning
Haoran Xu
Xianyuan Zhan
Xiangyu Zhu
OffRL
83
91
0
19 Jul 2021
Greedification Operators for Policy Optimization: Investigating Forward
  and Reverse KL Divergences
Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
Alan Chan
Hugo Silva
Sungsu Lim
Tadashi Kozuno
A. R. Mahmood
Martha White
90
31
0
17 Jul 2021
Refined Policy Improvement Bounds for MDPs
Refined Policy Improvement Bounds for MDPs
J. Dai
Mark O. Gluzman
45
3
0
16 Jul 2021
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided
  Exploration
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration
Yuda Song
Wen Sun
114
21
0
15 Jul 2021
Safer Reinforcement Learning through Transferable Instinct Networks
Safer Reinforcement Learning through Transferable Instinct Networks
Djordje Grbic
S. Risi
OffRLOnRL
80
4
0
14 Jul 2021
Recent Advances in Leveraging Human Guidance for Sequential
  Decision-Making Tasks
Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks
Ruohan Zhang
F. Torabi
Garrett A. Warnell
Peter Stone
163
29
0
13 Jul 2021
Cautious Policy Programming: Exploiting KL Regularization in Monotonic
  Policy Improvement for Reinforcement Learning
Cautious Policy Programming: Exploiting KL Regularization in Monotonic Policy Improvement for Reinforcement Learning
Lingwei Zhu
Toshinori Kitamura
Takamitsu Matsubara
OffRL
56
1
0
13 Jul 2021
Cautious Actor-Critic
Cautious Actor-Critic
Lingwei Zhu
Toshinori Kitamura
Takamitsu Matsubara
AAML
77
1
0
12 Jul 2021
Stabilizing Neural Control Using Self-Learned Almost Lyapunov Critics
Stabilizing Neural Control Using Self-Learned Almost Lyapunov Critics
Ya-Chien Chang
Sicun Gao
94
59
0
11 Jul 2021
LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of
  Sparse Reward Iterative Tasks
LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Sparse Reward Iterative Tasks
Albert Wilcox
Ashwin Balakrishna
Brijen Thananjeyan
Joseph E. Gonzalez
Ken Goldberg
95
12
0
10 Jul 2021
Safe Exploration by Solving Early Terminated MDP
Safe Exploration by Solving Early Terminated MDP
Hao Sun
Ziping Xu
Meng Fang
Zhenghao Peng
Jiadong Guo
Bo Dai
Bolei Zhou
47
17
0
09 Jul 2021
Intelligent Link Adaptation for Grant-Free Access Cellular Networks: A
  Distributed Deep Reinforcement Learning Approach
Intelligent Link Adaptation for Grant-Free Access Cellular Networks: A Distributed Deep Reinforcement Learning Approach
Joao V. C. Evangelista
Zeeshan Sattar
Georges Kaddoum
Bassant Selim
Aydin Sarraf
32
2
0
08 Jul 2021
Towards Autonomous Pipeline Inspection with Hierarchical Reinforcement
  Learning
Towards Autonomous Pipeline Inspection with Hierarchical Reinforcement Learning
N. Botteghi
L.J.L. Grefte
M. Poel
B. Sirmaçek
C. Brune
Edwin Dertien
Stefano Stramigioli
57
4
0
08 Jul 2021
RRL: Resnet as representation for Reinforcement Learning
RRL: Resnet as representation for Reinforcement Learning
Rutav Shah
Vikash Kumar
OffRL
109
115
0
07 Jul 2021
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning
Erdun Gao
Fan Feng
Chaochao Lu
Sara Magliacane
Kun Zhang
109
69
0
06 Jul 2021
Reinforcement Learning for Feedback-Enabled Cyber Resilience
Reinforcement Learning for Feedback-Enabled Cyber Resilience
Yunhan Huang
Linan Huang
Quanyan Zhu
107
71
0
02 Jul 2021
Applications of the Free Energy Principle to Machine Learning and
  Neuroscience
Applications of the Free Energy Principle to Machine Learning and Neuroscience
Beren Millidge
DRL
117
8
0
30 Jun 2021
Reinforcement Learning based Disease Progression Model for Alzheimer's
  Disease
Reinforcement Learning based Disease Progression Model for Alzheimer's Disease
Krishnakant V. Saboo
A. Choudhary
Yurui Cao
G. Worrell
David T. Jones
Ravishankar Iyer
OOD
38
14
0
30 Jun 2021
Understanding Adversarial Attacks on Observations in Deep Reinforcement
  Learning
Understanding Adversarial Attacks on Observations in Deep Reinforcement Learning
You Qiaoben
Chengyang Ying
Xinning Zhou
Hang Su
Jun Zhu
Bo Zhang
AAML
119
17
0
30 Jun 2021
Deep Multiagent Reinforcement Learning: Challenges and Directions
Deep Multiagent Reinforcement Learning: Challenges and Directions
Annie Wong
Thomas Bäck
Anna V. Kononova
Aske Plaat
AI4CE
119
98
0
29 Jun 2021
Curious Explorer: a provable exploration strategy in Policy Learning
Curious Explorer: a provable exploration strategy in Policy Learning
M. Miani
Maurizio Parton
M. Romito
131
0
0
29 Jun 2021
Policy Regularization via Noisy Advantage Values for Cooperative
  Multi-agent Actor-Critic methods
Policy Regularization via Noisy Advantage Values for Cooperative Multi-agent Actor-Critic methods
Jian Hu
Siyue Hu
Shih-Wei Liao
118
15
0
27 Jun 2021
Unifying Gradient Estimators for Meta-Reinforcement Learning via
  Off-Policy Evaluation
Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation
Yunhao Tang
Tadashi Kozuno
Mark Rowland
Rémi Munos
Michal Valko
OffRL
136
9
0
24 Jun 2021
Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic
  Manipulation via Discretisation
Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation
Stephen James
Kentaro Wada
Tristan Laidlow
Andrew J. Davison
106
135
0
23 Jun 2021
Local policy search with Bayesian optimization
Local policy search with Bayesian optimization
Sarah Müller
Alexander von Rohr
Sebastian Trimpe
BDL
89
42
0
22 Jun 2021
Policy Smoothing for Provably Robust Reinforcement Learning
Policy Smoothing for Provably Robust Reinforcement Learning
Aounon Kumar
Alexander Levine
Soheil Feizi
AAML
131
59
0
21 Jun 2021
Previous
123...131415...394041
Next