ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.05477
  4. Cited By
Trust Region Policy Optimization

Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
ArXivPDFHTML

Papers citing "Trust Region Policy Optimization"

50 / 1,149 papers shown
Title
Unsupervised Meta-Learning for Reinforcement Learning
Unsupervised Meta-Learning for Reinforcement Learning
Abhishek Gupta
Benjamin Eysenbach
Chelsea Finn
Sergey Levine
SSL
OffRL
54
106
0
12 Jun 2018
PAC-Bayes Control: Learning Policies that Provably Generalize to Novel
  Environments
PAC-Bayes Control: Learning Policies that Provably Generalize to Novel Environments
Anirudha Majumdar
M. Goldstein
Anoopkumar Sonar
23
18
0
11 Jun 2018
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement
  Learning with Trajectory Embeddings
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
John D. Co-Reyes
YuXuan Liu
Abhishek Gupta
Benjamin Eysenbach
Pieter Abbeel
Sergey Levine
SSL
BDL
AIFin
31
142
0
07 Jun 2018
Neural Control Variates for Variance Reduction
Neural Control Variates for Variance Reduction
Ruosi Wan
Mingjun Zhong
Haoyi Xiong
Zhanxing Zhu
BDL
DRL
16
18
0
01 Jun 2018
Supervised Policy Update for Deep Reinforcement Learning
Supervised Policy Update for Deep Reinforcement Learning
Q. Vuong
Yiming Zhang
Keith Ross
19
20
0
29 May 2018
Variational Inverse Control with Events: A General Framework for
  Data-Driven Reward Definition
Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition
Justin Fu
Avi Singh
Dibya Ghosh
Larry Yang
Sergey Levine
BDL
14
125
0
29 May 2018
Truncated Horizon Policy Search: Combining Reinforcement Learning &
  Imitation Learning
Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning
Wen Sun
J. Andrew Bagnell
Byron Boots
28
93
0
29 May 2018
Dual Policy Iteration
Dual Policy Iteration
Wen Sun
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
OffRL
18
56
0
28 May 2018
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Supratik Paul
Michael A. Osborne
Shimon Whiteson
32
18
0
27 May 2018
Reliability and Learnability of Human Bandit Feedback for
  Sequence-to-Sequence Reinforcement Learning
Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning
Julia Kreutzer
Joshua Uyheng
Stefan Riezler
28
85
0
27 May 2018
Fast Policy Learning through Imitation and Reinforcement
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
26
83
0
26 May 2018
Learning Self-Imitating Diverse Policies
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
24
65
0
25 May 2018
Object-Oriented Dynamics Predictor
Object-Oriented Dynamics Predictor
Guangxiang Zhu
Zhiao Huang
Chongjie Zhang
AI4CE
24
36
0
25 May 2018
Intelligent Trainer for Model-Based Reinforcement Learning
Intelligent Trainer for Model-Based Reinforcement Learning
Yuanlong Li
Linsen Dong
Xin Zhou
Yonggang Wen
K. Guan
OffRL
24
0
0
24 May 2018
Maximum Causal Tsallis Entropy Imitation Learning
Maximum Causal Tsallis Entropy Imitation Learning
Kyungjae Lee
Sungjoon Choi
Songhwai Oh
OOD
26
19
0
22 May 2018
Constrained Policy Improvement for Safe and Efficient Reinforcement
  Learning
Constrained Policy Improvement for Safe and Efficient Reinforcement Learning
Elad Sarafian
Aviv Tamar
Sarit Kraus
OffRL
32
11
0
20 May 2018
Task-Agnostic Meta-Learning for Few-shot Learning
Task-Agnostic Meta-Learning for Few-shot Learning
Muhammad Abdullah Jamal
Guo-Jun Qi
M. Shah
52
456
0
20 May 2018
Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for
  Map-less Navigation by Leveraging Prior Demonstrations
Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for Map-less Navigation by Leveraging Prior Demonstrations
Mark Pfeiffer
Samarth Shukla
M. Turchetta
Cesar Cadena
Andreas Krause
Roland Siegwart
Juan I. Nieto
16
157
0
18 May 2018
Exploration by Distributional Reinforcement Learning
Exploration by Distributional Reinforcement Learning
Yunhao Tang
Shipra Agrawal
OOD
41
30
0
04 May 2018
Reinforcement Learning and Control as Probabilistic Inference: Tutorial
  and Review
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Sergey Levine
AI4CE
BDL
27
658
0
02 May 2018
The Limits and Potentials of Deep Learning for Robotics
The Limits and Potentials of Deep Learning for Robotics
Niko Sünderhauf
Oliver Brock
Walter J. Scheirer
R. Hadsell
Dieter Fox
...
B. Upcroft
Pieter Abbeel
Wolfram Burgard
Michael Milford
Peter Corke
17
522
0
18 Apr 2018
An Adaptive Clipping Approach for Proximal Policy Optimization
An Adaptive Clipping Approach for Proximal Policy Optimization
Gang Chen
Yiming Peng
Mengjie Zhang
14
22
0
17 Apr 2018
Derivative free optimization via repeated classification
Derivative free optimization via repeated classification
Tatsunori B. Hashimoto
Steve Yadlowsky
John C. Duchi
11
18
0
11 Apr 2018
StarCraft Micromanagement with Reinforcement Learning and Curriculum
  Transfer Learning
StarCraft Micromanagement with Reinforcement Learning and Curriculum Transfer Learning
Kun Shao
Yuanheng Zhu
Dongbin Zhao
107
170
0
03 Apr 2018
Learning to Run challenge: Synthesizing physiologically accurate motion
  using deep reinforcement learning
Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning
L. Kidzinski
Sharada Mohanty
Carmichael F. Ong
Jennifer Hicks
Sean F. Carroll
Sergey Levine
M. Salathé
Scott L. Delp
29
60
0
31 Mar 2018
Optimizing Sponsored Search Ranking Strategy by Deep Reinforcement
  Learning
Optimizing Sponsored Search Ranking Strategy by Deep Reinforcement Learning
Li He
Liang Wang
Kaipeng Liu
Bo Wu
Weinan Zhang
29
7
0
20 Mar 2018
Setting up a Reinforcement Learning Task with a Real-World Robot
Setting up a Reinforcement Learning Task with a Real-World Robot
A. R. Mahmood
D. Korenkevych
Brent Komer
James Bergstra
26
75
0
19 Mar 2018
Policy Search in Continuous Action Domains: an Overview
Policy Search in Continuous Action Domains: an Overview
Olivier Sigaud
F. Stulp
16
72
0
13 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning
  Approaches
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
29
873
0
03 Mar 2018
Some Considerations on Learning to Explore via Meta-Reinforcement
  Learning
Some Considerations on Learning to Explore via Meta-Reinforcement Learning
Bradly C. Stadie
Ge Yang
Rein Houthooft
Xi Chen
Yan Duan
Yuhuai Wu
Pieter Abbeel
Ilya Sutskever
LRM
25
116
0
03 Mar 2018
Multi-Agent Imitation Learning for Driving Simulation
Multi-Agent Imitation Learning for Driving Simulation
Raunak P. Bhattacharyya
Derek J. Phillips
Blake Wulfe
Jeremy Morton
Alex Kuefler
Mykel J. Kochenderfer
22
118
0
02 Mar 2018
Reinforcement Learning to Rank in E-Commerce Search Engine:
  Formalization, Analysis, and Application
Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application
Yujing Hu
Qing Da
Anxiang Zeng
Yang Yu
Yinghui Xu
14
179
0
02 Mar 2018
The Mirage of Action-Dependent Baselines in Reinforcement Learning
The Mirage of Action-Dependent Baselines in Reinforcement Learning
George Tucker
Surya Bhupatiraju
S. Gu
Richard Turner
Zoubin Ghahramani
Sergey Levine
OffRL
27
126
0
27 Feb 2018
Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Yuke Zhu
Ziyun Wang
J. Merel
Andrei A. Rusu
Tom Erez
...
S. Tunyasuvunakool
János Kramár
R. Hadsell
Nando de Freitas
N. Heess
SSL
26
316
0
26 Feb 2018
Structured Control Nets for Deep Reinforcement Learning
Structured Control Nets for Deep Reinforcement Learning
Mario Srouji
Jian Zhang
Ruslan Salakhutdinov
30
43
0
22 Feb 2018
Clipped Action Policy Gradient
Clipped Action Policy Gradient
Yasuhiro Fujita
S. Maeda
OffRL
34
37
0
21 Feb 2018
Fourier Policy Gradients
Fourier Policy Gradients
M. Fellows
K. Ciosek
Shimon Whiteson
35
15
0
19 Feb 2018
Evolved Policy Gradients
Evolved Policy Gradients
Rein Houthooft
Richard Y. Chen
Phillip Isola
Bradly C. Stadie
Filip Wolski
Jonathan Ho
Pieter Abbeel
49
227
0
13 Feb 2018
Hierarchical Learning for Modular Robots
Hierarchical Learning for Modular Robots
R. Kojcev
Nora Etxezarreta
Alejandro Hernández
Víctor Mayoral
19
4
0
12 Feb 2018
Taking gradients through experiments: LSTMs and memory proximal policy
  optimization for black-box quantum control
Taking gradients through experiments: LSTMs and memory proximal policy optimization for black-box quantum control
Moritz August
José Miguel Hernández-Lobato
26
41
0
12 Feb 2018
Path Consistency Learning in Tsallis Entropy Regularized MDPs
Path Consistency Learning in Tsallis Entropy Regularized MDPs
Ofir Nachum
Yinlam Chow
Mohammad Ghavamzadeh
13
45
0
10 Feb 2018
Balancing Two-Player Stochastic Games with Soft Q-Learning
Balancing Two-Player Stochastic Games with Soft Q-Learning
Jordi Grau-Moya
Felix Leibfried
Haitham Bou-Ammar
24
42
0
09 Feb 2018
Evaluation of Deep Reinforcement Learning Methods for Modular Robots
Evaluation of Deep Reinforcement Learning Methods for Modular Robots
R. Kojcev
Nora Etxezarreta
Alejandro Hernández
Víctor Mayoral
OffRL
23
4
0
07 Feb 2018
VR-Goggles for Robots: Real-to-sim Domain Adaptation for Visual Control
VR-Goggles for Robots: Real-to-sim Domain Adaptation for Visual Control
Jingwei Zhang
L. Tai
Peng Yun
Yufeng Xiong
Ming Liu
Joschka Boedecker
Wolfram Burgard
21
121
0
01 Feb 2018
Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With
  Expert Demonstrations
Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations
Xiaoqin Zhang
Huimin Ma
OffRL
27
38
0
31 Jan 2018
Constraint Estimation and Derivative-Free Recovery for Robot Learning
  from Demonstrations
Constraint Estimation and Derivative-Free Recovery for Robot Learning from Demonstrations
Jonathan Lee
Michael Laskey
Roy Fox
Ken Goldberg
13
4
0
31 Jan 2018
Understanding Human Behaviors in Crowds by Imitating the Decision-Making
  Process
Understanding Human Behaviors in Crowds by Imitating the Decision-Making Process
Haosheng Zou
Hang Su
Shihong Song
Jun Zhu
27
48
0
25 Jan 2018
An Empirical Analysis of Proximal Policy Optimization with
  Kronecker-factored Natural Gradients
An Empirical Analysis of Proximal Policy Optimization with Kronecker-factored Natural Gradients
Jiaming Song
Yuhuai Wu
24
2
0
17 Jan 2018
Global Convergence of Policy Gradient Methods for the Linear Quadratic
  Regulator
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator
Maryam Fazel
Rong Ge
Sham Kakade
M. Mesbahi
35
597
0
15 Jan 2018
Expected Policy Gradients for Reinforcement Learning
Expected Policy Gradients for Reinforcement Learning
K. Ciosek
Shimon Whiteson
50
51
0
10 Jan 2018
Previous
123...20212223
Next