Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel

Papers citing "Trust Region Policy Optimization"

50 / 2,009 papers shown
Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines
Cathy Wu
Aravind Rajeswaran
Yan Duan
Vikash Kumar
Alexandre M. Bayen
Sham Kakade
Igor Mordatch
Pieter Abbeel
OffRL
92
153
0
20 Mar 2018
Setting up a Reinforcement Learning Task with a Real-World Robot
A. R. Mahmood
D. Korenkevych
Brent Komer
James Bergstra
81
77
0
19 Mar 2018
Simple random search provides a competitive approach to reinforcement learning
Horia Mania
Aurelia Guy
Benjamin Recht
93
317
0
19 Mar 2018
Learning to Explore with Meta-Policy Gradient
Tianbing Xu
Qiang Liu
Liang Zhao
Jian Peng
74
54
0
13 Mar 2018
Comparing Task Simplifications to Learn Closed-Loop Object Picking Using Deep Reinforcement Learning
Michel Breyer
Fadri Furrer
Tonci Novkovic
Roland Siegwart
Juan I. Nieto
SSL, OffRL
92
47
0
13 Mar 2018
Policy Search in Continuous Action Domains: an Overview
Olivier Sigaud
F. Stulp
134
73
0
13 Mar 2018
Transfer Learning with Neural AutoML
Catherine Wong
N. Houlsby
Yifeng Lu
Andrea Gesmundo
78
114
0
07 Mar 2018
Smoothed Action Value Functions for Learning Gaussian Policies
Ofir Nachum
Mohammad Norouzi
George Tucker
Dale Schuurmans
88
28
0
06 Mar 2018
Recurrent Predictive State Policy Networks
Ahmed S. Hefny
Zita Marinho
Wen Sun
S. Srinivasa
Geoffrey J. Gordon
86
19
0
05 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
133
883
0
03 Mar 2018
OIL: Observational Imitation Learning
Ge Li
Matthias Muller
Vincent Casser
Neil G. Smith
D. L. Michels
Guohao Li
127
41
0
03 Mar 2018
Some Considerations on Learning to Explore via Meta-Reinforcement Learning
Bradly C. Stadie
Ge Yang
Rein Houthooft
Xi Chen
Yan Duan
Yuhuai Wu
Pieter Abbeel
Ilya Sutskever
LRM
102
116
0
03 Mar 2018
Multi-Agent Imitation Learning for Driving Simulation
Raunak P. Bhattacharyya
Derek J. Phillips
Blake Wulfe
Jeremy Morton
Alex Kuefler
Mykel J. Kochenderfer
76
121
0
02 Mar 2018
Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application
Yujing Hu
Qing Da
Anxiang Zeng
Yang Yu
Yinghui Xu
118
181
0
02 Mar 2018
Model-Ensemble Trust-Region Policy Optimization
Thanard Kurutach
I. Clavera
Yan Duan
Aviv Tamar
Pieter Abbeel
84
453
0
28 Feb 2018
Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods
Deirdre Quillen
Eric Jang
Ofir Nachum
Chelsea Finn
Julian Ibarz
Sergey Levine
OOD, OffRL
102
204
0
28 Feb 2018
The Mirage of Action-Dependent Baselines in Reinforcement Learning
George Tucker
Surya Bhupatiraju
S. Gu
Richard Turner
Zoubin Ghahramani
Sergey Levine
OffRL
115
127
0
27 Feb 2018
Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Yuke Zhu
Ziyun Wang
J. Merel
Andrei A. Rusu
Tom Erez
...
S. Tunyasuvunakool
János Kramár
R. Hadsell
Nando de Freitas
N. Heess
SSL
140
320
0
26 Feb 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
392
5,256
0
26 Feb 2018
Structured Control Nets for Deep Reinforcement Learning
Mario Srouji
Jian Zhang
Ruslan Salakhutdinov
81
43
0
22 Feb 2018
Clipped Action Policy Gradient
Yasuhiro Fujita
S. Maeda
OffRL
53
37
0
21 Feb 2018
Meta-Reinforcement Learning of Structured Exploration Strategies
Abhishek Gupta
Russell Mendonca
YuXuan Liu
Pieter Abbeel
Sergey Levine
OffRL
141
350
0
20 Feb 2018
Layer-wise synapse optimization for implementing neural networks on general neuromorphic architectures
John Mern
Jayesh K. Gupta
Mykel Kochenderfer
55
1
0
20 Feb 2018
Fourier Policy Gradients
M. Fellows
K. Ciosek
Shimon Whiteson
53
15
0
19 Feb 2018
Accelerated Primal-Dual Policy Optimization for Safe Reinforcement Learning
Qingkai Liang
Fanyu Que
E. Modiano
85
102
0
19 Feb 2018
Diversity is All You Need: Learning Skills without a Reward Function
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
271
1,089
0
16 Feb 2018
Reinforcement Learning from Imperfect Demonstrations
Yang Gao
Huazhe Xu
Ji Lin
Feng Yu
Sergey Levine
Trevor Darrell
86
202
0
14 Feb 2018
DiCE: The Infinitely Differentiable Monte-Carlo Estimator
Jakob N. Foerster
Gregory Farquhar
Maruan Al-Shedivat
Tim Rocktäschel
Eric Xing
Shimon Whiteson
123
97
0
14 Feb 2018
GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
138
159
0
14 Feb 2018
Evolved Policy Gradients
Rein Houthooft
Richard Y. Chen
Phillip Isola
Bradly C. Stadie
Filip Wolski
Jonathan Ho
Pieter Abbeel
109
228
0
13 Feb 2018
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
Zhang-Wei Hong
Tzu-Yun Shann
Shih-Yang Su
Yi-Hsiang Chang
Chun-Yi Lee
103
124
0
13 Feb 2018
Hierarchical Learning for Modular Robots
R. Kojcev
Nora Etxezarreta
Alejandro Hernández
Víctor Mayoral
46
4
0
12 Feb 2018
Taking gradients through experiments: LSTMs and memory proximal policy optimization for black-box quantum control
Moritz August
José Miguel Hernández-Lobato
73
41
0
12 Feb 2018
Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces
Gellert Weisz
Paweł Budzianowski
Pei-hao Su
Milica Gasic
47
83
0
11 Feb 2018
Beyond the One Step Greedy Approach in Reinforcement Learning
Yonathan Efroni
Gal Dalal
B. Scherrer
Shie Mannor
OffRL
117
51
0
10 Feb 2018
Path Consistency Learning in Tsallis Entropy Regularized MDPs
Ofir Nachum
Yinlam Chow
Mohammad Ghavamzadeh
74
47
0
10 Feb 2018
Balancing Two-Player Stochastic Games with Soft Q-Learning
Jordi Grau-Moya
Felix Leibfried
Haitham Bou-Ammar
132
43
0
09 Feb 2018
Learning and Querying Fast Generative Models for Reinforcement Learning
Lars Buesing
T. Weber
S. Racanière
S. M. Ali Eslami
Danilo Jimenez Rezende
...
Fabio Viola
F. Besse
Karol Gregor
Demis Hassabis
Daan Wierstra
OffRL
87
135
0
08 Feb 2018
VR-Goggles for Robots: Real-to-sim Domain Adaptation for Visual Control
Jingwei Zhang
L. Tai
Peng Yun
Yufeng Xiong
Ming-Yuan Liu
Joschka Boedecker
Wolfram Burgard
115
123
0
01 Feb 2018
Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations
Xiaoqin Zhang
Huimin Ma
OffRL
119
38
0
31 Jan 2018
A Deep Reinforcement Learning Approach for Dynamically Stable Inverse Kinematics of Humanoid Robots
S. Teja
Parijat Dewangan
P. Guhan
Abhishek Sarkar
K Madhava Krishna
75
55
0
31 Jan 2018
Understanding Human Behaviors in Crowds by Imitating the Decision-Making Process
Haosheng Zou
Hang Su
Shihong Song
Jun Zhu
68
49
0
25 Jan 2018
Learning Symmetric and Low-energy Locomotion
Wenhao Yu
Greg Turk
Chenxi Liu
126
186
0
24 Jan 2018
An Empirical Analysis of Proximal Policy Optimization with Kronecker-factored Natural Gradients
Jiaming Song
Yuhuai Wu
42
2
0
17 Jan 2018
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator
Maryam Fazel
Rong Ge
Sham Kakade
M. Mesbahi
117
610
0
15 Jan 2018
Autonomous Driving in Reality with Reinforcement Learning and Image Translation
N. Xu
Bowen Tan
Bingyu Kong
76
36
0
13 Jan 2018
Model-Based Action Exploration for Learning Dynamic Motion Skills
Glen Berseth
M. van de Panne
59
0
0
11 Jan 2018
Expected Policy Gradients for Reinforcement Learning
K. Ciosek
Shimon Whiteson
125
53
0
10 Jan 2018
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes
Igor Adamski
R. Adamski
T. Grel
Adam Jedrych
Kamil Kaczmarek
Henryk Michalewski
OffRL
128
37
0
09 Jan 2018
Jointly Learning to Construct and Control Agents using Deep Reinforcement Learning
Charles B. Schaff
David Yunis
Ayan Chakrabarti
Matthew R. Walter
100
101
0
04 Jan 2018