Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
49 / 1,149 papers shown
Title
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation
I. Popov
N. Heess
Timothy Lillicrap
Roland Hafner
Gabriel Barth-Maron
Matej Vecerík
Thomas Lampe
Yuval Tassa
Tom Erez
Martin Riedmiller
OffRL
17
263
0
10 Apr 2017
Stochastic Neural Networks for Hierarchical Reinforcement Learning
Carlos Florensa
Yan Duan
Pieter Abbeel
BDL
47
360
0
10 Apr 2017
DART: Noise Injection for Robust Imitation Learning
Michael Laskey
Jonathan Lee
Roy Fox
Anca Dragan
Ken Goldberg
32
244
0
27 Mar 2017
Deep Deterministic Policy Gradient for Urban Traffic Light Control
Noe Casas
29
165
0
27 Mar 2017
Failures of Gradient-Based Deep Learning
Shai Shalev-Shwartz
Ohad Shamir
Shaked Shammah
ODL
UQCV
34
198
0
23 Mar 2017
Learning to Navigate Cloth using Haptics
Alexander Clegg
Wenhao Yu
Zackory M. Erickson
Jie Tan
Chenxi Liu
Greg Turk
13
23
0
20 Mar 2017
Sensor Fusion for Robot Control through Deep Reinforcement Learning
Steven Bohez
Tim Verbelen
E. D. Coninck
B. Vankeirsbilck
Pieter Simoens
Bart Dhoedt
SSL
8
29
0
13 Mar 2017
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Tim Salimans
Jonathan Ho
Xi Chen
Szymon Sidor
Ilya Sutskever
22
1,516
0
10 Mar 2017
Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning
Yevgen Chebotar
Karol Hausman
Marvin Zhang
Gaurav Sukhatme
S. Schaal
Sergey Levine
19
159
0
08 Mar 2017
Learning a Unified Control Policy for Safe Falling
Visak C. V. Kumar
Sehoon Ha
Karen Liu
15
19
0
08 Mar 2017
Towards Generalization and Simplicity in Continuous Control
Aravind Rajeswaran
Kendall Lowrey
E. Todorov
Sham Kakade
OffRL
24
276
0
08 Mar 2017
Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Joshua Achiam
S. Shankar Sastry
29
235
0
06 Mar 2017
Reinforcement Learning for Pivoting Task
Rika Antonova
S. Cruciani
Christian Smith
Danica Kragic
8
68
0
01 Mar 2017
Learning Control for Air Hockey Striking using Deep Reinforcement Learning
Ayal Taitler
N. Shimkin
15
10
0
26 Feb 2017
Learning to Multi-Task by Active Sampling
Sahil Sharma
Ashutosh Jha
Parikshit Hegde
Balaraman Ravindran
10
21
0
20 Feb 2017
Uncertainty-Aware Reinforcement Learning for Collision Avoidance
G. Kahn
Adam R. Villaflor
Vitchyr H. Pong
Pieter Abbeel
Sergey Levine
30
311
0
03 Feb 2017
Deep Reinforcement Learning for Robotic Manipulation-The state of the art
S. Amarjyoti
18
65
0
31 Jan 2017
Expert Level control of Ramp Metering based on Multi-task Deep Reinforcement Learning
Francois Belletti
Daniel Haziza
G. Gomes
Alexandre M. Bayen
6
139
0
30 Jan 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,503
0
25 Jan 2017
Imitating Driver Behavior with Generative Adversarial Networks
Alex Kuefler
Jeremy Morton
T. Wheeler
Mykel Kochenderfer
GAN
29
405
0
24 Jan 2017
Deep Reinforcement Learning with Successor Features for Navigation across Similar Environments
Jingwei Zhang
Jost Tobias Springenberg
Joschka Boedecker
Wolfram Burgard
11
295
0
16 Dec 2016
Model-based Adversarial Imitation Learning
Nir Baram
Oron Anschel
Shie Mannor
GAN
22
42
0
07 Dec 2016
Generalizing Skills with Semi-Supervised Reinforcement Learning
Chelsea Finn
Tianhe Yu
Justin Fu
Pieter Abbeel
Sergey Levine
OffRL
SSL
19
67
0
01 Dec 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
48
760
0
15 Nov 2016
RL
2
^2
2
: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
35
1,007
0
09 Nov 2016
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Sergey Levine
OffRL
BDL
13
342
0
07 Nov 2016
Modular Multitask Reinforcement Learning with Policy Sketches
Jacob Andreas
Dan Klein
Sergey Levine
OffRL
18
458
0
06 Nov 2016
Combining policy gradient and Q-learning
Brendan O'Donoghue
Rémi Munos
Koray Kavukcuoglu
Volodymyr Mnih
OffRL
OnRL
24
139
0
05 Nov 2016
Deep Learning Approximation for Stochastic Control Problems
Jiequn Han
E. Weinan
BDL
16
190
0
02 Nov 2016
Sim-to-Real Robot Learning from Pixels with Progressive Nets
Andrei A. Rusu
Matej Vecerík
Thomas Rothörl
N. Heess
Razvan Pascanu
R. Hadsell
34
532
0
13 Oct 2016
Reset-free Trial-and-Error Learning for Robot Damage Recovery
Konstantinos Chatzilygeroudis
Vassilis Vassiliades
Jean-Baptiste Mouret
13
102
0
13 Oct 2016
Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model
Paul Christiano
Zain Shah
Igor Mordatch
Jonas Schneider
T. Blackwell
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
PINN
16
248
0
11 Oct 2016
Collective Robot Reinforcement Learning with Distributed Asynchronous Guided Policy Search
Ali Yahya
A. Li
Mrinal Kalakrishnan
Yevgen Chebotar
Sergey Levine
OffRL
16
155
0
03 Oct 2016
Path Integral Guided Policy Search
Yevgen Chebotar
Mrinal Kalakrishnan
Ali Yahya
A. Li
S. Schaal
Sergey Levine
14
149
0
03 Oct 2016
Deep Reinforcement Learning for Tensegrity Robot Locomotion
Marvin Zhang
Xinyang Geng
J. Bruce
Ken Caluwaerts
Massimo Vespignani
Vytas SunSpiral
Pieter Abbeel
Sergey Levine
12
92
0
28 Sep 2016
A Sensorimotor Reinforcement Learning Framework for Physical Human-Robot Interaction
Ali Ghadirzadeh
Judith Butepage
A. Maki
Danica Kragic
Mårten Björkman
17
49
0
27 Jul 2016
Continuously Learning Neural Dialogue Management
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
19
122
0
08 Jun 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
13
5,023
0
05 Jun 2016
Model-Free Imitation Learning with Policy Optimization
Jonathan Ho
Jayesh K. Gupta
Stefano Ermon
16
149
0
26 May 2016
Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games
Xiaoxiao Guo
Satinder Singh
Richard L. Lewis
Honglak Lee
14
55
0
24 Apr 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
16
1,684
0
22 Apr 2016
HIRL: Hierarchical Inverse Reinforcement Learning for Long-Horizon Tasks with Delayed Rewards
S. Krishnan
Animesh Garg
Richard Liaw
Lauren Miller
Florian T. Pokorny
Ken Goldberg
20
40
0
21 Apr 2016
Continuous Deep Q-Learning with Model-based Acceleration
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
16
1,008
0
02 Mar 2016
PLATO: Policy Learning using Adaptive Trajectory Optimization
G. Kahn
Tianhao Zhang
Sergey Levine
Pieter Abbeel
22
136
0
02 Mar 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
8
8,757
0
04 Feb 2016
Memory-based control with recurrent neural networks
N. Heess
Jonathan J. Hunt
Timothy Lillicrap
David Silver
18
301
0
14 Dec 2015
Adapting Deep Visuomotor Representations with Weak Pairwise Constraints
Eric Tzeng
Coline Devin
Judy Hoffman
Chelsea Finn
Pieter Abbeel
Sergey Levine
Kate Saenko
Trevor Darrell
OOD
24
138
0
23 Nov 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
14
13,110
0
09 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
11
3,316
0
08 Jun 2015
Previous
1
2
3
...
21
22
23