1502.05477
Cited By
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Papers citing "Trust Region Policy Optimization"
16 / 1,216 papers shown
Title
Deep Reinforcement Learning for Tensegrity Robot Locomotion
Marvin Zhang
Xinyang Geng
J. Bruce
Ken Caluwaerts
Massimo Vespignani
Vytas SunSpiral
Pieter Abbeel
Sergey Levine
17
92
0
28 Sep 2016
A Sensorimotor Reinforcement Learning Framework for Physical Human-Robot Interaction
Ali Ghadirzadeh
Judith Butepage
A. Maki
Danica Kragic
Mårten Björkman
22
49
0
27 Jul 2016
Continuously Learning Neural Dialogue Management
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
32
122
0
08 Jun 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
15
5,023
0
05 Jun 2016
Model-Free Imitation Learning with Policy Optimization
Jonathan Ho
Jayesh K. Gupta
Stefano Ermon
21
149
0
26 May 2016
Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games
Xiaoxiao Guo
Satinder Singh
Richard L. Lewis
Honglak Lee
16
55
0
24 Apr 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
18
1,685
0
22 Apr 2016
HIRL: Hierarchical Inverse Reinforcement Learning for Long-Horizon Tasks with Delayed Rewards
S. Krishnan
Animesh Garg
Richard Liaw
Lauren Miller
Florian T. Pokorny
Ken Goldberg
30
40
0
21 Apr 2016
Continuous Deep Q-Learning with Model-based Acceleration
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
25
1,008
0
02 Mar 2016
PLATO: Policy Learning using Adaptive Trajectory Optimization
G. Kahn
Tianhao Zhang
Sergey Levine
Pieter Abbeel
26
136
0
02 Mar 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
13
8,765
0
04 Feb 2016
Memory-based control with recurrent neural networks
N. Heess
Jonathan J. Hunt
Timothy Lillicrap
David Silver
29
301
0
14 Dec 2015
Adapting Deep Visuomotor Representations with Weak Pairwise Constraints
Eric Tzeng
Coline Devin
Judy Hoffman
Chelsea Finn
Pieter Abbeel
Sergey Levine
Kate Saenko
Trevor Darrell
OOD
24
138
0
23 Nov 2015
Learning Continuous Control Policies by Stochastic Value Gradients
N. Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez
19
557
0
30 Oct 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
19
13,110
0
09 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
13
3,316
0
08 Jun 2015