1502.05477
Cited By
v5 (latest)
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Papers citing "Trust Region Policy Optimization"
50 / 2,008 papers shown
Title
Beating the World's Best at Super Smash Bros. with Deep Reinforcement Learning
Vlad Firoiu
William F. Whitney
J. Tenenbaum
94
36
0
21 Feb 2017
Learning to Multi-Task by Active Sampling
Sahil Sharma
Ashutosh Jha
Parikshit Hegde
Balaraman Ravindran
151
21
0
20 Feb 2017
Cognitive Mapping and Planning for Visual Navigation
Saurabh Gupta
Varun Tolani
James Davidson
Sergey Levine
Rahul Sukthankar
Jitendra Malik
143
715
0
13 Feb 2017
Preparing for the Unknown: Learning a Universal Policy with Online System Identification
Wenhao Yu
Jie Tan
Chenxi Liu
Greg Turk
OffRL
122
309
0
08 Feb 2017
Adversarial Attacks on Neural Network Policies
Sandy Huang
Nicolas Papernot
Ian Goodfellow
Yan Duan
Pieter Abbeel
MLAU
AAML
131
842
0
08 Feb 2017
Uncertainty-Aware Reinforcement Learning for Collision Avoidance
G. Kahn
Adam R. Villaflor
Vitchyr H. Pong
Pieter Abbeel
Sergey Levine
107
317
0
03 Feb 2017
Deep Reinforcement Learning for Robotic Manipulation-The state of the art
S. Amarjyoti
56
66
0
31 Jan 2017
Expert Level control of Ramp Metering based on Multi-task Deep Reinforcement Learning
Francois Belletti
Daniel Haziza
G. Gomes
Alexandre M. Bayen
59
139
0
30 Jan 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
352
1,551
0
25 Jan 2017
Imitating Driver Behavior with Generative Adversarial Networks
Alex Kuefler
Jeremy Morton
T. Wheeler
Mykel Kochenderfer
GAN
117
408
0
24 Jan 2017
Scalable and Incremental Learning of Gaussian Mixture Models
R. Pinto
P. Engel
60
10
0
14 Jan 2017
Deep Reinforcement Learning with Successor Features for Navigation across Similar Environments
Jingwei Zhang
Jost Tobias Springenberg
Joschka Boedecker
Wolfram Burgard
85
295
0
16 Dec 2016
Reinforcement Learning With Temporal Logic Rewards
Xiao Li
C. Vasile
C. Belta
98
219
0
11 Dec 2016
Model-based Adversarial Imitation Learning
Nir Baram
Oron Anschel
Shie Mannor
GAN
76
42
0
07 Dec 2016
Generalizing Skills with Semi-Supervised Reinforcement Learning
Chelsea Finn
Tianhe Yu
Justin Fu
Pieter Abbeel
Sergey Levine
OffRL
SSL
111
69
0
01 Dec 2016
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU
Mohammad Babaeizadeh
I. Frosio
Stephen Tyree
Jason Clemons
Jan Kautz
OffRL
86
259
0
18 Nov 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
143
777
0
15 Nov 2016
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control
Natasha Jaques
S. Gu
Dzmitry Bahdanau
José Miguel Hernández-Lobato
Richard Turner
Douglas Eck
193
173
0
09 Nov 2016
RL²: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
120
1,029
0
09 Nov 2016
Recursive Regression with Neural Networks: Approximating the HJI PDE Solution
Vicenç Rubies-Royo
Claire Tomlin
81
20
0
08 Nov 2016
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Sergey Levine
OffRL
BDL
115
345
0
07 Nov 2016
Modular Multitask Reinforcement Learning with Policy Sketches
Jacob Andreas
Dan Klein
Sergey Levine
OffRL
219
463
0
06 Nov 2016
Combining policy gradient and Q-learning
Brendan O'Donoghue
Rémi Munos
Koray Kavukcuoglu
Volodymyr Mnih
OffRL
OnRL
109
140
0
05 Nov 2016
Sample Efficient Actor-Critic with Experience Replay
Ziyun Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
142
763
0
03 Nov 2016
Deep Learning Approximation for Stochastic Control Problems
Jiequn Han
E. Weinan
BDL
67
197
0
02 Nov 2016
Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics
J. M. Wong
86
11
0
01 Nov 2016
Sim-to-Real Robot Learning from Pixels with Progressive Nets
Andrei A. Rusu
Matej Vecerík
Thomas Rothörl
N. Heess
Razvan Pascanu
R. Hadsell
131
535
0
13 Oct 2016
Reset-free Trial-and-Error Learning for Robot Damage Recovery
Konstantinos Chatzilygeroudis
Vassilis Vassiliades
Jean-Baptiste Mouret
117
102
0
13 Oct 2016
Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model
Paul Christiano
Zain Shah
Igor Mordatch
Jonas Schneider
T. Blackwell
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
PINN
117
250
0
11 Oct 2016
Connecting Generative Adversarial Networks and Actor-Critic Methods
David Pfau
Oriol Vinyals
OffRL
AI4CE
127
186
0
06 Oct 2016
Collective Robot Reinforcement Learning with Distributed Asynchronous Guided Policy Search
Ali Yahya
A. Li
Mrinal Kalakrishnan
Yevgen Chebotar
Sergey Levine
OffRL
102
155
0
03 Oct 2016
Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates
S. Gu
E. Holly
Timothy Lillicrap
Sergey Levine
OffRL
SSL
186
1,485
0
03 Oct 2016
Path Integral Guided Policy Search
Yevgen Chebotar
Mrinal Kalakrishnan
Ali Yahya
A. Li
S. Schaal
Sergey Levine
104
149
0
03 Oct 2016
Deep Reinforcement Learning for Tensegrity Robot Locomotion
Marvin Zhang
Xinyang Geng
J. Bruce
Ken Caluwaerts
Massimo Vespignani
Vytas SunSpiral
Pieter Abbeel
Sergey Levine
87
94
0
28 Sep 2016
Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer
Coline Devin
Abhishek Gupta
Trevor Darrell
Pieter Abbeel
Sergey Levine
OffRL
96
401
0
22 Sep 2016
A Sensorimotor Reinforcement Learning Framework for Physical Human-Robot Interaction
Ali Ghadirzadeh
Judith Butepage
A. Maki
Danica Kragic
Mårten Björkman
83
52
0
27 Jul 2016
Guided Policy Search as Approximate Mirror Descent
William H. Montgomery
Sergey Levine
77
126
0
15 Jul 2016
Model-Free Trajectory-based Policy Optimization with Monotonic Improvement
R. Akrour
A. Abdolmaleki
Hany Abdulsamad
Jan Peters
Gerhard Neumann
108
49
0
29 Jun 2016
Strategic Attentive Writer for Learning Macro-Actions
Alexander
A. Vezhnevets
Volodymyr Mnih
J. Agapiou
Simon Osindero
Alex Graves
Oriol Vinyals
Koray Kavukcuoglu
69
171
0
15 Jun 2016
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
244
3,136
0
10 Jun 2016
Continuously Learning Neural Dialogue Management
Pei-hao Su
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Stefan Ultes
David Vandyke
Tsung-Hsien Wen
S. Young
91
122
0
08 Jun 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
408
5,098
0
05 Jun 2016
VIME: Variational Information Maximizing Exploration
Rein Houthooft
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
131
78
0
31 May 2016
Predicting Personal Traits from Facial Images using Convolutional Neural Networks Augmented with Facial Landmark Information
Yoad Lewenberg
Valliappa Chockalingam
Satinder Singh
Honglak Lee
CVBM
81
305
0
29 May 2016
Model-Free Imitation Learning with Policy Optimization
Jonathan Ho
Jayesh K. Gupta
Stefano Ermon
68
149
0
26 May 2016
Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games
Xiaoxiao Guo
Satinder Singh
Richard L. Lewis
Honglak Lee
96
55
0
24 Apr 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
118
1,698
0
22 Apr 2016
Continuous Deep Q-Learning with Model-based Acceleration
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
127
1,013
0
02 Mar 2016
PLATO: Policy Learning using Adaptive Trajectory Optimization
G. Kahn
Tianhao Zhang
Sergey Levine
Pieter Abbeel
124
137
0
02 Mar 2016
A review on locomotion robophysics: the study of movement at the intersection of robotics, soft matter and dynamical systems
J. Aguilar
Tingnan Zhang
Feifei Qian
Mark Kingsbury
Benjamin W. McInroe
...
Matthew Travers
Ross L. Hatton
Howie Choset
P. Umbanhowar
Daniel I. Goldman
57
242
0
12 Feb 2016