1502.05477
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Papers citing "Trust Region Policy Optimization"
8 / 2,008 papers shown
Title
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
162
657
0
09 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
455
8,901
0
04 Feb 2016
Memory-based control with recurrent neural networks
N. Heess
Jonathan J. Hunt
Timothy Lillicrap
David Silver
112
304
0
14 Dec 2015
State of the Art Control of Atari Games Using Shallow Reinforcement Learning
Yitao Liang
Marlos C. Machado
Erik Talvitie
Michael Bowling
117
113
0
04 Dec 2015
Adapting Deep Visuomotor Representations with Weak Pairwise Constraints
Eric Tzeng
Coline Devin
Judy Hoffman
Chelsea Finn
Pieter Abbeel
Sergey Levine
Kate Saenko
Trevor Darrell
OOD
122
140
0
23 Nov 2015
Learning Continuous Control Policies by Stochastic Value Gradients
N. Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez
129
561
0
30 Oct 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
439
13,348
0
09 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
171
3,453
0
08 Jun 2015