Trust Region Policy Optimization

19 February 2015

Michael I. Jordan

Pieter Abbeel

Papers citing "Trust Region Policy Optimization"

Showing 8 of 2,008 citing papers

Title | Authors | Tag | Metrics | Date
Value Iteration Networks | Aviv Tamar, Yi Wu, G. Thomas, Sergey Levine, Pieter Abbeel | | 162 657 0 | 09 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning | Volodymyr Mnih, Adria Puigdomenech Badia, Mehdi Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu | | 455 8,901 0 | 04 Feb 2016
Memory-based control with recurrent neural networks | N. Heess, Jonathan J. Hunt, Timothy Lillicrap, David Silver | | 112 304 0 | 14 Dec 2015
State of the Art Control of Atari Games Using Shallow Reinforcement Learning | Yitao Liang, Marlos C. Machado, Erik Talvitie, Michael Bowling | | 117 113 0 | 04 Dec 2015
Adapting Deep Visuomotor Representations with Weak Pairwise Constraints | Eric Tzeng, Coline Devin, Judy Hoffman, Chelsea Finn, Pieter Abbeel, Sergey Levine, Kate Saenko, Trevor Darrell | OOD | 122 140 0 | 23 Nov 2015
Learning Continuous Control Policies by Stochastic Value Gradients | N. Heess, Greg Wayne, David Silver, Timothy Lillicrap, Yuval Tassa, Tom Erez | | 129 561 0 | 30 Oct 2015
Continuous control with deep reinforcement learning | Timothy Lillicrap, Jonathan J. Hunt, Alexander Pritzel, N. Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra | | 439 13,348 0 | 09 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation | John Schulman, Philipp Moritz, Sergey Levine, Michael I. Jordan, Pieter Abbeel | OffRL | 171 3,453 0 | 08 Jun 2015