PLATO: Policy Learning using Adaptive Trajectory Optimization

v1v2v3v4 (latest)

PLATO: Policy Learning using Adaptive Trajectory Optimization

2 March 2016

Pieter Abbeel

ArXiv (abs)PDF HTML

Papers citing "PLATO: Policy Learning using Adaptive Trajectory Optimization"

14 / 14 papers shown

Title
Adaptive Information Gathering via Imitation Learning Sanjiban Choudhury Ashish Kapoor G. Ranade Sebastian Scherer Debadeepta Dey 63 22 0 22 May 2017
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction Wen Sun Arun Venkatraman Geoffrey J. Gordon Byron Boots J. Andrew Bagnell 132 235 0 03 Mar 2017
Learning Continuous Control Policies by Stochastic Value Gradients N. Heess Greg Wayne David Silver Timothy Lillicrap Yuval Tassa Tom Erez 97 560 0 30 Oct 2015
Learning Deep Control Policies for Autonomous Aerial Vehicles with MPC-Guided Policy Search Tianhao Zhang G. Kahn Sergey Levine Pieter Abbeel 76 427 0 22 Sep 2015
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 323 13,272 0 09 Sep 2015
DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving Chenyi Chen Ari Seff A. Kornhauser Jianxiong Xiao 99 1,765 0 01 May 2015
End-to-End Training of Deep Visuomotor Policies Sergey Levine Chelsea Finn Trevor Darrell Pieter Abbeel BDL 315 3,442 0 02 Apr 2015
Trust Region Policy Optimization John Schulman Sergey Levine Philipp Moritz Michael I. Jordan Pieter Abbeel 277 6,793 0 19 Feb 2015
Adam: A Method for Stochastic Optimization Diederik P. Kingma Jimmy Ba ODL 1.9K 150,260 0 22 Dec 2014
Sequence to Sequence Learning with Neural Networks Ilya Sutskever Oriol Vinyals Quoc V. Le AIMat 437 20,568 0 10 Sep 2014
Caffe: Convolutional Architecture for Fast Feature Embedding Yangqing Jia Evan Shelhamer Jeff Donahue Sergey Karayev Jonathan Long Ross B. Girshick S. Guadarrama Trevor Darrell VLM BDL 3DV 274 14,713 0 20 Jun 2014
Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu David Silver Alex Graves Ioannis Antonoglou Daan Wierstra Martin Riedmiller 127 12,261 0 19 Dec 2013
Learning Monocular Reactive UAV Control in Cluttered Natural Environments Stéphane Ross Narek Melik-Barkhudarov Kumar Shaurya Shankar Andreas Wendel Debadeepta Dey J. Andrew Bagnell M. Hebert 119 438 0 07 Nov 2012
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning Stéphane Ross Geoffrey J. Gordon J. Andrew Bagnell OffRL 231 3,232 0 02 Nov 2010