PLATO: Policy Learning using Adaptive Trajectory Optimization

2 March 2016
G. Kahn
Tianhao Zhang
Sergey Levine
Pieter Abbeel

Papers citing "PLATO: Policy Learning using Adaptive Trajectory Optimization"

14 / 14 papers shown
Adaptive Information Gathering via Imitation Learning
Sanjiban Choudhury
Ashish Kapoor
G. Ranade
Sebastian Scherer
Debadeepta Dey
63
22
0
22 May 2017
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction
Wen Sun
Arun Venkatraman
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
132
235
0
03 Mar 2017
Learning Continuous Control Policies by Stochastic Value Gradients
N. Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez
97
560
0
30 Oct 2015
Learning Deep Control Policies for Autonomous Aerial Vehicles with MPC-Guided Policy Search
Tianhao Zhang
G. Kahn
Sergey Levine
Pieter Abbeel
76
427
0
22 Sep 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
323
13,272
0
09 Sep 2015
DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving
Chenyi Chen
Ari Seff
A. Kornhauser
Jianxiong Xiao
99
1,765
0
01 May 2015
End-to-End Training of Deep Visuomotor Policies
Sergey Levine
Chelsea Finn
Trevor Darrell
Pieter Abbeel
BDL
315
3,442
0
02 Apr 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
277
6,793
0
19 Feb 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.9K
150,260
0
22 Dec 2014
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
437
20,568
0
10 Sep 2014
Caffe: Convolutional Architecture for Fast Feature Embedding
Yangqing Jia
Evan Shelhamer
Jeff Donahue
Sergey Karayev
Jonathan Long
Ross B. Girshick
S. Guadarrama
Trevor Darrell
VLM BDL 3DV
274
14,713
0
20 Jun 2014
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
127
12,261
0
19 Dec 2013
Learning Monocular Reactive UAV Control in Cluttered Natural Environments
Stéphane Ross
Narek Melik-Barkhudarov
Kumar Shaurya Shankar
Andreas Wendel
Debadeepta Dey
J. Andrew Bagnell
M. Hebert
119
438
0
07 Nov 2012
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
231
3,232
0
02 Nov 2010