Predictor-Corrector Policy Optimization

15 October 2018

Papers citing "Predictor-Corrector Policy Optimization"

Title
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning Patrick Yin Tyler Westenbroek Simran Bagaria Kevin Huang Ching-An Cheng Andrey Kolobov Abhishek Gupta 113 3 0 04 Feb 2025
Functional Acceleration for Policy Mirror Descent Veronica Chelu Doina Precup 44 0 0 23 Jul 2024
Introduction to Online Convex Optimization Elad Hazan OffRL 90 1,922 0 07 Sep 2019
On the Convergence of Adam and Beyond Sashank J. Reddi Satyen Kale Sanjiv Kumar 58 2,482 0 19 Apr 2019
Differentiable MPC for End-to-end Planning and Control Brandon Amos I. D. Rodriguez Jacob Sacks Byron Boots J. Zico Kolter 60 366 0 31 Oct 2018
Stochastic Variance-Reduced Policy Gradient Matteo Papini Damiano Binaghi Giuseppe Canonaco Matteo Pirotta Marcello Restelli 54 174 0 14 Jun 2018
Accelerating Imitation Learning with Predictive Models Ching-An Cheng Xinyan Yan Evangelos A. Theodorou Byron Boots 48 21 0 12 Jun 2018
Dual Policy Iteration Wen Sun Geoffrey J. Gordon Byron Boots J. Andrew Bagnell OffRL 76 56 0 28 May 2018
Fast Policy Learning through Imitation and Reinforcement Ching-An Cheng Xinyan Yan Nolan Wagener Byron Boots 46 83 0 26 May 2018
Sim-to-Real: Learning Agile Locomotion For Quadruped Robots Jie Tan Tingnan Zhang Erwin Coumans Atil Iscen Yunfei Bai Danijar Hafner Steven Bohez Vincent Vanhoucke 70 798 0 27 Apr 2018
Universal Planning Networks A. Srinivas Allan Jabri Pieter Abbeel Sergey Levine Chelsea Finn SSL 61 145 0 02 Apr 2018
Convergence of Value Aggregation for Imitation Learning Ching-An Cheng Byron Boots 40 28 0 22 Jan 2018
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm David Silver Thomas Hubert Julian Schrittwieser Ioannis Antonoglou Matthew Lai ... D. Kumaran T. Graepel Timothy Lillicrap Karen Simonyan Demis Hassabis 105 1,755 0 05 Dec 2017
Backpropagation through the Void: Optimizing control variates for black-box gradient estimation Will Grathwohl Dami Choi Yuhuai Wu Geoffrey Roeder David Duvenaud 84 300 0 31 Oct 2017
Learning model-based planning from scratch Razvan Pascanu Yujia Li Oriol Vinyals N. Heess Lars Buesing S. Racanière David P. Reichert T. Weber Daan Wierstra Peter W. Battaglia LM&Ro 86 97 0 19 Jul 2017
Value Prediction Network Junhyuk Oh Satinder Singh Honglak Lee 65 332 0 11 Jul 2017
A Unified Approach to Adaptive Regularization in Online and Stochastic Optimization Vineet Gupta Tomer Koren Y. Singer 17 21 0 20 Jun 2017
Thinking Fast and Slow with Deep Learning and Tree Search Thomas W. Anthony Zheng Tian David Barber 78 387 0 23 May 2017
Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning Yevgen Chebotar Karol Hausman Marvin Zhang Gaurav Sukhatme S. Schaal Sergey Levine 61 160 0 08 Mar 2017
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction Wen Sun Arun Venkatraman Geoffrey J. Gordon Byron Boots J. Andrew Bagnell 110 235 0 03 Mar 2017
The Predictron: End-To-End Learning and Planning David Silver H. V. Hasselt Matteo Hessel Tom Schaul A. Guez ... Gabriel Dulac-Arnold David P. Reichert Neil C. Rabinowitz André Barreto T. Degris 50 289 0 28 Dec 2016
Benchmarking Deep Reinforcement Learning for Continuous Control Yan Duan Xi Chen Rein Houthooft John Schulman Pieter Abbeel OffRL 66 1,689 0 22 Apr 2016
High-Dimensional Continuous Control Using Generalized Advantage Estimation John Schulman Philipp Moritz Sergey Levine Michael I. Jordan Pieter Abbeel OffRL 45 3,368 0 08 Jun 2015
Strongly Adaptive Online Learning Amit Daniely Alon Gonen Shai Shalev-Shwartz ODL 105 177 0 25 Feb 2015
Trust Region Policy Optimization John Schulman Sergey Levine Philipp Moritz Michael I. Jordan Pieter Abbeel 245 6,722 0 19 Feb 2015
Adam: A Method for Stochastic Optimization Diederik P. Kingma Jimmy Ba ODL 907 149,474 0 22 Dec 2014
Reinforcement and Imitation Learning via Interactive No-Regret Learning Stéphane Ross J. Andrew Bagnell OffRL 85 262 0 23 Jun 2014
Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu David Silver Alex Graves Ioannis Antonoglou Daan Wierstra Martin Riedmiller 103 12,163 0 19 Dec 2013
Optimization, Learning, and Games with Predictable Sequences Alexander Rakhlin Karthik Sridharan 61 377 0 08 Nov 2013
ADADELTA: An Adaptive Learning Rate Method Matthew D. Zeiler ODL 115 6,619 0 22 Dec 2012
Online Learning with Predictable Sequences Alexander Rakhlin Karthik Sridharan 114 355 0 18 Aug 2012
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping R. Sutton Csaba Szepesvári A. Geramifard Michael Bowling OffRL 65 203 0 13 Jun 2012
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning Stéphane Ross Geoffrey J. Gordon J. Andrew Bagnell OffRL 166 3,196 0 02 Nov 2010
Adaptive Bound Optimization for Online Convex Optimization H. B. McMahan Matthew J. Streeter ODL 80 386 0 26 Feb 2010
Solving variational inequalities with Stochastic Mirror-Prox algorithm A. Juditsky A. Nemirovskii Claire Tauvel 109 441 0 04 Sep 2008