Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.06509
Cited By
Predictor-Corrector Policy Optimization
15 October 2018
Ching-An Cheng
Xinyan Yan
Nathan D. Ratliff
Byron Boots
OnRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Predictor-Corrector Policy Optimization"
35 / 35 papers shown
Title
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Patrick Yin
Tyler Westenbroek
Simran Bagaria
Kevin Huang
Ching-an Cheng
Andrey Kobolov
Abhishek Gupta
113
3
0
04 Feb 2025
Functional Acceleration for Policy Mirror Descent
Veronica Chelu
Doina Precup
44
0
0
23 Jul 2024
Introduction to Online Convex Optimization
Elad Hazan
OffRL
90
1,922
0
07 Sep 2019
On the Convergence of Adam and Beyond
Sashank J. Reddi
Satyen Kale
Surinder Kumar
58
2,482
0
19 Apr 2019
Differentiable MPC for End-to-end Planning and Control
Brandon Amos
I. D. Rodriguez
Jacob Sacks
Byron Boots
J. Zico Kolter
60
366
0
31 Oct 2018
Stochastic Variance-Reduced Policy Gradient
Matteo Papini
Damiano Binaghi
Giuseppe Canonaco
Matteo Pirotta
Marcello Restelli
54
174
0
14 Jun 2018
Accelerating Imitation Learning with Predictive Models
Ching-An Cheng
Xinyan Yan
Evangelos A. Theodorou
Byron Boots
48
21
0
12 Jun 2018
Dual Policy Iteration
Wen Sun
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
OffRL
76
56
0
28 May 2018
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
46
83
0
26 May 2018
Sim-to-Real: Learning Agile Locomotion For Quadruped Robots
Jie Tan
Tingnan Zhang
Erwin Coumans
Atil Iscen
Yunfei Bai
Danijar Hafner
Steven Bohez
Vincent Vanhoucke
70
798
0
27 Apr 2018
Universal Planning Networks
A. Srinivas
Allan Jabri
Pieter Abbeel
Sergey Levine
Chelsea Finn
SSL
61
145
0
02 Apr 2018
Convergence of Value Aggregation for Imitation Learning
Ching-An Cheng
Byron Boots
40
28
0
22 Jan 2018
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
...
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
105
1,755
0
05 Dec 2017
Backpropagation through the Void: Optimizing control variates for black-box gradient estimation
Will Grathwohl
Dami Choi
Yuhuai Wu
Geoffrey Roeder
David Duvenaud
84
300
0
31 Oct 2017
Learning model-based planning from scratch
Razvan Pascanu
Yujia Li
Oriol Vinyals
N. Heess
Lars Buesing
S. Racanière
David P. Reichert
T. Weber
Daan Wierstra
Peter W. Battaglia
LM&Ro
86
97
0
19 Jul 2017
Value Prediction Network
Junhyuk Oh
Satinder Singh
Honglak Lee
65
332
0
11 Jul 2017
A Unified Approach to Adaptive Regularization in Online and Stochastic Optimization
Vineet Gupta
Tomer Koren
Y. Singer
17
21
0
20 Jun 2017
Thinking Fast and Slow with Deep Learning and Tree Search
Thomas W. Anthony
Zheng Tian
David Barber
78
387
0
23 May 2017
Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning
Yevgen Chebotar
Karol Hausman
Marvin Zhang
Gaurav Sukhatme
S. Schaal
Sergey Levine
61
160
0
08 Mar 2017
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction
Wen Sun
Arun Venkatraman
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
110
235
0
03 Mar 2017
The Predictron: End-To-End Learning and Planning
David Silver
H. V. Hasselt
Matteo Hessel
Tom Schaul
A. Guez
...
Gabriel Dulac-Arnold
David P. Reichert
Neil C. Rabinowitz
André Barreto
T. Degris
50
289
0
28 Dec 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
66
1,689
0
22 Apr 2016
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
45
3,368
0
08 Jun 2015
Strongly Adaptive Online Learning
Amit Daniely
Alon Gonen
Shai Shalev-Shwartz
ODL
105
177
0
25 Feb 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
245
6,722
0
19 Feb 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
907
149,474
0
22 Dec 2014
Reinforcement and Imitation Learning via Interactive No-Regret Learning
Stéphane Ross
J. Andrew Bagnell
OffRL
85
262
0
23 Jun 2014
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
103
12,163
0
19 Dec 2013
Optimization, Learning, and Games with Predictable Sequences
Alexander Rakhlin
Karthik Sridharan
61
377
0
08 Nov 2013
ADADELTA: An Adaptive Learning Rate Method
Matthew D. Zeiler
ODL
115
6,619
0
22 Dec 2012
Online Learning with Predictable Sequences
Alexander Rakhlin
Karthik Sridharan
114
355
0
18 Aug 2012
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
R. Sutton
Csaba Szepesvári
A. Geramifard
Michael Bowling
OffRL
65
203
0
13 Jun 2012
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
166
3,196
0
02 Nov 2010
Adaptive Bound Optimization for Online Convex Optimization
H. B. McMahan
Matthew J. Streeter
ODL
80
386
0
26 Feb 2010
Solving variational inequalities with Stochastic Mirror-Prox algorithm
A. Juditsky
A. Nemirovskii
Claire Tauvel
109
441
0
04 Sep 2008
1