Predictor-Corrector Policy Optimization


15 October 2018
Ching-An Cheng
Xinyan Yan
Nathan D. Ratliff
Byron Boots
    OnRL

Papers citing "Predictor-Corrector Policy Optimization"

35 / 35 papers shown
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Patrick Yin
Tyler Westenbroek
Simran Bagaria
Kevin Huang
Ching-An Cheng
Andrey Kolobov
Abhishek Gupta
113
3
0
04 Feb 2025
Functional Acceleration for Policy Mirror Descent
Veronica Chelu
Doina Precup
44
0
0
23 Jul 2024
Introduction to Online Convex Optimization
Elad Hazan
OffRL
90
1,922
0
07 Sep 2019
On the Convergence of Adam and Beyond
Sashank J. Reddi
Satyen Kale
Sanjiv Kumar
58
2,482
0
19 Apr 2019
Differentiable MPC for End-to-end Planning and Control
Brandon Amos
I. D. Rodriguez
Jacob Sacks
Byron Boots
J. Zico Kolter
60
366
0
31 Oct 2018
Stochastic Variance-Reduced Policy Gradient
Matteo Papini
Damiano Binaghi
Giuseppe Canonaco
Matteo Pirotta
Marcello Restelli
54
174
0
14 Jun 2018
Accelerating Imitation Learning with Predictive Models
Ching-An Cheng
Xinyan Yan
Evangelos A. Theodorou
Byron Boots
48
21
0
12 Jun 2018
Dual Policy Iteration
Wen Sun
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
OffRL
76
56
0
28 May 2018
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
46
83
0
26 May 2018
Sim-to-Real: Learning Agile Locomotion For Quadruped Robots
Jie Tan
Tingnan Zhang
Erwin Coumans
Atil Iscen
Yunfei Bai
Danijar Hafner
Steven Bohez
Vincent Vanhoucke
70
798
0
27 Apr 2018
Universal Planning Networks
A. Srinivas
Allan Jabri
Pieter Abbeel
Sergey Levine
Chelsea Finn
SSL
61
145
0
02 Apr 2018
Convergence of Value Aggregation for Imitation Learning
Ching-An Cheng
Byron Boots
40
28
0
22 Jan 2018
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
...
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
105
1,755
0
05 Dec 2017
Backpropagation through the Void: Optimizing control variates for black-box gradient estimation
Will Grathwohl
Dami Choi
Yuhuai Wu
Geoffrey Roeder
David Duvenaud
84
300
0
31 Oct 2017
Learning model-based planning from scratch
Razvan Pascanu
Yujia Li
Oriol Vinyals
N. Heess
Lars Buesing
S. Racanière
David P. Reichert
T. Weber
Daan Wierstra
Peter W. Battaglia
LM&Ro
86
97
0
19 Jul 2017
Value Prediction Network
Junhyuk Oh
Satinder Singh
Honglak Lee
65
332
0
11 Jul 2017
A Unified Approach to Adaptive Regularization in Online and Stochastic Optimization
Vineet Gupta
Tomer Koren
Y. Singer
17
21
0
20 Jun 2017
Thinking Fast and Slow with Deep Learning and Tree Search
Thomas W. Anthony
Zheng Tian
David Barber
78
387
0
23 May 2017
Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning
Yevgen Chebotar
Karol Hausman
Marvin Zhang
Gaurav Sukhatme
S. Schaal
Sergey Levine
61
160
0
08 Mar 2017
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction
Wen Sun
Arun Venkatraman
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
110
235
0
03 Mar 2017
The Predictron: End-To-End Learning and Planning
David Silver
H. V. Hasselt
Matteo Hessel
Tom Schaul
A. Guez
...
Gabriel Dulac-Arnold
David P. Reichert
Neil C. Rabinowitz
André Barreto
T. Degris
50
289
0
28 Dec 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
66
1,689
0
22 Apr 2016
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
45
3,368
0
08 Jun 2015
Strongly Adaptive Online Learning
Amit Daniely
Alon Gonen
Shai Shalev-Shwartz
ODL
105
177
0
25 Feb 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
245
6,722
0
19 Feb 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
907
149,474
0
22 Dec 2014
Reinforcement and Imitation Learning via Interactive No-Regret Learning
Stéphane Ross
J. Andrew Bagnell
OffRL
85
262
0
23 Jun 2014
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
103
12,163
0
19 Dec 2013
Optimization, Learning, and Games with Predictable Sequences
Alexander Rakhlin
Karthik Sridharan
61
377
0
08 Nov 2013
ADADELTA: An Adaptive Learning Rate Method
Matthew D. Zeiler
ODL
115
6,619
0
22 Dec 2012
Online Learning with Predictable Sequences
Alexander Rakhlin
Karthik Sridharan
114
355
0
18 Aug 2012
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
R. Sutton
Csaba Szepesvári
A. Geramifard
Michael Bowling
OffRL
65
203
0
13 Jun 2012
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
166
3,196
0
02 Nov 2010
Adaptive Bound Optimization for Online Convex Optimization
H. B. McMahan
Matthew J. Streeter
ODL
80
386
0
26 Feb 2010
Solving variational inequalities with Stochastic Mirror-Prox algorithm
A. Juditsky
A. Nemirovskii
Claire Tauvel
109
441
0
04 Sep 2008