Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.04614
Cited By
Guided Policy Search as Approximate Mirror Descent
15 July 2016
William H. Montgomery
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Guided Policy Search as Approximate Mirror Descent"
25 / 25 papers shown
Title
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Zishun Yu
Tengyu Xu
Di Jin
Karthik Abinav Sankararaman
Yun He
...
Eryk Helenowski
Chen Zhu
Sinong Wang
Hao Ma
Han Fang
LRM
56
5
0
29 Jan 2025
Recent Advances in Path Integral Control for Trajectory Optimization: An Overview in Theoretical and Algorithmic Perspectives
Muhammad Kazim
JunGee Hong
Min-Gyeom Kim
Kwang-Ki K. Kim
44
16
0
22 Sep 2023
Tutorial on amortized optimization
Brandon Amos
OffRL
78
43
0
01 Feb 2022
Follow Your Path: a Progressive Method for Knowledge Distillation
Wenxian Shi
Yuxuan Song
Hao Zhou
Bohan Li
Lei Li
17
15
0
20 Jul 2021
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned
Julian Ibarz
Jie Tan
Chelsea Finn
Mrinal Kalakrishnan
P. Pastor
Sergey Levine
OffRL
16
520
0
04 Feb 2021
Neurosymbolic Transformers for Multi-Agent Communication
J. Inala
Yichen Yang
James Paulos
Yewen Pu
Osbert Bastani
Vijay Kumar
Martin Rinard
Armando Solar-Lezama
32
24
0
05 Jan 2021
Behavior Priors for Efficient Reinforcement Learning
Dhruva Tirumala
Alexandre Galashov
Hyeonwoo Noh
Leonard Hasenclever
Razvan Pascanu
...
Guillaume Desjardins
Wojciech M. Czarnecki
Arun Ahuja
Yee Whye Teh
N. Heess
37
39
0
27 Oct 2020
Deterministic Value-Policy Gradients
Qingpeng Cai
L. Pan
Pingzhong Tang
29
1
0
09 Sep 2019
Direct Policy Gradients: Direct Optimization of Policies in Discrete Action Spaces
Guy Lorberbom
Chris J. Maddison
N. Heess
Tamir Hazan
Daniel Tarlow
34
8
0
14 Jun 2019
Relative Entropy Regularized Policy Iteration
A. Abdolmaleki
Jost Tobias Springenberg
Jonas Degrave
Steven Bohez
Yuval Tassa
Dan Belov
N. Heess
Martin Riedmiller
27
72
0
05 Dec 2018
Policy Optimization with Model-based Explorations
Feiyang Pan
Qingpeng Cai
Anxiang Zeng
C. Pan
Qing Da
Hua-Lin He
Qing He
Pingzhong Tang
30
11
0
18 Nov 2018
Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Jiechao Xiong
Qing Wang
Zhuoran Yang
Peng Sun
Lei Han
Yang Zheng
Haobo Fu
Tong Zhang
Ji Liu
Han Liu
37
169
0
10 Oct 2018
Learning Robust Manipulation Skills with Guided Policy Search via Generative Motor Reflexes
Philipp Ennen
Pia Bresenitz
R. Vossen
F. Hees
22
8
0
15 Sep 2018
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
48
471
0
14 Jun 2018
Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control
Yangchen Pan
Amir-massoud Farahmand
Martha White
S. Nabi
P. Grover
D. Nikovski
51
18
0
13 Jun 2018
Dual Policy Iteration
Wen Sun
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
OffRL
20
56
0
28 May 2018
Policy Search in Continuous Action Domains: an Overview
Olivier Sigaud
F. Stulp
16
72
0
13 Mar 2018
A Policy Search Method For Temporal Logic Specified Reinforcement Learning Tasks
Xiao Li
Yao Ma
C. Belta
24
59
0
27 Sep 2017
Transferring End-to-End Visuomotor Control from Simulation to Real World for a Multi-Stage Task
Stephen James
Andrew J. Davison
Edward Johns
162
275
0
07 Jul 2017
Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning
Yevgen Chebotar
Karol Hausman
Marvin Zhang
Gaurav Sukhatme
S. Schaal
Sergey Levine
37
159
0
08 Mar 2017
Generalizing Skills with Semi-Supervised Reinforcement Learning
Chelsea Finn
Tianhe Yu
Justin Fu
Pieter Abbeel
Sergey Levine
OffRL
SSL
35
68
0
01 Dec 2016
Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics
J. M. Wong
27
11
0
01 Nov 2016
Collective Robot Reinforcement Learning with Distributed Asynchronous Guided Policy Search
Ali Yahya
A. Li
Mrinal Kalakrishnan
Yevgen Chebotar
Sergey Levine
OffRL
26
155
0
03 Oct 2016
Path Integral Guided Policy Search
Yevgen Chebotar
Mrinal Kalakrishnan
Ali Yahya
A. Li
S. Schaal
Sergey Levine
36
149
0
03 Oct 2016
Deep Reinforcement Learning for Tensegrity Robot Locomotion
Marvin Zhang
Xinyang Geng
J. Bruce
Ken Caluwaerts
Massimo Vespignani
Vytas SunSpiral
Pieter Abbeel
Sergey Levine
28
92
0
28 Sep 2016
1