Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.14055
Cited By
v1
v2 (latest)
Policy Continuation with Hindsight Inverse Dynamics
30 October 2019
Hao Sun
Zhizhong Li
Xiaotong Liu
Dahua Lin
Bolei Zhou
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Policy Continuation with Hindsight Inverse Dynamics"
18 / 18 papers shown
Title
Self-Imitation Learning by Planning
Junhyuk Oh
Yijie Guo
Satinder Singh
SSL
139
85
0
25 Mar 2021
Exploration by Random Network Distillation
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
159
1,342
0
30 Oct 2018
Large-Scale Study of Curiosity-Driven Learning
Yuri Burda
Harrison Edwards
Deepak Pathak
Amos Storkey
Trevor Darrell
Alexei A. Efros
LRM
72
706
0
13 Aug 2018
Hierarchical Reinforcement Learning with Hindsight
Andrew Levy
Robert Platt
Kate Saenko
72
84
0
21 May 2018
Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research
Matthias Plappert
Marcin Andrychowicz
Alex Ray
Bob McGrew
Bowen Baker
...
Joshua Tobin
Maciek Chociej
Peter Welinder
Vikash Kumar
Wojciech Zaremba
66
572
0
26 Feb 2018
Hindsight policy gradients
Paulo E. Rauber
Avinash Ummadisingu
Filipe Wall Mutz
J. Schmidhuber
59
68
0
16 Nov 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
OffRL
94
788
0
28 Sep 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
526
19,237
0
20 Jul 2017
Reverse Curriculum Generation for Reinforcement Learning
Carlos Florensa
David Held
Markus Wulfmeier
Michael Zhang
Pieter Abbeel
76
445
0
17 Jul 2017
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
268
2,337
0
05 Jul 2017
Automatic Goal Generation for Reinforcement Learning Agents
Carlos Florensa
David Held
Xinyang Geng
Pieter Abbeel
107
515
0
17 May 2017
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
113
2,449
0
15 May 2017
Loss is its own Reward: Self-Supervision for Reinforcement Learning
Evan Shelhamer
Parsa Mahmoudieh
Max Argus
Trevor Darrell
SSL
83
186
0
21 Dec 2016
Learning to Navigate in Complex Environments
Piotr Wojciech Mirowski
Razvan Pascanu
Fabio Viola
Hubert Soyer
Andy Ballard
...
Ross Goroshin
Laurent Sifre
Koray Kavukcuoglu
D. Kumaran
R. Hadsell
107
880
0
11 Nov 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
84
1,695
0
22 Apr 2016
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
79
654
0
09 Feb 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,322
0
10 Dec 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
323
13,272
0
09 Sep 2015
1