Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.13736
Cited By
v1
v2
v3 (latest)
Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy
29 September 2020
Yunshu Du
Garrett A. Warnell
A. Gebremedhin
Peter Stone
Matthew E. Taylor
Re-assign community
ArXiv (abs)
PDF
HTML
Github (5★)
Papers citing
"Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy"
33 / 33 papers shown
Title
Self-Imitation Learning by Planning
Junhyuk Oh
Yijie Guo
Satinder Singh
SSL
139
85
0
25 Mar 2021
Revisiting Fundamentals of Experience Replay
W. Fedus
Prajit Ramachandran
Rishabh Agarwal
Yoshua Bengio
Hugo Larochelle
Mark Rowland
Will Dabney
KELM
OffRL
73
241
0
13 Jul 2020
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
79
56
0
23 Jun 2020
The Sample Complexity of Teaching-by-Reinforcement on Q-Learning
Xuezhou Zhang
S. Bharti
Yuzhe Ma
Adish Singla
Xiaojin Zhu
94
6
0
16 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
94
24
0
12 Jun 2020
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
79
362
0
27 Apr 2020
Experience Replay Optimization
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
OffRL
47
103
0
19 Jun 2019
Combining Experience Replay with Exploration by Random Network Distillation
Francesco Sovrano
50
15
0
18 May 2019
Jointly Pre-training with Supervised, Autoencoder, and Value Losses for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
OffRL
36
4
0
03 Apr 2019
ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning
Harris Chan
Yuhuai Wu
J. Kiros
Sanja Fidler
Jimmy Ba
78
34
0
12 Feb 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
88
370
0
30 Jan 2019
Pre-training with Non-expert Human Demonstration for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
OffRL
41
26
0
21 Dec 2018
Learning Montezuma's Revenge from a Single Demonstration
Tim Salimans
Richard J. Chen
110
139
0
08 Dec 2018
Backplay: "Man muss immer umkehren"
Cinjon Resnick
R. Raileanu
Sanyam Kapoor
Alex Peysakhovich
Kyunghyun Cho
Joan Bruna
OffRL
62
45
0
18 Jul 2018
Remember and Forget for Experience Replay
G. Novati
Petros Koumoutsakos
OffRL
66
91
0
16 Jul 2018
Observe and Look Further: Achieving Consistent Performance on Atari
Tobias Pohlen
Bilal Piot
Todd Hester
M. G. Azar
Dan Horgan
...
John Quan
Mel Vecerík
Matteo Hessel
Rémi Munos
Olivier Pietquin
53
121
0
29 May 2018
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
63
68
0
25 May 2018
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
147
741
0
02 Mar 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
220
1,605
0
05 Feb 2018
A Deeper Look at Experience Replay
Shangtong Zhang
R. Sutton
OffRL
VLM
70
275
0
04 Dec 2017
The Effects of Memory Replay in Reinforcement Learning
Ruishan Liu
James Zou
VLM
42
112
0
18 Oct 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
OffRL
92
788
0
28 Sep 2017
Reverse Curriculum Generation for Reinforcement Learning
Carlos Florensa
David Held
Markus Wulfmeier
Michael Zhang
Pieter Abbeel
74
445
0
17 Jul 2017
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
262
2,337
0
05 Jul 2017
Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening
Frank S. He
Yang Liu
Alex Schwing
Jian-wei Peng
67
84
0
05 Nov 2016
Sample Efficient Actor-Critic with Experience Replay
Ziyun Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
102
762
0
03 Nov 2016
Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay
Ionel-Alexandru Hosu
Traian Rebedea
71
97
0
18 Jul 2016
Safe and Efficient Off-Policy Reinforcement Learning
Rémi Munos
T. Stepleton
Anna Harutyunyan
Marc G. Bellemare
OffRL
138
617
0
08 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
176
1,483
0
06 Jun 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
202
8,875
0
04 Feb 2016
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
223
3,789
0
18 Nov 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
323
13,272
0
09 Sep 2015
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
120
3,020
0
19 Jul 2012
1