ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.13736
  4. Cited By
Lucid Dreaming for Experience Replay: Refreshing Past States with the
  Current Policy
v1v2v3 (latest)

Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy

29 September 2020
Yunshu Du
Garrett A. Warnell
A. Gebremedhin
Peter Stone
Matthew E. Taylor
ArXiv (abs)PDFHTMLGithub (5★)

Papers citing "Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy"

33 / 33 papers shown
Title
Self-Imitation Learning by Planning
Self-Imitation Learning by Planning
Junhyuk Oh
Yijie Guo
Satinder Singh
SSL
139
85
0
25 Mar 2021
Revisiting Fundamentals of Experience Replay
Revisiting Fundamentals of Experience Replay
W. Fedus
Prajit Ramachandran
Rishabh Agarwal
Yoshua Bengio
Hugo Larochelle
Mark Rowland
Will Dabney
KELMOffRL
73
241
0
13 Jul 2020
Experience Replay with Likelihood-free Importance Weights
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
79
56
0
23 Jun 2020
The Sample Complexity of Teaching-by-Reinforcement on Q-Learning
The Sample Complexity of Teaching-by-Reinforcement on Q-Learning
Xuezhou Zhang
S. Bharti
Yuzhe Ma
Adish Singla
Xiaojin Zhu
94
6
0
16 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
94
24
0
12 Jun 2020
First return, then explore
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
79
362
0
27 Apr 2020
Experience Replay Optimization
Experience Replay Optimization
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
OffRL
47
103
0
19 Jun 2019
Combining Experience Replay with Exploration by Random Network
  Distillation
Combining Experience Replay with Exploration by Random Network Distillation
Francesco Sovrano
50
15
0
18 May 2019
Jointly Pre-training with Supervised, Autoencoder, and Value Losses for
  Deep Reinforcement Learning
Jointly Pre-training with Supervised, Autoencoder, and Value Losses for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
OffRL
36
4
0
03 Apr 2019
ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal
  Reinforcement Learning
ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning
Harris Chan
Yuhuai Wu
J. Kiros
Sanja Fidler
Jimmy Ba
78
34
0
12 Feb 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
88
370
0
30 Jan 2019
Pre-training with Non-expert Human Demonstration for Deep Reinforcement
  Learning
Pre-training with Non-expert Human Demonstration for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
OffRL
41
26
0
21 Dec 2018
Learning Montezuma's Revenge from a Single Demonstration
Learning Montezuma's Revenge from a Single Demonstration
Tim Salimans
Richard J. Chen
110
139
0
08 Dec 2018
Backplay: "Man muss immer umkehren"
Backplay: "Man muss immer umkehren"
Cinjon Resnick
R. Raileanu
Sanyam Kapoor
Alex Peysakhovich
Kyunghyun Cho
Joan Bruna
OffRL
62
45
0
18 Jul 2018
Remember and Forget for Experience Replay
Remember and Forget for Experience Replay
G. Novati
Petros Koumoutsakos
OffRL
66
91
0
16 Jul 2018
Observe and Look Further: Achieving Consistent Performance on Atari
Observe and Look Further: Achieving Consistent Performance on Atari
Tobias Pohlen
Bilal Piot
Todd Hester
M. G. Azar
Dan Horgan
...
John Quan
Mel Vecerík
Matteo Hessel
Rémi Munos
Olivier Pietquin
53
121
0
29 May 2018
Learning Self-Imitating Diverse Policies
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
63
68
0
25 May 2018
Distributed Prioritized Experience Replay
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
147
741
0
02 Mar 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
220
1,605
0
05 Feb 2018
A Deeper Look at Experience Replay
A Deeper Look at Experience Replay
Shangtong Zhang
R. Sutton
OffRLVLM
70
275
0
04 Dec 2017
The Effects of Memory Replay in Reinforcement Learning
The Effects of Memory Replay in Reinforcement Learning
Ruishan Liu
James Zou
VLM
42
112
0
18 Oct 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
OffRL
92
788
0
28 Sep 2017
Reverse Curriculum Generation for Reinforcement Learning
Reverse Curriculum Generation for Reinforcement Learning
Carlos Florensa
David Held
Markus Wulfmeier
Michael Zhang
Pieter Abbeel
74
445
0
17 Jul 2017
Hindsight Experience Replay
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
262
2,337
0
05 Jul 2017
Learning to Play in a Day: Faster Deep Reinforcement Learning by
  Optimality Tightening
Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening
Frank S. He
Yang Liu
Alex Schwing
Jian-wei Peng
67
84
0
05 Nov 2016
Sample Efficient Actor-Critic with Experience Replay
Sample Efficient Actor-Critic with Experience Replay
Ziyun Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
102
762
0
03 Nov 2016
Playing Atari Games with Deep Reinforcement Learning and Human
  Checkpoint Replay
Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay
Ionel-Alexandru Hosu
Traian Rebedea
71
97
0
18 Jul 2016
Safe and Efficient Off-Policy Reinforcement Learning
Safe and Efficient Off-Policy Reinforcement Learning
Rémi Munos
T. Stepleton
Anna Harutyunyan
Marc G. Bellemare
OffRL
138
617
0
08 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
176
1,483
0
06 Jun 2016
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
202
8,875
0
04 Feb 2016
Prioritized Experience Replay
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
223
3,789
0
18 Nov 2015
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
323
13,272
0
09 Sep 2015
The Arcade Learning Environment: An Evaluation Platform for General
  Agents
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
120
3,020
0
19 Jul 2012
1