Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2007.06700
Cited By
Revisiting Fundamentals of Experience Replay
13 July 2020
W. Fedus
Prajit Ramachandran
Rishabh Agarwal
Yoshua Bengio
Hugo Larochelle
Mark Rowland
Will Dabney
KELM
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Revisiting Fundamentals of Experience Replay"
41 / 41 papers shown
Title
Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation
Mohit Pandey
G. Subbaraj
Artem Cherkasov
Martin Ester
Emmanuel Bengio
AI4CE
95
1
0
08 Mar 2025
SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models
Yuxuan Zhang
CLL
ALM
107
1
0
25 Feb 2025
Solving Finite-Horizon MDPs via Low-Rank Tensors
Sergio Rozada
Jose Luis Orejuela
Antonio G. Marques
64
0
0
17 Jan 2025
In-Context Experience Replay Facilitates Safety Red-Teaming of Text-to-Image Diffusion Models
Zhi-Yi Chin
Kuan-Chen Mu
Mario Fritz
Pin-Yu Chen
DiffM
111
1
0
25 Nov 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
72
4
0
11 Oct 2024
RetroGFN: Diverse and Feasible Retrosynthesis using GFlowNets
Piotr Gaiñski
Michał Koziarski
Krzysztof Maziarz
Marwin H. S. Segler
Jacek Tabor
Marek Śmieja
80
3
0
26 Jun 2024
Optimal Robotic Assembly Sequence Planning: A Sequential Decision-Making Approach
Kartik Nagpal
Negar Mehr
52
0
0
26 Oct 2023
Networked Communication for Decentralised Agents in Mean-Field Games
Patrick Benjamin
Alessandro Abate
FedML
75
2
0
05 Jun 2023
Combining Q-Learning and Search with Amortized Value Estimates
Jessica B. Hamrick
V. Bapst
Alvaro Sanchez-Gonzalez
Tobias Pfaff
T. Weber
Lars Buesing
Peter W. Battaglia
OffRL
52
48
0
05 Dec 2019
Ranking Policy Gradient
Kaixiang Lin
Jiayu Zhou
OffRL
31
7
0
24 Jun 2019
Experience Replay Optimization
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
OffRL
28
102
0
19 Jun 2019
When to use parametric models in reinforcement learning?
H. V. Hasselt
Matteo Hessel
John Aslanides
67
192
0
12 Jun 2019
Importance Resampling for Off-policy Prediction
M. Schlegel
Wesley Chung
Daniel Graves
Jian Qian
Martha White
OffRL
42
41
0
11 Jun 2019
Diagnosing Bottlenecks in Deep Q-learning Algorithms
Justin Fu
Aviral Kumar
Matthew Soh
Sergey Levine
OffRL
56
142
0
26 Feb 2019
Hyperbolic Discounting and Learning over Multiple Horizons
W. Fedus
Carles Gelada
Yoshua Bengio
Marc G. Bellemare
Hugo Larochelle
48
106
0
19 Feb 2019
Dopamine: A Research Framework for Deep Reinforcement Learning
Pablo Samuel Castro
Subhodeep Moitra
Carles Gelada
Saurabh Kumar
Marc G. Bellemare
OffRL
47
277
0
14 Dec 2018
Deep Reinforcement Learning and the Deadly Triad
H. V. Hasselt
Yotam Doron
Florian Strub
Matteo Hessel
Nicolas Sonnerat
Joseph Modayil
OffRL
63
226
0
06 Dec 2018
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRL
AI4CE
104
1,242
0
30 Nov 2018
Remember and Forget for Experience Replay
G. Novati
Petros Koumoutsakos
OffRL
59
90
0
16 Jul 2018
Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-based Planning in Continuous State Domains
Yangchen Pan
M. Zaheer
Adam White
Andrew Patterson
Martha White
49
46
0
12 Jun 2018
Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update
Su Young Lee
Sung-Ik Choi
Sae-Young Chung
BDL
43
74
0
31 May 2018
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
134
736
0
02 Mar 2018
A Deeper Look at Experience Replay
Shangtong Zhang
R. Sutton
OffRL
VLM
61
271
0
04 Dec 2017
Distributional Reinforcement Learning with Quantile Regression
Will Dabney
Mark Rowland
Marc G. Bellemare
Rémi Munos
74
756
0
27 Oct 2017
The Effects of Memory Replay in Reinforcement Learning
Ruishan Liu
James Zou
VLM
30
111
0
18 Oct 2017
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
94
2,255
0
06 Oct 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
60
549
0
18 Sep 2017
A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
69
1,497
0
21 Jul 2017
Noisy Networks for Exploration
Meire Fortunato
M. G. Azar
Bilal Piot
Jacob Menick
Ian Osband
...
Rémi Munos
Demis Hassabis
Olivier Pietquin
Charles Blundell
Shane Legg
66
890
0
30 Jun 2017
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
43
1,225
0
16 Nov 2016
Sample Efficient Actor-Critic with Experience Replay
Ziyun Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
85
757
0
03 Nov 2016
Safe and Efficient Off-Policy Reinforcement Learning
Rémi Munos
T. Stepleton
Anna Harutyunyan
Marc G. Bellemare
OffRL
119
611
0
08 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
162
1,465
0
06 Jun 2016
Q(
λ
λ
λ
) with Off-Policy Corrections
Anna Harutyunyan
Marc G. Bellemare
T. Stepleton
Rémi Munos
OffRL
39
95
0
16 Feb 2016
Dueling Network Architectures for Deep Reinforcement Learning
Ziyun Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
60
3,742
0
20 Nov 2015
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
198
3,781
0
18 Nov 2015
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
131
7,590
0
22 Sep 2015
Deep Recurrent Q-Learning for Partially Observable MDPs
Matthew J. Hausknecht
Peter Stone
86
1,668
0
23 Jul 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
844
149,474
0
22 Dec 2014
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
103
12,163
0
19 Dec 2013
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
78
2,992
0
19 Jul 2012
1