ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.05952
  4. Cited By
Prioritized Experience Replay
v1v2v3v4 (latest)

Prioritized Experience Replay

18 November 2015
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Prioritized Experience Replay"

50 / 1,454 papers shown
Title
Achieving mouse-level strategic evasion performance using real-time
  computational planning
Achieving mouse-level strategic evasion performance using real-time computational planning
German Espinosa
Gabrielle E. Wink
Alexander T. Lai
D. Dombeck
Malcolm A. MacIver
88
3
0
04 Nov 2022
Path Planning Using Wassertein Distributionally Robust Deep Q-learning
Path Planning Using Wassertein Distributionally Robust Deep Q-learning
Cem Alptürk
Venkatraman Renganathan
OOD
46
0
0
04 Nov 2022
Spatial-temporal recurrent reinforcement learning for autonomous ships
Spatial-temporal recurrent reinforcement learning for autonomous ships
Martin Waltz
Ostap Okhrin
100
9
0
02 Nov 2022
Multi-Agent Reinforcement Learning for Adaptive Mesh Refinement
Multi-Agent Reinforcement Learning for Adaptive Mesh Refinement
Jiachen Yang
K. Mittal
T. Dzanic
S. Petrides
B. Keith
Brenden K. Petersen
Daniel Faissol
R. Anderson
71
8
0
02 Nov 2022
Event Tables for Efficient Experience Replay
Event Tables for Efficient Experience Replay
Varun Kompella
Thomas J. Walsh
Samuel Barrett
Peter R. Wurman
Peter Stone
OffRL
59
3
0
01 Nov 2022
Teacher-student curriculum learning for reinforcement learning
Teacher-student curriculum learning for reinforcement learning
Yanick Schraner
OffRL
85
2
0
31 Oct 2022
Using Contrastive Samples for Identifying and Leveraging Possible Causal
  Relationships in Reinforcement Learning
Using Contrastive Samples for Identifying and Leveraging Possible Causal Relationships in Reinforcement Learning
H. Khadilkar
Hardik Meisheri
OffRL
61
1
0
28 Oct 2022
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online
  Reinforcement Learning
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning
Yi Zhao
Rinu Boney
Alexander Ilin
Arno Solin
Joni Pajarinen
OffRLOnRL
89
40
0
25 Oct 2022
Machine and Deep Learning for IoT Security and Privacy: Applications,
  Challenges, and Future Directions
Machine and Deep Learning for IoT Security and Privacy: Applications, Challenges, and Future Directions
Subrato Bharati
Prajoy Podder
90
39
0
24 Oct 2022
MEET: A Monte Carlo Exploration-Exploitation Trade-off for Buffer
  Sampling
MEET: A Monte Carlo Exploration-Exploitation Trade-off for Buffer Sampling
Julius Ott
Lorenzo Servadei
Jose A. Arjona-Medina
E. Rinaldi
Gianfranco Mauro
Daniela Sanchez Lopera
Michael Stephan
Thomas Stadelmayer
Avik Santra
Robert Wille
66
0
0
24 Oct 2022
Climate Change Policy Exploration using Reinforcement Learning
Climate Change Policy Exploration using Reinforcement Learning
Theodore Wolf
49
1
0
23 Oct 2022
Solving Continuous Control via Q-learning
Solving Continuous Control via Q-learning
Tim Seyde
Peter Werner
Wilko Schwarting
Igor Gilitschenski
Martin Riedmiller
Daniela Rus
Markus Wulfmeier
OffRLLRM
90
23
0
22 Oct 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Andrei A. Rusu
Sebastian Flennerhag
Dushyant Rao
Razvan Pascanu
R. Hadsell
74
6
0
22 Oct 2022
Bridging the Gap Between Target Networks and Functional Regularization
Alexandre Piché
Valentin Thomas
Joseph Marino
Rafael Pardiñas
Gian Maria Marconi
C. Pal
Mohammad Emtiyaz Khan
63
2
0
21 Oct 2022
Deep Reinforcement Learning for Stabilization of Large-scale
  Probabilistic Boolean Networks
Deep Reinforcement Learning for Stabilization of Large-scale Probabilistic Boolean Networks
S. Moschoyiannis
Evangelos Chatzaroulas
Vytenis Sliogeris
Yuhu Wu
BDLOffRLAI4CE
62
8
0
21 Oct 2022
online and lightweight kernel-based approximated policy iteration for
  dynamic p-norm linear adaptive filtering
online and lightweight kernel-based approximated policy iteration for dynamic p-norm linear adaptive filtering
Yuki Akiyama
Minh Nhat Vu
Konstantinos Slavakis
58
1
0
21 Oct 2022
Deep reinforcement learning oriented for real world dynamic scenarios
Deep reinforcement learning oriented for real world dynamic scenarios
Diego Martinez
L. Riazuelo
Luis Montano
47
1
0
20 Oct 2022
Dynamic selection of p-norm in linear adaptive filtering via online
  kernel-based reinforcement learning
Dynamic selection of p-norm in linear adaptive filtering via online kernel-based reinforcement learning
Minh Nhat Vu
Yuki Akiyama
Konstantinos Slavakis
49
4
0
20 Oct 2022
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
Henrique Donancio
L. Vercouter
H. Roclawski
AI4CE
98
1
0
20 Oct 2022
Boosting Offline Reinforcement Learning via Data Rebalancing
Boosting Offline Reinforcement Learning via Data Rebalancing
Yang Yue
Bingyi Kang
Xiao Ma
Zhongwen Xu
Gao Huang
Shuicheng Yan
OffRL
75
23
0
17 Oct 2022
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
Xi Chen
Tianyuan Shi
Qing Zhao
Yuchen Sun
Yunfei Gao
Xiangjun Wang
62
2
0
14 Oct 2022
Deep reinforcement learning for automatic run-time adaptation of UWB PHY
  radio settings
Deep reinforcement learning for automatic run-time adaptation of UWB PHY radio settings
Dieter Coppens
A. Shahid
E. De Poorter
AI4CE
28
1
0
13 Oct 2022
Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient
Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient
Yuda Song
Yi Zhou
Ayush Sekhari
J. Andrew Bagnell
A. Krishnamurthy
Wen Sun
OffRLOnRL
94
105
0
13 Oct 2022
Censored Deep Reinforcement Patrolling with Information Criterion for
  Monitoring Large Water Resources using Autonomous Surface Vehicles
Censored Deep Reinforcement Patrolling with Information Criterion for Monitoring Large Water Resources using Autonomous Surface Vehicles
S. Luis
Daniel Gutiérrez-Reina
S. T. Marín
65
8
0
12 Oct 2022
Efficient Adversarial Training without Attacking: Worst-Case-Aware
  Robust Reinforcement Learning
Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning
Yongyuan Liang
Yanchao Sun
Ruijie Zheng
Furong Huang
OODAAMLOffRL
48
51
0
12 Oct 2022
Contrastive Retrospection: honing in on critical steps for rapid
  learning and generalization in RL
Contrastive Retrospection: honing in on critical steps for rapid learning and generalization in RL
Chen Sun
Wannan Yang
Thomas Jiralerspong
Dane Malenfant
Benjamin Alsbury-Nealy
Yoshua Bengio
Blake A. Richards
OffRL
64
2
0
12 Oct 2022
A Multi-Agent Approach for Adaptive Finger Cooperation in Learning-based
  In-Hand Manipulation
A Multi-Agent Approach for Adaptive Finger Cooperation in Learning-based In-Hand Manipulation
Lingfeng Tao
Jiucai Zhang
Michael Bowman
Xiaoli Zhang
67
6
0
11 Oct 2022
Class-Specific Explainability for Deep Time Series Classifiers
Class-Specific Explainability for Deep Time Series Classifiers
Ramesh Doddaiah
Prathyush S. Parvatharaju
Elke A. Rundensteiner
Thomas Hartvigsen
FAttAI4TS
99
5
0
11 Oct 2022
Human-AI Coordination via Human-Regularized Search and Learning
Human-AI Coordination via Human-Regularized Search and Learning
Hengyuan Hu
David J. Wu
Adam Lerer
Jakob N. Foerster
Noam Brown
70
7
0
11 Oct 2022
Real-Time Dynamic Map with Crowdsourcing Vehicles in Edge Computing
Real-Time Dynamic Map with Crowdsourcing Vehicles in Edge Computing
Qian Liu
Tao Han
Jiang Xie
Xie
Baek-Hyeon Kim
35
11
0
10 Oct 2022
Continual task learning in natural and artificial agents
Continual task learning in natural and artificial agents
Timo Flesch
Andrew M. Saxe
Christopher Summerfield
CLL
59
26
0
10 Oct 2022
Scaling up Stochastic Gradient Descent for Non-convex Optimisation
Scaling up Stochastic Gradient Descent for Non-convex Optimisation
S. Mohamad
H. Alamri
A. Bouchachia
85
3
0
06 Oct 2022
Query The Agent: Improving sample efficiency through epistemic
  uncertainty estimation
Query The Agent: Improving sample efficiency through epistemic uncertainty estimation
Julian Alverio
Boris Katz
Andrei Barbu
77
1
0
05 Oct 2022
Neuro-Planner: A 3D Visual Navigation Method for MAV with Depth Camera
  based on Neuromorphic Reinforcement Learning
Neuro-Planner: A 3D Visual Navigation Method for MAV with Depth Camera based on Neuromorphic Reinforcement Learning
Junjie Jiang
Delei Kong
Kuanxu Hou
Xinjie Huang
Zhuang Hao
Zheng Fang
76
9
0
05 Oct 2022
On Neural Consolidation for Transfer in Reinforcement Learning
On Neural Consolidation for Transfer in Reinforcement Learning
Valentin Guillet
Dennis G. Wilson
Carlos Aguilar-Melchor
Emmanuel Rachelson
CLL
54
0
0
05 Oct 2022
Hyperbolic Deep Reinforcement Learning
Hyperbolic Deep Reinforcement Learning
Edoardo Cetin
B. Chamberlain
Michael M. Bronstein
Jonathan J. Hunt
93
22
0
04 Oct 2022
Deep Learning for Wireless Networked Systems: a joint
  Estimation-Control-Scheduling Approach
Deep Learning for Wireless Networked Systems: a joint Estimation-Control-Scheduling Approach
Zihuai Zhao
Wanchun Liu
Daniel E. Quevedo
Yonghui Li
Branka Vucetic
69
18
0
03 Oct 2022
Economic-Driven Adaptive Traffic Signal Control
Economic-Driven Adaptive Traffic Signal Control
Shan Jiang
Yufei Huang
M. Jafari
M. Jalayer
22
0
0
02 Oct 2022
Bayesian Q-learning With Imperfect Expert Demonstrations
Bayesian Q-learning With Imperfect Expert Demonstrations
Fengdi Che
Xiru Zhu
Doina Precup
David Meger
Gregory Dudek
54
2
0
01 Oct 2022
Deep Recurrent Q-learning for Energy-constrained Coverage with a Mobile
  Robot
Deep Recurrent Q-learning for Energy-constrained Coverage with a Mobile Robot
Aaron Zellner
Ayan Dutta
Iliya Kulbaka
Gokarna Sharma
53
5
0
01 Oct 2022
Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits
Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits
Siddhartha Banerjee
Sean R. Sinclair
Milind Tambe
Lily Xu
Chao Yu
AI4TS
187
8
0
30 Sep 2022
Reinforcement Learning Algorithms: An Overview and Classification
Reinforcement Learning Algorithms: An Overview and Classification
Fadi AlMahamid
Katarina Grolinger
39
45
0
29 Sep 2022
Computational Discovery of Energy-Efficient Heat Treatment for
  Microstructure Design using Deep Reinforcement Learning
Computational Discovery of Energy-Efficient Heat Treatment for Microstructure Design using Deep Reinforcement Learning
J. Mianroodi
N. Siboni
Dierk Raabe
AI4CE
69
3
0
22 Sep 2022
Parallel Reinforcement Learning Simulation for Visual Quadrotor
  Navigation
Parallel Reinforcement Learning Simulation for Visual Quadrotor Navigation
Jack D. Saunders
Sajad Saeedi
Wenbin Li
49
3
0
22 Sep 2022
Pretraining the Vision Transformer using self-supervised methods for
  vision based Deep Reinforcement Learning
Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning
Manuel Goulão
Arlindo L. Oliveira
ViT
115
6
0
22 Sep 2022
M$^2$DQN: A Robust Method for Accelerating Deep Q-learning Network
M2^22DQN: A Robust Method for Accelerating Deep Q-learning Network
Zhe Zhang
Yukun Zou
Junjie Lai
Qinglong Xu
27
4
0
16 Sep 2022
Human-level Atari 200x faster
Human-level Atari 200x faster
Steven Kapturowski
Victor Campos
Ray Jiang
Nemanja Rakićević
Hado van Hasselt
Charles Blundell
Adria Puigdomenech Badia
OffRL
94
30
0
15 Sep 2022
On the Reuse Bias in Off-Policy Reinforcement Learning
On the Reuse Bias in Off-Policy Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Dong Yan
Jun Zhu
OffRL
81
3
0
15 Sep 2022
Inference and dynamic decision-making for deteriorating systems with
  probabilistic dependencies through Bayesian networks and deep reinforcement
  learning
Inference and dynamic decision-making for deteriorating systems with probabilistic dependencies through Bayesian networks and deep reinforcement learning
P. G. Morato
C. Andriotis
K. Papakonstantinou
P. Rigo
AI4CE
125
36
0
02 Sep 2022
Transformers are Sample-Efficient World Models
Transformers are Sample-Efficient World Models
Vincent Micheli
Eloi Alonso
Franccois Fleuret
VLMOffRL
192
189
0
01 Sep 2022
Previous
123...8910...282930
Next