ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.05952
  4. Cited By
Prioritized Experience Replay

Prioritized Experience Replay

18 November 2015
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
    OffRL
ArXivPDFHTML

Papers citing "Prioritized Experience Replay"

50 / 1,441 papers shown
Title
Deep Reinforcement Learning for Stabilization of Large-scale
  Probabilistic Boolean Networks
Deep Reinforcement Learning for Stabilization of Large-scale Probabilistic Boolean Networks
S. Moschoyiannis
Evangelos Chatzaroulas
Vytenis Sliogeris
Yuhu Wu
BDL
OffRL
AI4CE
24
7
0
21 Oct 2022
online and lightweight kernel-based approximated policy iteration for
  dynamic p-norm linear adaptive filtering
online and lightweight kernel-based approximated policy iteration for dynamic p-norm linear adaptive filtering
Yuki Akiyama
Minh Nhat Vu
Konstantinos Slavakis
25
1
0
21 Oct 2022
Deep reinforcement learning oriented for real world dynamic scenarios
Deep reinforcement learning oriented for real world dynamic scenarios
Diego Martinez
L. Riazuelo
Luis Montano
25
1
0
20 Oct 2022
Dynamic selection of p-norm in linear adaptive filtering via online
  kernel-based reinforcement learning
Dynamic selection of p-norm in linear adaptive filtering via online kernel-based reinforcement learning
Minh Nhat Vu
Yuki Akiyama
Konstantinos Slavakis
23
4
0
20 Oct 2022
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
Henrique Donancio
L. Vercouter
H. Roclawski
AI4CE
18
1
0
20 Oct 2022
Boosting Offline Reinforcement Learning via Data Rebalancing
Boosting Offline Reinforcement Learning via Data Rebalancing
Yang Yue
Bingyi Kang
Xiao Ma
Zhongwen Xu
Gao Huang
Shuicheng Yan
OffRL
26
22
0
17 Oct 2022
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
Xi Chen
Tianyuan Shi
Qing Zhao
Yuchen Sun
Yunfei Gao
Xiangjun Wang
33
2
0
14 Oct 2022
Deep reinforcement learning for automatic run-time adaptation of UWB PHY
  radio settings
Deep reinforcement learning for automatic run-time adaptation of UWB PHY radio settings
Dieter Coppens
A. Shahid
E. De Poorter
AI4CE
21
1
0
13 Oct 2022
Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient
Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient
Yuda Song
Yi Zhou
Ayush Sekhari
J. Andrew Bagnell
A. Krishnamurthy
Wen Sun
OffRL
OnRL
38
94
0
13 Oct 2022
Censored Deep Reinforcement Patrolling with Information Criterion for
  Monitoring Large Water Resources using Autonomous Surface Vehicles
Censored Deep Reinforcement Patrolling with Information Criterion for Monitoring Large Water Resources using Autonomous Surface Vehicles
S. Luis
Daniel Gutiérrez-Reina
S. T. Marín
31
8
0
12 Oct 2022
Efficient Adversarial Training without Attacking: Worst-Case-Aware
  Robust Reinforcement Learning
Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning
Yongyuan Liang
Yanchao Sun
Ruijie Zheng
Furong Huang
OOD
AAML
OffRL
30
47
0
12 Oct 2022
Contrastive Retrospection: honing in on critical steps for rapid
  learning and generalization in RL
Contrastive Retrospection: honing in on critical steps for rapid learning and generalization in RL
Chen Sun
Wannan Yang
Thomas Jiralerspong
Dane Malenfant
Benjamin Alsbury-Nealy
Yoshua Bengio
Blake A. Richards
OffRL
22
2
0
12 Oct 2022
A Multi-Agent Approach for Adaptive Finger Cooperation in Learning-based
  In-Hand Manipulation
A Multi-Agent Approach for Adaptive Finger Cooperation in Learning-based In-Hand Manipulation
Lingfeng Tao
Jiucai Zhang
Michael Bowman
Xiaoli Zhang
40
5
0
11 Oct 2022
Class-Specific Explainability for Deep Time Series Classifiers
Class-Specific Explainability for Deep Time Series Classifiers
Ramesh Doddaiah
Prathyush S. Parvatharaju
Elke A. Rundensteiner
Thomas Hartvigsen
FAtt
AI4TS
37
4
0
11 Oct 2022
Human-AI Coordination via Human-Regularized Search and Learning
Human-AI Coordination via Human-Regularized Search and Learning
Hengyuan Hu
David J. Wu
Adam Lerer
Jakob N. Foerster
Noam Brown
24
7
0
11 Oct 2022
Real-Time Dynamic Map with Crowdsourcing Vehicles in Edge Computing
Real-Time Dynamic Map with Crowdsourcing Vehicles in Edge Computing
Qian Liu
Tao Han
Jiang Xie
Xie
Baek-Hyeon Kim
31
10
0
10 Oct 2022
Continual task learning in natural and artificial agents
Continual task learning in natural and artificial agents
Timo Flesch
Andrew M. Saxe
Christopher Summerfield
CLL
43
24
0
10 Oct 2022
Scaling up Stochastic Gradient Descent for Non-convex Optimisation
Scaling up Stochastic Gradient Descent for Non-convex Optimisation
S. Mohamad
H. Alamri
A. Bouchachia
50
3
0
06 Oct 2022
Query The Agent: Improving sample efficiency through epistemic
  uncertainty estimation
Query The Agent: Improving sample efficiency through epistemic uncertainty estimation
Julian Alverio
Boris Katz
Andrei Barbu
40
0
0
05 Oct 2022
Neuro-Planner: A 3D Visual Navigation Method for MAV with Depth Camera
  based on Neuromorphic Reinforcement Learning
Neuro-Planner: A 3D Visual Navigation Method for MAV with Depth Camera based on Neuromorphic Reinforcement Learning
Junjie Jiang
Delei Kong
Kuanxu Hou
Xinjie Huang
Zhuang Hao
Zheng Fang
37
9
0
05 Oct 2022
On Neural Consolidation for Transfer in Reinforcement Learning
On Neural Consolidation for Transfer in Reinforcement Learning
Valentin Guillet
Dennis G. Wilson
Carlos Aguilar-Melchor
Emmanuel Rachelson
CLL
30
0
0
05 Oct 2022
Hyperbolic Deep Reinforcement Learning
Hyperbolic Deep Reinforcement Learning
Edoardo Cetin
B. Chamberlain
Michael M. Bronstein
Jonathan J. Hunt
50
21
0
04 Oct 2022
Deep Learning for Wireless Networked Systems: a joint
  Estimation-Control-Scheduling Approach
Deep Learning for Wireless Networked Systems: a joint Estimation-Control-Scheduling Approach
Zihuai Zhao
Wanchun Liu
Daniel E. Quevedo
Yonghui Li
Branka Vucetic
32
18
0
03 Oct 2022
Economic-Driven Adaptive Traffic Signal Control
Economic-Driven Adaptive Traffic Signal Control
Shan Jiang
Yufei Huang
M. Jafari
M. Jalayer
11
0
0
02 Oct 2022
Bayesian Q-learning With Imperfect Expert Demonstrations
Bayesian Q-learning With Imperfect Expert Demonstrations
Fengdi Che
Xiru Zhu
Doina Precup
David Meger
Gregory Dudek
19
2
0
01 Oct 2022
Deep Recurrent Q-learning for Energy-constrained Coverage with a Mobile
  Robot
Deep Recurrent Q-learning for Energy-constrained Coverage with a Mobile Robot
Aaron Zellner
Ayan Dutta
Iliya Kulbaka
Gokarna Sharma
26
5
0
01 Oct 2022
Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits
Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits
Siddhartha Banerjee
Sean R. Sinclair
Milind Tambe
Lily Xu
Chao Yu
AI4TS
36
6
0
30 Sep 2022
Reinforcement Learning Algorithms: An Overview and Classification
Reinforcement Learning Algorithms: An Overview and Classification
Fadi AlMahamid
Katarina Grolinger
21
40
0
29 Sep 2022
Computational Discovery of Energy-Efficient Heat Treatment for
  Microstructure Design using Deep Reinforcement Learning
Computational Discovery of Energy-Efficient Heat Treatment for Microstructure Design using Deep Reinforcement Learning
J. Mianroodi
N. Siboni
Dierk Raabe
AI4CE
40
2
0
22 Sep 2022
Parallel Reinforcement Learning Simulation for Visual Quadrotor
  Navigation
Parallel Reinforcement Learning Simulation for Visual Quadrotor Navigation
Jack D. Saunders
Sajad Saeedi
Wenbin Li
25
3
0
22 Sep 2022
Pretraining the Vision Transformer using self-supervised methods for
  vision based Deep Reinforcement Learning
Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning
Manuel Goulão
Arlindo L. Oliveira
ViT
50
6
0
22 Sep 2022
M$^2$DQN: A Robust Method for Accelerating Deep Q-learning Network
M2^22DQN: A Robust Method for Accelerating Deep Q-learning Network
Zhe Zhang
Yukun Zou
Junjie Lai
Qinglong Xu
18
4
0
16 Sep 2022
Human-level Atari 200x faster
Human-level Atari 200x faster
Steven Kapturowski
Victor Campos
Ray Jiang
Nemanja Rakićević
Hado van Hasselt
Charles Blundell
Adria Puigdomenech Badia
OffRL
52
28
0
15 Sep 2022
On the Reuse Bias in Off-Policy Reinforcement Learning
On the Reuse Bias in Off-Policy Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Dong Yan
Jun Zhu
OffRL
45
3
0
15 Sep 2022
Inference and dynamic decision-making for deteriorating systems with
  probabilistic dependencies through Bayesian networks and deep reinforcement
  learning
Inference and dynamic decision-making for deteriorating systems with probabilistic dependencies through Bayesian networks and deep reinforcement learning
P. G. Morato
C. Andriotis
K. Papakonstantinou
P. Rigo
AI4CE
29
35
0
02 Sep 2022
Transformers are Sample-Efficient World Models
Transformers are Sample-Efficient World Models
Vincent Micheli
Eloi Alonso
Franccois Fleuret
VLM
OffRL
33
163
0
01 Sep 2022
Actor Prioritized Experience Replay
Actor Prioritized Experience Replay
Baturay Saglam
Furkan B. Mutlu
Dogan C. Cicek
Suleyman Serdar Kozat
30
24
0
01 Sep 2022
To Store or Not? Online Data Selection for Federated Learning with
  Limited Storage
To Store or Not? Online Data Selection for Federated Learning with Limited Storage
Chen Gong
Zhenzhe Zheng
Yunfeng Shao
Bingshuai Li
Fan Wu
Guihai Chen
42
17
0
01 Sep 2022
Beyond Supervised Continual Learning: a Review
Beyond Supervised Continual Learning: a Review
Benedikt Bagus
A. Gepperth
Timothée Lesort
BDL
CLL
35
10
0
30 Aug 2022
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement
  Learning: A Systematic Review
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement Learning: A Systematic Review
Fadi AlMahamid
Katarina Grolinger
30
73
0
25 Aug 2022
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph
  Learning for Continuous Action Space
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action Space
Yining Chen
Ke Wang
Guang-hua Song
Xiaohong Jiang
28
3
0
23 Aug 2022
Prioritizing Samples in Reinforcement Learning with Reducible Loss
Prioritizing Samples in Reinforcement Learning with Reducible Loss
Shivakanth Sujit
Somjit Nath
Pedro H. M. Braga
Samira Ebrahimi Kahou
50
15
0
22 Aug 2022
Event-Triggered Model Predictive Control with Deep Reinforcement
  Learning for Autonomous Driving
Event-Triggered Model Predictive Control with Deep Reinforcement Learning for Autonomous Driving
Fengying Dang
Dong Chen
J. Chen
Zhaojian Li
22
24
0
22 Aug 2022
Path Planning of Cleaning Robot with Reinforcement Learning
Path Planning of Cleaning Robot with Reinforcement Learning
Woohyeon Moon
Bumgeun Park
S. Nengroo
Taeyoung Kim
Dongsoo Har
30
17
0
17 Aug 2022
PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning
  Algorithm
PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm
T. Basaklar
S. Gumussoy
Ümit Y. Ogras
24
39
0
16 Aug 2022
Online 3D Bin Packing Reinforcement Learning Solution with Buffer
Online 3D Bin Packing Reinforcement Learning Solution with Buffer
Aaron Valero Puche
Sukhan Lee
OffRL
22
14
0
15 Aug 2022
Maximizing the Use of Environmental Constraints: A Pushing-Based Hybrid
  Position/Force Assembly Skill for Contact-Rich Tasks
Maximizing the Use of Environmental Constraints: A Pushing-Based Hybrid Position/Force Assembly Skill for Contact-Rich Tasks
Yunlei Shi
Zhaopeng Chen
Lin Cong
Yansong Wu
Martin Craiu-Müller
C. Yuan
Chunyang Chang
Lei Zhang
Jianwei Zhang
22
0
0
12 Aug 2022
Model-Free Generative Replay for Lifelong Reinforcement Learning:
  Application to Starcraft-2
Model-Free Generative Replay for Lifelong Reinforcement Learning: Application to Starcraft-2
Z. Daniels
Aswin Raghavan
Jesse Hostetler
Abrar Rahman
Indranil Sur
M. Piacentino
Ajay Divakaran
CLL
OffRL
36
12
0
09 Aug 2022
Flow Annealed Importance Sampling Bootstrap
Flow Annealed Importance Sampling Bootstrap
Laurence Illing Midgley
Vincent Stimper
G. Simm
Bernhard Schölkopf
José Miguel Hernández-Lobato
43
77
0
03 Aug 2022
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step
  Q-learning: A Novel Correction Approach
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach
Baturay Saglam
Dogan C. Cicek
Furkan B. Mutlu
Suleyman Serdar Kozat
OffRL
OnRL
31
1
0
01 Aug 2022
Previous
123...8910...272829
Next