ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.05952
  4. Cited By
Prioritized Experience Replay

Prioritized Experience Replay

18 November 2015
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
    OffRL
ArXivPDFHTML

Papers citing "Prioritized Experience Replay"

50 / 1,441 papers shown
Title
Fast-Learning Grasping and Pre-Grasping via Clutter Quantization and
  Q-map Masking
Fast-Learning Grasping and Pre-Grasping via Clutter Quantization and Q-map Masking
Dafa Ren
Xiaoqiang Ren
Xiaofan Wang
Sundara Tejaswi Digumarti
Guodong Shi
11
7
0
06 Jul 2021
Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement
  Learning
Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning
Muhammad Rizki Maulana
W. Lee
30
1
0
05 Jul 2021
Cooperative Autonomous Vehicles that Sympathize with Human Drivers
Cooperative Autonomous Vehicles that Sympathize with Human Drivers
Behrad Toghi
Rodolfo Valiente
Dorsa Sadigh
Ramtin Pedarsani
Y. P. Fallah
30
45
0
02 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under
  Data Augmentation
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
44
135
0
01 Jul 2021
Offline-to-Online Reinforcement Learning via Balanced Replay and
  Pessimistic Q-Ensemble
Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble
Seunghyun Lee
Younggyo Seo
Kimin Lee
Pieter Abbeel
Jinwoo Shin
OffRL
OnRL
22
182
0
01 Jul 2021
Convergent and Efficient Deep Q Network Algorithm
Convergent and Efficient Deep Q Network Algorithm
Zhikang T. Wang
Masahito Ueda
27
12
0
29 Jun 2021
Autonomous Deep Quality Monitoring in Streaming Environments
Autonomous Deep Quality Monitoring in Streaming Environments
Andri Ashfahani
Mahardhika Pratama
E. Lughofer
E. Yapp
36
4
0
26 Jun 2021
Auto-Pipeline: Synthesizing Complex Data Pipelines By-Target Using
  Reinforcement Learning and Search
Auto-Pipeline: Synthesizing Complex Data Pipelines By-Target Using Reinforcement Learning and Search
Junwen Yang
Yeye He
S. Chaudhuri
AI4TS
27
26
0
25 Jun 2021
Mix and Mask Actor-Critic Methods
Mix and Mask Actor-Critic Methods
Dom Huh
27
1
0
24 Jun 2021
Stochastic Batch Acquisition: A Simple Baseline for Deep Active Learning
Stochastic Batch Acquisition: A Simple Baseline for Deep Active Learning
Andreas Kirsch
Sebastian Farquhar
Parmida Atighehchian
Andrew Jesson
Frederic Branchaud-Charron
Y. Gal
49
20
0
22 Jun 2021
Hi-Phy: A Benchmark for Hierarchical Physical Reasoning
Cheng Xue
Vimukthini Pinto
C. Gamage
Peng Zhang
Jochen Renz
28
0
0
17 Jun 2021
Modelling resource allocation in uncertain system environment through
  deep reinforcement learning
Modelling resource allocation in uncertain system environment through deep reinforcement learning
Neel Gandhi
Shakti Mishra
24
1
0
17 Jun 2021
CROP: Certifying Robust Policies for Reinforcement Learning through
  Functional Smoothing
CROP: Certifying Robust Policies for Reinforcement Learning through Functional Smoothing
Fan Wu
Linyi Li
Zijian Huang
Yevgeniy Vorobeychik
Ding Zhao
Yue Liu
AAML
OffRL
26
59
0
17 Jun 2021
Solving Continuous Control with Episodic Memory
Solving Continuous Control with Episodic Memory
Igor Kuznetsov
Andrey Filchenkov
CLL
OffRL
9
19
0
16 Jun 2021
Characterizing the Gap Between Actor-Critic and Policy Gradient
Characterizing the Gap Between Actor-Critic and Policy Gradient
Junfeng Wen
Saurabh Kumar
Ramki Gummadi
Dale Schuurmans
34
15
0
13 Jun 2021
A Deep Reinforcement Learning Approach to Marginalized Importance
  Sampling with the Successor Representation
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
Scott Fujimoto
David Meger
Doina Precup
10
16
0
12 Jun 2021
GDI: Rethinking What Makes Reinforcement Learning Different From
  Supervised Learning
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning
Jiajun Fan
Changnan Xiao
Yue Huang
OffRL
21
10
0
11 Jun 2021
Taylor Expansion of Discount Factors
Taylor Expansion of Discount Factors
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
34
5
0
11 Jun 2021
Data-driven battery operation for energy arbitrage using rainbow deep
  reinforcement learning
Data-driven battery operation for energy arbitrage using rainbow deep reinforcement learning
Daniel J. B. Harrold
Jun Cao
Zhong Fan
17
47
0
10 Jun 2021
Simplifying Deep Reinforcement Learning via Self-Supervision
Simplifying Deep Reinforcement Learning via Self-Supervision
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
SSL
54
15
0
10 Jun 2021
Reinforcement Learning for Industrial Control Network Cyber Security
  Orchestration
Reinforcement Learning for Industrial Control Network Cyber Security Orchestration
John Mern
Kyle Hatch
Ryan Silva
J. Brush
Mykel J. Kochenderfer
22
4
0
09 Jun 2021
Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion
  Attacks in Deep RL
Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL
Yanchao Sun
Ruijie Zheng
Yongyuan Liang
Furong Huang
AAML
11
63
0
09 Jun 2021
Don't Get Yourself into Trouble! Risk-aware Decision-Making for
  Autonomous Vehicles
Don't Get Yourself into Trouble! Risk-aware Decision-Making for Autonomous Vehicles
Kasra Mokhtari
Alan R. Wagner
17
5
0
08 Jun 2021
Safe Deep Q-Network for Autonomous Vehicles at Unsignalized Intersection
Safe Deep Q-Network for Autonomous Vehicles at Unsignalized Intersection
Kasra Mokhtari
Alan R. Wagner
33
9
0
08 Jun 2021
Towards robust and domain agnostic reinforcement learning competitions
Towards robust and domain agnostic reinforcement learning competitions
William H. Guss
Stephanie Milani
Nicholay Topin
Brandon Houghton
Sharada Mohanty
...
Lu Liu
Daichi Nishio
Toi Tsuneda
Karolis Ramanauskas
Gabija Juceviciute
OOD
27
2
0
07 Jun 2021
Causal Influence Detection for Improving Efficiency in Reinforcement
  Learning
Causal Influence Detection for Improving Efficiency in Reinforcement Learning
Maximilian Seitzer
Bernhard Schölkopf
Georg Martius
CML
31
75
0
07 Jun 2021
Distributional Reinforcement Learning with Unconstrained Monotonic
  Neural Networks
Distributional Reinforcement Learning with Unconstrained Monotonic Neural Networks
Thibaut Théate
Antoine Wehenkel
Adrien Bolland
Gilles Louppe
D. Ernst
24
7
0
06 Jun 2021
Differentiable Architecture Search for Reinforcement Learning
Differentiable Architecture Search for Reinforcement Learning
Yingjie Miao
Xingyou Song
John D. Co-Reyes
Daiyi Peng
Summer Yue
E. Brevdo
Aleksandra Faust
20
4
0
04 Jun 2021
Hierarchical Representation Learning for Markov Decision Processes
Hierarchical Representation Learning for Markov Decision Processes
Lorenzo Steccanella
Simone Totaro
Anders Jonsson
28
4
0
03 Jun 2021
Towards Deeper Deep Reinforcement Learning with Spectral Normalization
Towards Deeper Deep Reinforcement Learning with Spectral Normalization
Johan Bjorck
Carla P. Gomes
Kilian Q. Weinberger
19
23
0
02 Jun 2021
Smooth Q-learning: Accelerate Convergence of Q-learning Using Similarity
Smooth Q-learning: Accelerate Convergence of Q-learning Using Similarity
Wei-zhi Liao
Xiaohui Wei
Jizhou Lai
11
3
0
02 Jun 2021
Transferable Deep Reinforcement Learning Framework for Autonomous
  Vehicles with Joint Radar-Data Communications
Transferable Deep Reinforcement Learning Framework for Autonomous Vehicles with Joint Radar-Data Communications
Nguyen Quang Hieu
D. Hoang
Dusit Niyato
Ping Wang
Dong In Kim
Chau Yuen
34
28
0
28 May 2021
FNAS: Uncertainty-Aware Fast Neural Architecture Search
FNAS: Uncertainty-Aware Fast Neural Architecture Search
Jihao Liu
Ming Zhang
Yangting Sun
B. Liu
Guanglu Song
Yu Liu
Hongsheng Li
30
7
0
25 May 2021
Searching Collaborative Agents for Multi-plane Localization in 3D
  Ultrasound
Searching Collaborative Agents for Multi-plane Localization in 3D Ultrasound
Xin Yang
Yuhao Huang
Ruobing Huang
Haoran Dou
Rui Li
...
Chaoyu Chen
Yuanji Zhang
Haixia Wang
Yi Xiong
Dong Ni
33
16
0
22 May 2021
Robo-Advising: Enhancing Investment with Inverse Optimization and Deep
  Reinforcement Learning
Robo-Advising: Enhancing Investment with Inverse Optimization and Deep Reinforcement Learning
Haoran Wang
S. Yu
AIFin
38
13
0
19 May 2021
Controlling an Inverted Pendulum with Policy Gradient Methods-A Tutorial
Controlling an Inverted Pendulum with Policy Gradient Methods-A Tutorial
Swagat Kumar
18
2
0
17 May 2021
Behavior-based Neuroevolutionary Training in Reinforcement Learning
Behavior-based Neuroevolutionary Training in Reinforcement Learning
Jörg Stork
Martin Zaefferer
Nils Eisler
Patrick Tichelmann
T. Bartz-Beielstein
A. E. Eiben
6
5
0
17 May 2021
Regret Minimization Experience Replay in Off-Policy Reinforcement
  Learning
Regret Minimization Experience Replay in Off-Policy Reinforcement Learning
Xu-Hui Liu
Zhenghai Xue
Jing-Cheng Pang
Shengyi Jiang
Feng Xu
Yang Yu
OffRL
21
37
0
15 May 2021
Non-decreasing Quantile Function Network with Efficient Exploration for
  Distributional Reinforcement Learning
Non-decreasing Quantile Function Network with Efficient Exploration for Distributional Reinforcement Learning
Fan Zhou
Zhoufan Zhu
Qi Kuang
Liwen Zhang
OffRL
28
16
0
14 May 2021
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation
  Perspective
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective
Florin Gogianu
Tudor Berariu
Mihaela Rosca
Claudia Clopath
L. Buşoniu
Razvan Pascanu
24
54
0
11 May 2021
Deep Bandits Show-Off: Simple and Efficient Exploration with Deep
  Networks
Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks
Rong Zhu
Mattia Rigotti
34
7
0
10 May 2021
PEARL: Parallelized Expert-Assisted Reinforcement Learning for Scene
  Rearrangement Planning
PEARL: Parallelized Expert-Assisted Reinforcement Learning for Scene Rearrangement Planning
Hanqing Wang
Zan Wang
Wei Liang
L. Yu
21
1
0
10 May 2021
CASA: Bridging the Gap between Policy Improvement and Policy Evaluation
  with Conflict Averse Policy Iteration
CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration
Changnan Xiao
Haosen Shi
Jiajun Fan
Shihong Deng
Haiyan Yin
19
0
0
09 May 2021
Time-Aware Q-Networks: Resolving Temporal Irregularity for Deep
  Reinforcement Learning
Time-Aware Q-Networks: Resolving Temporal Irregularity for Deep Reinforcement Learning
Yeonji Kim
Min Chi
14
0
0
06 May 2021
Density-Aware Federated Imitation Learning for Connected and Automated
  Vehicles with Unsignalized Intersection
Density-Aware Federated Imitation Learning for Connected and Automated Vehicles with Unsignalized Intersection
Tianhao Wu
Mingzhi Jiang
Yinhui Han
Zheng Yuan
Lin Zhang
18
2
0
05 May 2021
On Lottery Tickets and Minimal Task Representations in Deep
  Reinforcement Learning
On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning
Marc Aurel Vischer
R. T. Lange
Henning Sprekeler
OOD
UQCV
OffRL
25
23
0
04 May 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency
  Inference
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
22
8
0
03 May 2021
Curious Exploration and Return-based Memory Restoration for Deep
  Reinforcement Learning
Curious Exploration and Return-based Memory Restoration for Deep Reinforcement Learning
Saeed Tafazzol
Erfan Fathi
Mahdi Rezaei
Ehsan Asali
13
2
0
02 May 2021
Pedestrian Collision Avoidance for Autonomous Vehicles at Unsignalized
  Intersection Using Deep Q-Network
Pedestrian Collision Avoidance for Autonomous Vehicles at Unsignalized Intersection Using Deep Q-Network
Kasra Mokhtari
Alan R. Wagner
12
6
0
01 May 2021
One Backward from Ten Forward, Subsampling for Large-Scale Deep Learning
One Backward from Ten Forward, Subsampling for Large-Scale Deep Learning
Chaosheng Dong
Xiaojie Jin
Weihao Gao
Yijia Wang
Hongyi Zhang
Xiang Wu
Jianchao Yang
Xiaobing Liu
28
5
0
27 Apr 2021
Previous
123...131415...272829
Next