ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.05952
  4. Cited By
Prioritized Experience Replay
v1v2v3v4 (latest)

Prioritized Experience Replay

18 November 2015
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Prioritized Experience Replay"

50 / 1,454 papers shown
Title
From Psychological Curiosity to Artificial Curiosity: Curiosity-Driven
  Learning in Artificial Intelligence Tasks
From Psychological Curiosity to Artificial Curiosity: Curiosity-Driven Learning in Artificial Intelligence Tasks
Chenyu Sun
Hangwei Qian
Chunyan Miao
73
10
0
20 Jan 2022
Hybrid Reinforcement Learning-Based Eco-Driving Strategy for Connected
  and Automated Vehicles at Signalized Intersections
Hybrid Reinforcement Learning-Based Eco-Driving Strategy for Connected and Automated Vehicles at Signalized Intersections
Zhengwei Bai
Peng Hao
ShangGuan Wei
B. Cai
Matthew Barth
49
104
0
19 Jan 2022
Online POI Recommendation: Learning Dynamic Geo-Human Interactions in
  Streams
Online POI Recommendation: Learning Dynamic Geo-Human Interactions in Streams
Dongjie Wang
Kunpeng Liu
Hui Xiong
Yanjie Fu
160
8
0
19 Jan 2022
Spatial State-Action Features for General Games
Spatial State-Action Features for General Games
Dennis J. N. J. Soemers
Éric Piette
Matthew Stephenson
C. Browne
99
4
0
17 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
116
107
0
11 Jan 2022
STIR$^2$: Reward Relabelling for combined Reinforcement and Imitation
  Learning on sparse-reward tasks
STIR2^22: Reward Relabelling for combined Reinforcement and Imitation Learning on sparse-reward tasks
Jesús Bujalance Martín
Fabien Moutarde
OffRL
75
2
0
11 Jan 2022
A Generalized Bootstrap Target for Value-Learning, Efficiently Combining
  Value and Feature Predictions
A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions
Anthony GX-Chen
Veronica Chelu
Blake A. Richards
Joelle Pineau
TTA
57
1
0
05 Jan 2022
FedBalancer: Data and Pace Control for Efficient Federated Learning on
  Heterogeneous Clients
FedBalancer: Data and Pace Control for Efficient Federated Learning on Heterogeneous Clients
Jaemin Shin
Yuanchun Li
Yunxin Liu
Sung-Ju Lee
FedML
76
77
0
05 Jan 2022
Multi-Stage Episodic Control for Strategic Exploration in Text Games
Multi-Stage Episodic Control for Strategic Exploration in Text Games
Jens Tuyls
Shunyu Yao
Sham Kakade
Karthik Narasimhan
100
26
0
04 Jan 2022
Hybrid intelligence for dynamic job-shop scheduling with deep
  reinforcement learning and attention mechanism
Hybrid intelligence for dynamic job-shop scheduling with deep reinforcement learning and attention mechanism
Yu-jian Zeng
Zijun Liao
Y. Dai
Rong Wang
Xiu Li
Bo Yuan
49
14
0
03 Jan 2022
Actor Loss of Soft Actor Critic Explained
Actor Loss of Soft Actor Critic Explained
Thibault Lahire
37
0
0
31 Dec 2021
Missing Velocity in Dynamic Obstacle Avoidance based on Deep
  Reinforcement Learning
Missing Velocity in Dynamic Obstacle Avoidance based on Deep Reinforcement Learning
Fabian Hart
Martin Waltz
Ostap Okhrin
43
0
0
23 Dec 2021
Maximum Entropy Population-Based Training for Zero-Shot Human-AI
  Coordination
Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination
Rui Zhao
Jinming Song
Yufeng Yuan
Haifeng Hu
Yang Gao
Yi Wu
Zhongqian Sun
Yang Wei
95
69
0
22 Dec 2021
Creativity of AI: Hierarchical Planning Model Learning for Facilitating
  Deep Reinforcement Learning
Creativity of AI: Hierarchical Planning Model Learning for Facilitating Deep Reinforcement Learning
H. Zhuo
Shuting Deng
M. Jin
Zhihao Ma
Kebing Jin
Chong Chen
Chao Yu
115
1
0
18 Dec 2021
Distillation of RL Policies with Formal Guarantees via Variational
  Abstraction of Markov Decision Processes (Technical Report)
Distillation of RL Policies with Formal Guarantees via Variational Abstraction of Markov Decision Processes (Technical Report)
Florent Delgrange
Ann Nowé
Guillermo A. Pérez
OffRL
43
11
0
17 Dec 2021
Deep Reinforcement Learning Policies Learn Shared Adversarial Features
  Across MDPs
Deep Reinforcement Learning Policies Learn Shared Adversarial Features Across MDPs
Ezgi Korkmaz
64
26
0
16 Dec 2021
Benchmarking Safe Deep Reinforcement Learning in Aquatic Navigation
Benchmarking Safe Deep Reinforcement Learning in Aquatic Navigation
Enrico Marchesini
Davide Corsi
Alessandro Farinelli
67
19
0
16 Dec 2021
Centralizing State-Values in Dueling Networks for Multi-Robot
  Reinforcement Learning Mapless Navigation
Centralizing State-Values in Dueling Networks for Multi-Robot Reinforcement Learning Mapless Navigation
Enrico Marchesini
Alessandro Farinelli
56
18
0
16 Dec 2021
Automatic tuning of hyper-parameters of reinforcement learning
  algorithms using Bayesian optimization with behavioral cloning
Automatic tuning of hyper-parameters of reinforcement learning algorithms using Bayesian optimization with behavioral cloning
Juan Cruz Barsce
J. Palombarini
Ernesto C. Martínez
OffRL
62
1
0
15 Dec 2021
Replay For Safety
Replay For Safety
Liran Szlak
Ohad Shamir
OffRL
47
0
0
08 Dec 2021
Convergence Results For Q-Learning With Experience Replay
Convergence Results For Q-Learning With Experience Replay
Liran Szlak
Ohad Shamir
OffRL
73
5
0
08 Dec 2021
PTR-PPO: Proximal Policy Optimization with Prioritized Trajectory Replay
PTR-PPO: Proximal Policy Optimization with Prioritized Trajectory Replay
Xingxing Liang
Yang Ma
Yanghe Feng
Zhong Liu
61
10
0
07 Dec 2021
JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical
  Reinforcement Learning
JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical Reinforcement Learning
Zichuan Lin
Junyou Li
Jianing Shi
Deheng Ye
Qiang Fu
Wei Yang
BDL
80
36
0
07 Dec 2021
A Generic Graph Sparsification Framework using Deep Reinforcement
  Learning
A Generic Graph Sparsification Framework using Deep Reinforcement Learning
Ryan Wickman
Xiaofei Zhang
Weizi Li
OffRL
72
13
0
02 Dec 2021
Maximum Entropy Model-based Reinforcement Learning
Maximum Entropy Model-based Reinforcement Learning
Oleg Svidchenko
A. Shpilman
61
6
0
02 Dec 2021
NEORL: NeuroEvolution Optimization with Reinforcement Learning
NEORL: NeuroEvolution Optimization with Reinforcement Learning
M. Radaideh
Katelin Du
Paul Seurin
Devin Seyler
Xubo Gu
Haijiang Wang
K. Shirvan
OffRL
64
6
0
01 Dec 2021
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Chao-Han Huck Yang
Zhengling Qi
Yifan Cui
Pin-Yu Chen
OffRL
98
4
0
29 Nov 2021
Improving Experience Replay with Successor Representation
Improving Experience Replay with Successor Representation
Yizhi Yuan
M. Mattar
36
1
0
29 Nov 2021
Count-Based Temperature Scheduling for Maximum Entropy Reinforcement
  Learning
Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
Dailin Hu
Pieter Abbeel
Roy Fox
42
2
0
28 Nov 2021
Reinforcement Learning-based Switching Controller for a Milliscale Robot
  in a Constrained Environment
Reinforcement Learning-based Switching Controller for a Milliscale Robot in a Constrained Environment
Abbas Tariverdi
Ulysse Côté-Allard
Kim Mathiassen
O. Elle
H. Kalvøy
Ø. Martinsen
J. Tørresen
110
4
0
27 Nov 2021
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven
  Exploration
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Lu Zheng
Jiarui Chen
Jianhao Wang
Jiamin He
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
Chongjie Zhang
71
86
0
22 Nov 2021
Fast and Data-Efficient Training of Rainbow: an Experimental Study on
  Atari
Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari
Dominik Schmidt
Thomas Schmied
OffRL
61
12
0
19 Nov 2021
Improving Experience Replay through Modeling of Similar Transitions'
  Sets
Improving Experience Replay through Modeling of Similar Transitions' Sets
Daniel Eugênio Neves
João Pedro Oliveira Batisteli
Eduardo Felipe Lopes
Lucila Ishitani
Zenilton K. G. Patrocínio
OffRL
38
1
0
12 Nov 2021
Collaboration Promotes Group Resilience in Multi-Agent AI
Collaboration Promotes Group Resilience in Multi-Agent AI
Sarah Keren
M. Gerstgrasser
Ofir Abu
J. Rosenschein
42
0
0
12 Nov 2021
Look Before You Leap: Safe Model-Based Reinforcement Learning with Human
  Intervention
Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention
Yunkun Xu
Zhen-yu Liu
Guifang Duan
Jiangcheng Zhu
X. Bai
Jianrong Tan
81
9
0
10 Nov 2021
Cross Modality 3D Navigation Using Reinforcement Learning and Neural
  Style Transfer
Cross Modality 3D Navigation Using Reinforcement Learning and Neural Style Transfer
Cesare Magnetti
Hadrien Reynaud
Bernhard Kainz
MedIm
30
0
0
05 Nov 2021
Improving RNA Secondary Structure Design using Deep Reinforcement
  Learning
Improving RNA Secondary Structure Design using Deep Reinforcement Learning
Alexander Whatley
Zhekun Luo
Xiangru Tang
37
2
0
05 Nov 2021
Autonomous Attack Mitigation for Industrial Control Systems
Autonomous Attack Mitigation for Industrial Control Systems
John Mern
Kyle Hatch
Ryan Silva
Cameron Hickert
Tamim I. Sookoor
Mykel J. Kochenderfer
AAML
68
7
0
03 Nov 2021
Model-Based Episodic Memory Induces Dynamic Hybrid Controls
Model-Based Episodic Memory Induces Dynamic Hybrid Controls
Hung Le
Thommen George Karimpanal
Majid Abdolshah
T. Tran
Svetha Venkatesh
72
20
0
03 Nov 2021
One Pass ImageNet
One Pass ImageNet
Huiyi Hu
Ang Li
Daniele Calandriello
Dilan Görür
VLM
72
18
0
03 Nov 2021
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms
  via Batch Prioritized Experience Replay
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay
Dogan C. Cicek
Enes Duran
Baturay Saglam
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
35
11
0
02 Nov 2021
Investigation of Independent Reinforcement Learning Algorithms in
  Multi-Agent Environments
Investigation of Independent Reinforcement Learning Algorithms in Multi-Agent Environments
Ken Ming Lee
Sriram Ganapathi Subramanian
Mark Crowley
62
12
0
01 Nov 2021
Mastering Atari Games with Limited Data
Mastering Atari Games with Limited Data
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
VLM
166
242
0
30 Oct 2021
Equivariant $Q$ Learning in Spatial Action Spaces
Equivariant QQQ Learning in Spatial Action Spaces
Dian Wang
Robin Walters
Xu Zhu
Robert Platt
107
78
0
28 Oct 2021
Cooperative Deep $Q$-learning Framework for Environments Providing Image
  Feedback
Cooperative Deep QQQ-learning Framework for Environments Providing Image Feedback
Krishnan Raghavan
Vignesh Narayanan
S. Jagannathan
VLMOffRL
62
1
0
28 Oct 2021
Extracting Expert's Goals by What-if Interpretable Modeling
Extracting Expert's Goals by What-if Interpretable Modeling
C. Chang
George Adam
Rich Caruana
Anna Goldenberg
OffRL
85
0
0
28 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Tung M. Luu
Chang D. Yoo
80
8
0
28 Oct 2021
Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
28
4
0
28 Oct 2021
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward
  Relabeling
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling
Jesús Bujalance Martín
Raphael Chekroun
Fabien Moutarde
OffRL
64
5
0
27 Oct 2021
Transfer learning with causal counterfactual reasoning in Decision
  Transformers
Transfer learning with causal counterfactual reasoning in Decision Transformers
Ayman Boustati
Hana Chockler
Daniel C. McNamee
CMLOffRLLRM
60
9
0
27 Oct 2021
Previous
123...111213...282930
Next