Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1511.05952
Cited By
Prioritized Experience Replay
18 November 2015
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Prioritized Experience Replay"
50 / 1,441 papers shown
Title
Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination
A. Mohtasib
Gerhard Neumann
Heriberto Cuayáhuitl
OffRL
44
2
0
31 Jul 2022
DRL-M4MR: An Intelligent Multicast Routing Approach Based on DQN Deep Reinforcement Learning in SDN
Chenwei Zhao
Miao Ye
Xingsi Xue
Jianhui Lv
Qiuxiang Jiang
Yong Wang
27
17
0
31 Jul 2022
Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control
T. Kanazawa
Haiyan Wang
Chetan Gupta
UQCV
37
4
0
27 Jul 2022
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy
Xiyao Wang
Wichayaporn Wongkamjan
Furong Huang
24
19
0
25 Jul 2022
Associative Memory Based Experience Replay for Deep Reinforcement Learning
Mengyuan Li
Arman Kazemi
Ann Franchesca Laguna
Sharon Hu
VLM
21
8
0
16 Jul 2022
BCRLSP: An Offline Reinforcement Learning Framework for Sequential Targeted Promotion
Fanglin Chen
Xiao Liu
Bo Tang
Zhiyu Li
Serim Hwang
Guomian Zhuang
OffRL
25
1
0
16 Jul 2022
Skill-based Model-based Reinforcement Learning
Lu Shi
Joseph J. Lim
Youngwoon Lee
37
45
0
15 Jul 2022
K-level Reasoning for Zero-Shot Coordination in Hanabi
Brandon Cui
Hengyuan Hu
Luis Pineda
Jakob N. Foerster
OffRL
LRM
33
26
0
14 Jul 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
29
11
0
12 Jul 2022
Vessel-following model for inland waterways based on deep reinforcement learning
Fabian Hart
Ostap Okhrin
M. Treiber
51
11
0
07 Jul 2022
DRL-ISP: Multi-Objective Camera ISP with Deep Reinforcement Learning
Ukcheol Shin
Kyunghyun Lee
In So Kweon
VLM
3DV
35
2
0
07 Jul 2022
Asynchronous Curriculum Experience Replay: A Deep Reinforcement Learning Approach for UAV Autonomous Motion Control in Unknown Dynamic Environments
Zijian Hu
Xiao-guang Gao
Kaifang Wan
Qianglong Wang
Yiwei Zhai
42
10
0
04 Jul 2022
A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers
Julio A. Placed
Jared Strader
Henry Carrillo
Nikolay Atanasov
Vadim Indelman
Luca Carlone
J. A. Castellanos
35
177
0
01 Jul 2022
Depth-CUPRL: Depth-Imaged Contrastive Unsupervised Prioritized Representations in Reinforcement Learning for Mapless Navigation of Unmanned Aerial Vehicles
J. C. Jesus
V. A. Kich
A. H. Kolling
Ricardo B. Grando
R. S. Guerra
P. Drews
SSL
65
18
0
30 Jun 2022
Visual Foresight With a Local Dynamics Model
Colin Kohler
Robert Platt
42
1
0
29 Jun 2022
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
32
2
0
28 Jun 2022
Analysis of Stochastic Processes through Replay Buffers
Shirli Di-Castro Shashua
Shie Mannor
Dotan Di-Castro
36
6
0
26 Jun 2022
Hierarchical Reinforcement Learning with Opponent Modeling for Distributed Multi-agent Cooperation
Zhixuan Liang
Jiannong Cao
Shan Jiang
Divya Saxena
Huafeng Xu
25
10
0
25 Jun 2022
Deep Reinforcement Learning-Assisted Federated Learning for Robust Short-term Utility Demand Forecasting in Electricity Wholesale Markets
Chenghao Huang
Weilong Chen
Shengrong Bu
Yanru Zhang
AI4TS
9
1
0
23 Jun 2022
Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer
L. N. Alegre
A. Bazzan
Bruno C. da Silva
41
26
0
22 Jun 2022
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Xiang Li
Jinghuan Shang
Srijan Das
Michael S. Ryoo
SSL
40
31
0
10 Jun 2022
Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS
Eleni Nisioti
Matéo Mahaut
Pierre-Yves Oudeyer
Ida Momennejad
Clément Moulin-Frier
27
9
0
10 Jun 2022
Introspective Experience Replay: Look Back When Surprised
Ramnath Kumar
Dheeraj M. Nagaraj
OffRL
16
2
0
07 Jun 2022
Exploring Chemical Space with Score-based Out-of-distribution Generation
Seul Lee
Jaehyeong Jo
Sung Ju Hwang
OODD
37
77
0
06 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
37
63
0
03 Jun 2022
The Phenomenon of Policy Churn
Tom Schaul
André Barreto
John Quan
Georg Ostrovski
44
26
0
01 Jun 2022
BRExIt: On Opponent Modelling in Expert Iteration
Daniel Hernández
Hendrik Baier
Michael Kaisers
6
2
0
31 May 2022
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions
Zhengyao Jiang
Tianjun Zhang
Robert Kirk
Tim Rocktaschel
Edward Grefenstette
OffRL
10
2
0
31 May 2022
Critic Sequential Monte Carlo
Vasileios Lioutas
J. Lavington
Justice Sefas
Matthew Niedoba
Yunpeng Liu
Berend Zwartsenberg
Setareh Dabiri
Frank Wood
Adam Scibior
55
7
0
30 May 2022
Reinforcement Learning for Branch-and-Bound Optimisation using Retrospective Trajectories
Christopher W. F. Parsonson
Alexandre Laterre
Thomas D. Barrett
30
19
0
28 May 2022
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning Framework
Dian Wang
Colin Kohler
Xu Zhu
Ming Jia
Robert Platt
32
9
0
28 May 2022
Multi-Phase Multi-Objective Dexterous Manipulation with Adaptive Hierarchical Curriculum
Lingfeng Tao
Jiucai Zhang
Xiaoli Zhang
40
5
0
26 May 2022
The Effect of Task Ordering in Continual Learning
Samuel J. Bell
Neil D. Lawrence
CLL
53
17
0
26 May 2022
ARLO: A Framework for Automated Reinforcement Learning
Marco Mussi
Davide Lombarda
Alberto Maria Metelli
F. Trovò
Marcello Restelli
OffRL
43
4
0
20 May 2022
A Fully Controllable Agent in the Path Planning using Goal-Conditioned Reinforcement Learning
G. Lee
35
0
0
20 May 2022
Transformer with Memory Replay
R. Liu
Barzan Mozafari
OffRL
70
4
0
19 May 2022
Data Valuation for Offline Reinforcement Learning
Amir Abolfazli
Gregory Palmer
D. Kudenko
OffRL
28
0
0
19 May 2022
Optimal Adaptive Prediction Intervals for Electricity Load Forecasting in Distribution Systems via Reinforcement Learning
Yufan Zhang
Honglin Wen
Qiuwei Wu
Qiang Ai
19
21
0
18 May 2022
Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks
Ryan M Sander
Wilko Schwarting
Tim Seyde
Igor Gilitschenski
S. Karaman
Daniela Rus
46
2
0
18 May 2022
The Primacy Bias in Deep Reinforcement Learning
Evgenii Nikishin
Max Schwarzer
P. DÓro
Pierre-Luc Bacon
Rameswar Panda
OnRL
96
182
0
16 May 2022
Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Xiaohan Hou
Jia-jia Zhang
Xinyu Wang
Jing Xiao
24
0
0
11 May 2022
Vehicle management in a modular production context using Deep Q-Learning
Lucain Pouget
Timo Hasenbichler
Jakob Auer
K. Lichtenegger
Andreas Windisch
11
0
0
06 May 2022
Learning to Solve Vehicle Routing Problems: A Survey
Aigerim Bogyrbayeva
Meraryslan Meraliyev
Taukekhan Mustakhov
Bissenbay Dauletbayev
31
24
0
05 May 2022
State Representation Learning for Goal-Conditioned Reinforcement Learning
Lorenzo Steccanella
Anders Jonsson
SSL
OffRL
37
4
0
04 May 2022
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning
Chenyu Sun
Hangwei Qian
Chunyan Miao
OffRL
34
12
0
02 May 2022
Learning user-defined sub-goals using memory editing in reinforcement learning
G. Lee
KELM
11
2
0
01 May 2022
CryoRL: Reinforcement Learning Enables Efficient Cryo-EM Data Collection
Quanfu Fan
Yilai Li
Yuguang Yao
J. M. Cohn
Sijia Liu
S. Vos
M. Cianfrocco
OffRL
28
8
0
15 Apr 2022
Retrospective on the 2021 BASALT Competition on Learning from Human Feedback
Rohin Shah
Steven H. Wang
Cody Wild
Stephanie Milani
Anssi Kanervisto
...
Alexander Fries
Alexandra Souly
Chan Jun Shern
Daniel del Castillo
Tom Lieberum
LLMAG
OffRL
24
10
0
14 Apr 2022
Safer Autonomous Driving in a Stochastic, Partially-Observable Environment by Hierarchical Contingency Planning
Ugo Lecerf
Christelle Yemdji Tchassi
Pietro Michiardi
30
1
0
13 Apr 2022
When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?
Aviral Kumar
Joey Hong
Anika Singh
Sergey Levine
OffRL
50
77
0
12 Apr 2022
Previous
1
2
3
...
9
10
11
...
27
28
29
Next