ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.05952
  4. Cited By
Prioritized Experience Replay
v1v2v3v4 (latest)

Prioritized Experience Replay

18 November 2015
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Prioritized Experience Replay"

50 / 1,454 papers shown
Title
Regularly Updated Deterministic Policy Gradient Algorithm
Regularly Updated Deterministic Policy Gradient Algorithm
Shuai Han
Wenbo Zhou
Shuai Lu
Jiayu Yu
25
22
0
01 Jul 2020
Model-based Reinforcement Learning: A Survey
Model-based Reinforcement Learning: A Survey
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
132
49
0
30 Jun 2020
Model-based Reinforcement Learning for Semi-Markov Decision Processes
  with Neural ODEs
Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs
Jianzhun Du
Joseph D. Futoma
Finale Doshi-Velez
84
52
0
29 Jun 2020
Learning predictive representations in autonomous driving to improve
  deep reinforcement learning
Learning predictive representations in autonomous driving to improve deep reinforcement learning
D. Graves
Nhat M. Nguyen
Kimia Hassanzadeh
Jun Jin
SSL
80
12
0
26 Jun 2020
Widening the Pipeline in Human-Guided Reinforcement Learning with
  Explanation and Context-Aware Data Augmentation
Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation
L. Guan
Mudit Verma
Sihang Guo
Ruohan Zhang
Subbarao Kambhampati
143
43
0
26 Jun 2020
Reinforcement Learning and its Connections with Neuroscience and
  Psychology
Reinforcement Learning and its Connections with Neuroscience and Psychology
Ajay Subramanian
Sharad Chitlangia
V. Baths
OffRL
161
32
0
25 Jun 2020
Experience Replay with Likelihood-free Importance Weights
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
105
58
0
23 Jun 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement
  Learning
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulic
OffRL
76
27
0
23 Jun 2020
NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online
  Weight Adjustment for Exploration
NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online Weight Adjustment for Exploration
Shuai Han
Wenbo Zhou
Jing Liu
Shuai Lu
45
28
0
19 Jun 2020
Forgetful Experience Replay in Hierarchical Reinforcement Learning from
  Demonstrations
Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations
Alexey Skrynnik
A. Staroverov
Ermek Aitygulov
Kirill Aksenov
Vasilii Davydov
Aleksandr I. Panov
OffRL
66
4
0
17 Jun 2020
Green Simulation Assisted Reinforcement Learning with Model Risk for
  Biomanufacturing Learning and Control
Green Simulation Assisted Reinforcement Learning with Model Risk for Biomanufacturing Learning and Control
Hua Zheng
Wei Xie
M. Feng
OffRL
19
5
0
17 Jun 2020
Learning About Objects by Learning to Interact with Them
Learning About Objects by Learning to Interact with Them
Martin Lohmann
Jordi Salvador
Aniruddha Kembhavi
Roozbeh Mottaghi
OCL
76
18
0
16 Jun 2020
Reinforcement Learning Control of Robotic Knee with Human in the Loop by
  Flexible Policy Iteration
Reinforcement Learning Control of Robotic Knee with Human in the Loop by Flexible Policy Iteration
Xiang Gao
J. Si
Yue Wen
Minhan Li
He
H. Huang
55
32
0
16 Jun 2020
Least Squares Regression with Markovian Data: Fundamental Limits and
  Algorithms
Least Squares Regression with Markovian Data: Fundamental Limits and Algorithms
Guy Bresler
Prateek Jain
Dheeraj M. Nagaraj
Praneeth Netrapalli
Xian Wu
88
61
0
16 Jun 2020
Hindsight Expectation Maximization for Goal-conditioned Reinforcement
  Learning
Hindsight Expectation Maximization for Goal-conditioned Reinforcement Learning
Yunhao Tang
A. Kucukelbir
OffRL
68
16
0
13 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
108
24
0
12 Jun 2020
Continuous Control for Searching and Planning with a Learned Model
Continuous Control for Searching and Planning with a Learned Model
Xuxi Yang
Werner Duvaud
Peng Wei
66
5
0
12 Jun 2020
Bayesian Experience Reuse for Learning from Multiple Demonstrators
Bayesian Experience Reuse for Learning from Multiple Demonstrators
Michael Gimelfarb
Scott Sanner
Chi-Guhn Lee
33
0
0
10 Jun 2020
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization
  without Compounding Errors
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization without Compounding Errors
Chi Zhang
S. Kuppannagari
Viktor Prasanna
43
4
0
08 Jun 2020
Balancing a CartPole System with Reinforcement Learning -- A Tutorial
Balancing a CartPole System with Reinforcement Learning -- A Tutorial
S. Kumar
43
24
0
08 Jun 2020
A Comparison of Self-Play Algorithms Under a Generalized Framework
A Comparison of Self-Play Algorithms Under a Generalized Framework
Daniel Hernández
Kevin Denamganai
Sam Devlin
Spyridon Samothrakis
James Alfred Walker
63
12
0
08 Jun 2020
Deep Reinforcement Learning for Human-Like Driving Policies in Collision
  Avoidance Tasks of Self-Driving Cars
Deep Reinforcement Learning for Human-Like Driving Policies in Collision Avoidance Tasks of Self-Driving Cars
Ran Emuna
A. Borowsky
Armin Biess
79
24
0
07 Jun 2020
Combining Reinforcement Learning and Constraint Programming for
  Combinatorial Optimization
Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization
Quentin Cappart
Thierry Moisan
Louis-Martin Rousseau
Isabeau Prémont-Schwarz
A. Ciré
87
145
0
02 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
143
226
0
01 Jun 2020
Manipulating the Distributions of Experience used for Self-Play Learning
  in Expert Iteration
Manipulating the Distributions of Experience used for Self-Play Learning in Expert Iteration
Dennis J. N. J. Soemers
Éric Piette
Matthew Stephenson
C. Browne
OffRL
72
6
0
30 May 2020
Reinforcement Learning with Iterative Reasoning for Merging in Dense
  Traffic
Reinforcement Learning with Iterative Reasoning for Merging in Dense Traffic
Maxime Bouton
A. Nakhaei
David Isele
K. Fujimura
Mykel J. Kochenderfer
120
34
0
25 May 2020
Learning visual servo policies via planner cloning
Learning visual servo policies via planner cloning
Ulrich Viereck
Kate Saenko
Robert Platt
OffRL
35
2
0
24 May 2020
Reinforcement Learning with General Value Function Approximation:
  Provably Efficient Approach via Bounded Eluder Dimension
Reinforcement Learning with General Value Function Approximation: Provably Efficient Approach via Bounded Eluder Dimension
Ruosong Wang
Ruslan Salakhutdinov
Lin F. Yang
109
55
0
21 May 2020
Reinforcement Learning for Variable Selection in a Branch and Bound
  Algorithm
Reinforcement Learning for Variable Selection in a Branch and Bound Algorithm
Marc Etheve
Zacharie Alès
Côme Bissuel
Olivier Juan
S. Kedad-Sidhoum
71
40
0
20 May 2020
Learning to Herd Agents Amongst Obstacles: Training Robust Shepherding
  Behaviors using Deep Reinforcement Learning
Learning to Herd Agents Amongst Obstacles: Training Robust Shepherding Behaviors using Deep Reinforcement Learning
Jixuan Zhi
Jyh-Ming Lien
44
29
0
19 May 2020
Experience Augmentation: Boosting and Accelerating Off-Policy
  Multi-Agent Reinforcement Learning
Experience Augmentation: Boosting and Accelerating Off-Policy Multi-Agent Reinforcement Learning
Zhenhui Ye
Yining Chen
Guang-hua Song
Bowei Yang
Sheng Fan
OffRL
77
7
0
19 May 2020
Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement
  Learning: An In Silico Validation
Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation
Taiyu Zhu
Kezhi Li
P. Herrero
Pantelis Georgiou
64
84
0
18 May 2020
Probabilistic Guarantees for Safe Deep Reinforcement Learning
Probabilistic Guarantees for Safe Deep Reinforcement Learning
E. Bacci
David Parker
91
27
0
14 May 2020
Unbiased Deep Reinforcement Learning: A General Training Framework for
  Existing and Future Algorithms
Unbiased Deep Reinforcement Learning: A General Training Framework for Existing and Future Algorithms
Huihui Zhang
Wu Huang
OODOffRL
25
1
0
12 May 2020
Mobile Robot Path Planning in Dynamic Environments through Globally
  Guided Reinforcement Learning
Mobile Robot Path Planning in Dynamic Environments through Globally Guided Reinforcement Learning
Binyu Wang
Yanfeng Guo
Qingbiao Li
Amanda Prorok
59
233
0
11 May 2020
Is Deep Reinforcement Learning Ready for Practical Applications in
  Healthcare? A Sensitivity Analysis of Duel-DDQN for Hemodynamic Management in
  Sepsis Patients
Is Deep Reinforcement Learning Ready for Practical Applications in Healthcare? A Sensitivity Analysis of Duel-DDQN for Hemodynamic Management in Sepsis Patients
Mingyu Lu
Zachary Shahn
Daby M. Sow
Finale Doshi-Velez
Li-wei H. Lehman
OODOffRL
67
3
0
08 May 2020
Discrete-to-Deep Supervised Policy Learning
Discrete-to-Deep Supervised Policy Learning
B. Kurniawan
Peter Vamplew
Michael Papasimeon
Richard Dazeley
Cameron Foale
OffRL
16
3
0
05 May 2020
ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and
  Socially-engaged Conversational Agents
ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents
Chia-Yu Li
Daniel Ortega
Dirk Vath
Florian Lux
Lindsey Vanderlyn
...
Moritz Volkel
Pavel Denisov
Sabrina Jenne
Zorica Kacarevic
Ngoc Thang Vu
52
8
0
04 May 2020
Deep Reinforcement Learning for Intelligent Transportation Systems: A
  Survey
Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey
Ammar Haydari
Y. Yilmaz
AI4TS
114
469
0
02 May 2020
Visually Grounded Continual Learning of Compositional Phrases
Visually Grounded Continual Learning of Compositional Phrases
Xisen Jin
Junyi Du
Arka Sadhu
Ram Nevatia
Xiang Ren
CLL
61
4
0
02 May 2020
Unsupervised Learning of KB Queries in Task-Oriented Dialogs
Unsupervised Learning of KB Queries in Task-Oriented Dialogs
Dinesh Raghu
Nikhil Gupta
Mausam
OffRL
65
7
0
30 Apr 2020
Improving Target-driven Visual Navigation with Attention on 3D Spatial
  Relationships
Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships
Yunlian Lv
Ning Xie
Yimin Shi
Zijiao Wang
Jikang Cheng
34
0
0
29 Apr 2020
PBCS : Efficient Exploration and Exploitation Using a Synergy between
  Reinforcement Learning and Motion Planning
PBCS : Efficient Exploration and Exploitation Using a Synergy between Reinforcement Learning and Motion Planning
Guillaume Matheron
Nicolas Perrin
Olivier Sigaud
58
19
0
24 Apr 2020
Towards Runtime Verification of Programmable Switches
Towards Runtime Verification of Programmable Switches
Apoorv Shukla
K. Hudemann
Z. Vági
Lily Hügerich
Georgios Smaragdakis
Stefan Schmid
A. Hecker
A. Feldmann
27
3
0
22 Apr 2020
STDPG: A Spatio-Temporal Deterministic Policy Gradient Agent for Dynamic
  Routing in SDN
STDPG: A Spatio-Temporal Deterministic Policy Gradient Agent for Dynamic Routing in SDN
Juan Chen
Zhiwen Xiao
Huanlai Xing
Penglin Dai
Shouxi Luo
M. Iqbal
46
16
0
21 Apr 2020
Show Us the Way: Learning to Manage Dialog from Demonstrations
Show Us the Way: Learning to Manage Dialog from Demonstrations
Gabriel Gordon-Hall
P. Gorinski
Gerasimos Lampouras
Ignacio Iacobacci
OffRL
111
11
0
17 Apr 2020
Continual Reinforcement Learning with Multi-Timescale Replay
Continual Reinforcement Learning with Multi-Timescale Replay
Christos Kaplanis
Claudia Clopath
Murray Shanahan
CLL
51
15
0
16 Apr 2020
Extrapolation in Gridworld Markov-Decision Processes
Extrapolation in Gridworld Markov-Decision Processes
Eugene Charniak
29
0
0
14 Apr 2020
Stochastic batch size for adaptive regularization in deep network
  optimization
Stochastic batch size for adaptive regularization in deep network optimization
Kensuke Nakamura
Stefano Soatto
Byung-Woo Hong
ODL
51
6
0
14 Apr 2020
Self Punishment and Reward Backfill for Deep Q-Learning
Self Punishment and Reward Backfill for Deep Q-Learning
M. Bonyadi
Rui Wang
M. Ziaei
22
5
0
10 Apr 2020
Previous
123...181920...282930
Next