ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.05952
  4. Cited By
Prioritized Experience Replay

Prioritized Experience Replay

18 November 2015
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
    OffRL
ArXivPDFHTML

Papers citing "Prioritized Experience Replay"

50 / 1,441 papers shown
Title
Hindsight Expectation Maximization for Goal-conditioned Reinforcement
  Learning
Hindsight Expectation Maximization for Goal-conditioned Reinforcement Learning
Yunhao Tang
A. Kucukelbir
OffRL
27
16
0
13 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
33
24
0
12 Jun 2020
Continuous Control for Searching and Planning with a Learned Model
Continuous Control for Searching and Planning with a Learned Model
Xuxi Yang
Werner Duvaud
Peng Wei
33
5
0
12 Jun 2020
Bayesian Experience Reuse for Learning from Multiple Demonstrators
Bayesian Experience Reuse for Learning from Multiple Demonstrators
Michael Gimelfarb
Scott Sanner
Chi-Guhn Lee
11
0
0
10 Jun 2020
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization
  without Compounding Errors
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization without Compounding Errors
Chi Zhang
S. Kuppannagari
Viktor Prasanna
22
4
0
08 Jun 2020
Balancing a CartPole System with Reinforcement Learning -- A Tutorial
Balancing a CartPole System with Reinforcement Learning -- A Tutorial
S. Kumar
19
23
0
08 Jun 2020
A Comparison of Self-Play Algorithms Under a Generalized Framework
A Comparison of Self-Play Algorithms Under a Generalized Framework
Daniel Hernández
Kevin Denamganai
Sam Devlin
Spyridon Samothrakis
James Alfred Walker
19
12
0
08 Jun 2020
Deep Reinforcement Learning for Human-Like Driving Policies in Collision
  Avoidance Tasks of Self-Driving Cars
Deep Reinforcement Learning for Human-Like Driving Policies in Collision Avoidance Tasks of Self-Driving Cars
Ran Emuna
A. Borowsky
Armin Biess
42
22
0
07 Jun 2020
Combining Reinforcement Learning and Constraint Programming for
  Combinatorial Optimization
Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization
Quentin Cappart
Thierry Moisan
Louis-Martin Rousseau
Isabeau Prémont-Schwarz
A. Ciré
26
138
0
02 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
65
225
0
01 Jun 2020
Manipulating the Distributions of Experience used for Self-Play Learning
  in Expert Iteration
Manipulating the Distributions of Experience used for Self-Play Learning in Expert Iteration
Dennis J. N. J. Soemers
Éric Piette
Matthew Stephenson
C. Browne
OffRL
14
6
0
30 May 2020
Reinforcement Learning
Reinforcement Learning
Olivier Buffer
Olivier Pietquin
Paul Weng
OffRL
9
1
0
29 May 2020
Reinforcement Learning with Iterative Reasoning for Merging in Dense
  Traffic
Reinforcement Learning with Iterative Reasoning for Merging in Dense Traffic
Maxime Bouton
A. Nakhaei
David Isele
K. Fujimura
Mykel J. Kochenderfer
17
34
0
25 May 2020
Learning visual servo policies via planner cloning
Learning visual servo policies via planner cloning
Ulrich Viereck
Kate Saenko
Robert Platt
OffRL
11
2
0
24 May 2020
Reinforcement Learning with General Value Function Approximation:
  Provably Efficient Approach via Bounded Eluder Dimension
Reinforcement Learning with General Value Function Approximation: Provably Efficient Approach via Bounded Eluder Dimension
Ruosong Wang
Ruslan Salakhutdinov
Lin F. Yang
25
55
0
21 May 2020
Reinforcement Learning for Variable Selection in a Branch and Bound
  Algorithm
Reinforcement Learning for Variable Selection in a Branch and Bound Algorithm
Marc Etheve
Zacharie Alès
Côme Bissuel
Olivier Juan
S. Kedad-Sidhoum
16
38
0
20 May 2020
Learning to Herd Agents Amongst Obstacles: Training Robust Shepherding
  Behaviors using Deep Reinforcement Learning
Learning to Herd Agents Amongst Obstacles: Training Robust Shepherding Behaviors using Deep Reinforcement Learning
Jixuan Zhi
Jyh-Ming Lien
9
29
0
19 May 2020
Experience Augmentation: Boosting and Accelerating Off-Policy
  Multi-Agent Reinforcement Learning
Experience Augmentation: Boosting and Accelerating Off-Policy Multi-Agent Reinforcement Learning
Zhenhui Ye
Yining Chen
Guang-hua Song
Bowei Yang
Sheng Fan
OffRL
28
7
0
19 May 2020
Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement
  Learning: An In Silico Validation
Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation
Taiyu Zhu
Kezhi Li
P. Herrero
Pantelis Georgiou
24
80
0
18 May 2020
Probabilistic Guarantees for Safe Deep Reinforcement Learning
Probabilistic Guarantees for Safe Deep Reinforcement Learning
E. Bacci
David Parker
19
27
0
14 May 2020
Unbiased Deep Reinforcement Learning: A General Training Framework for
  Existing and Future Algorithms
Unbiased Deep Reinforcement Learning: A General Training Framework for Existing and Future Algorithms
Huihui Zhang
Wu Huang
OOD
OffRL
11
1
0
12 May 2020
Mobile Robot Path Planning in Dynamic Environments through Globally
  Guided Reinforcement Learning
Mobile Robot Path Planning in Dynamic Environments through Globally Guided Reinforcement Learning
Binyu Wang
Zhe Liu
Qingbiao Li
Amanda Prorok
35
224
0
11 May 2020
Is Deep Reinforcement Learning Ready for Practical Applications in
  Healthcare? A Sensitivity Analysis of Duel-DDQN for Hemodynamic Management in
  Sepsis Patients
Is Deep Reinforcement Learning Ready for Practical Applications in Healthcare? A Sensitivity Analysis of Duel-DDQN for Hemodynamic Management in Sepsis Patients
Mingyu Lu
Zachary Shahn
Daby M. Sow
Finale Doshi-Velez
Li-wei H. Lehman
OOD
OffRL
14
3
0
08 May 2020
Discrete-to-Deep Supervised Policy Learning
Discrete-to-Deep Supervised Policy Learning
B. Kurniawan
Peter Vamplew
Michael Papasimeon
Richard Dazeley
Cameron Foale
OffRL
16
3
0
05 May 2020
ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and
  Socially-engaged Conversational Agents
ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents
Chia-Yu Li
Daniel Ortega
Dirk Vath
Florian Lux
Lindsey Vanderlyn
...
Moritz Volkel
Pavel Denisov
Sabrina Jenne
Zorica Kacarevic
Ngoc Thang Vu
24
8
0
04 May 2020
Deep Reinforcement Learning for Intelligent Transportation Systems: A
  Survey
Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey
Ammar Haydari
Y. Yilmaz
AI4TS
33
455
0
02 May 2020
Visually Grounded Continual Learning of Compositional Phrases
Visually Grounded Continual Learning of Compositional Phrases
Xisen Jin
Junyi Du
Arka Sadhu
Ram Nevatia
Xiang Ren
CLL
22
4
0
02 May 2020
Unsupervised Learning of KB Queries in Task-Oriented Dialogs
Unsupervised Learning of KB Queries in Task-Oriented Dialogs
Dinesh Raghu
Nikhil Gupta
Mausam
OffRL
30
7
0
30 Apr 2020
Improving Target-driven Visual Navigation with Attention on 3D Spatial
  Relationships
Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships
Yunlian Lv
Ning Xie
Yimin Shi
Zijiao Wang
H. Shen
27
0
0
29 Apr 2020
PBCS : Efficient Exploration and Exploitation Using a Synergy between
  Reinforcement Learning and Motion Planning
PBCS : Efficient Exploration and Exploitation Using a Synergy between Reinforcement Learning and Motion Planning
Guillaume Matheron
Nicolas Perrin
Olivier Sigaud
23
18
0
24 Apr 2020
Towards Runtime Verification of Programmable Switches
Towards Runtime Verification of Programmable Switches
Apoorv Shukla
K. Hudemann
Z. Vági
Lily Hügerich
Georgios Smaragdakis
Stefan Schmid
A. Hecker
A. Feldmann
11
3
0
22 Apr 2020
STDPG: A Spatio-Temporal Deterministic Policy Gradient Agent for Dynamic
  Routing in SDN
STDPG: A Spatio-Temporal Deterministic Policy Gradient Agent for Dynamic Routing in SDN
Juan Chen
Zhiwen Xiao
Huanlai Xing
Penglin Dai
Shouxi Luo
M. Iqbal
17
16
0
21 Apr 2020
Show Us the Way: Learning to Manage Dialog from Demonstrations
Show Us the Way: Learning to Manage Dialog from Demonstrations
Gabriel Gordon-Hall
P. Gorinski
Gerasimos Lampouras
Ignacio Iacobacci
OffRL
25
11
0
17 Apr 2020
Continual Reinforcement Learning with Multi-Timescale Replay
Continual Reinforcement Learning with Multi-Timescale Replay
Christos Kaplanis
Claudia Clopath
Murray Shanahan
CLL
22
14
0
16 Apr 2020
Extrapolation in Gridworld Markov-Decision Processes
Extrapolation in Gridworld Markov-Decision Processes
Eugene Charniak
14
0
0
14 Apr 2020
Stochastic batch size for adaptive regularization in deep network
  optimization
Stochastic batch size for adaptive regularization in deep network optimization
Kensuke Nakamura
Stefano Soatto
Byung-Woo Hong
ODL
27
6
0
14 Apr 2020
Self Punishment and Reward Backfill for Deep Q-Learning
Self Punishment and Reward Backfill for Deep Q-Learning
M. Bonyadi
Rui Wang
M. Ziaei
22
4
0
10 Apr 2020
Risk-Aware High-level Decisions for Automated Driving at Occluded
  Intersections with Reinforcement Learning
Risk-Aware High-level Decisions for Automated Driving at Occluded Intersections with Reinforcement Learning
Danial Kamran
Carlos Fernandez Lopez
Martin Lauer
Christoph Stiller
16
64
0
09 Apr 2020
CURL: Contrastive Unsupervised Representations for Reinforcement
  Learning
CURL: Contrastive Unsupervised Representations for Reinforcement Learning
A. Srinivas
Michael Laskin
Pieter Abbeel
SSL
DRL
OffRL
49
1,063
0
08 Apr 2020
An Application of Deep Reinforcement Learning to Algorithmic Trading
An Application of Deep Reinforcement Learning to Algorithmic Trading
Thibaut Théate
D. Ernst
AIFin
19
162
0
07 Apr 2020
Ultrasound-Guided Robotic Navigation with Deep Reinforcement Learning
Ultrasound-Guided Robotic Navigation with Deep Reinforcement Learning
Hannes Hase
Mohammad Farid Azampour
M. Tirindelli
Magdalini Paschali
Walter Simson
E. Fatemizadeh
Nassir Navab
31
38
0
30 Mar 2020
Learning medical triage from clinicians using Deep Q-Learning
Learning medical triage from clinicians using Deep Q-Learning
A. Buchard
Baptiste Bouvier
G. Prando
R. Beard
Michail Livieratos
...
Yuanzhao Zhang
Adam Baker
Yura N. Perov
Kostis Gourgoulias
Saurabh Johri
OffRL
24
3
0
28 Mar 2020
Modeling 3D Shapes by Reinforcement Learning
Modeling 3D Shapes by Reinforcement Learning
Cheng Lin
Tingxiang Fan
Wenping Wang
Matthias Nießner
OffRL
3DV
25
36
0
27 Mar 2020
Towards Safer Self-Driving Through Great PAIN (Physically Adversarial
  Intelligent Networks)
Towards Safer Self-Driving Through Great PAIN (Physically Adversarial Intelligent Networks)
Piyush B. Gupta
Demetris Coleman
J. Siegel
AAML
29
16
0
24 Mar 2020
Multi-Agent Reinforcement Learning for Problems with Combined Individual
  and Team Reward
Multi-Agent Reinforcement Learning for Problems with Combined Individual and Team Reward
Hassam Sheikh
Ladislau Bölöni
39
36
0
24 Mar 2020
Deep Reinforcement Learning with Weighted Q-Learning
Deep Reinforcement Learning with Weighted Q-Learning
Andrea Cini
Carlo DÉramo
Jan Peters
Cesare Alippi
OffRL
34
9
0
20 Mar 2020
Robust Deep Reinforcement Learning against Adversarial Perturbations on
  State Observations
Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations
Huan Zhang
Hongge Chen
Chaowei Xiao
Yue Liu
Mingyan D. Liu
Duane S. Boning
Cho-Jui Hsieh
AAML
49
261
0
19 Mar 2020
Generating Socially Acceptable Perturbations for Efficient Evaluation of
  Autonomous Vehicles
Generating Socially Acceptable Perturbations for Efficient Evaluation of Autonomous Vehicles
Songan Zhang
H. Peng
S. Nageshrao
E. Tseng
AAML
29
5
0
18 Mar 2020
Simultaneous Navigation and Radio Mapping for Cellular-Connected UAV
  with Deep Reinforcement Learning
Simultaneous Navigation and Radio Mapping for Cellular-Connected UAV with Deep Reinforcement Learning
Yong Zeng
Xiaoli Xu
Shi Jin
Rui Zhang
9
164
0
17 Mar 2020
DisCor: Corrective Feedback in Reinforcement Learning via Distribution
  Correction
DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction
Aviral Kumar
Abhishek Gupta
Sergey Levine
OffRL
16
100
0
16 Mar 2020
Previous
123...181920...272829
Next