ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.05952
  4. Cited By
Prioritized Experience Replay
v1v2v3v4 (latest)

Prioritized Experience Replay

18 November 2015
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Prioritized Experience Replay"

50 / 1,454 papers shown
Title
Adaptive Behavior Generation for Autonomous Driving using Deep
  Reinforcement Learning with Compact Semantic States
Adaptive Behavior Generation for Autonomous Driving using Deep Reinforcement Learning with Compact Semantic States
Peter Wolf
Karl Kurzer
Tobias Wingert
Florian Kuhnt
Johann Marius Zöllner
60
56
0
10 Sep 2018
Neural Guided Constraint Logic Programming for Program Synthesis
Neural Guided Constraint Logic Programming for Program Synthesis
Lisa Zhang
Gregory Rosenblatt
Ethan Fetaya
Renjie Liao
William E. Byrd
M. Might
R. Urtasun
R. Zemel
NAI
147
30
0
08 Sep 2018
Challenges of Context and Time in Reinforcement Learning: Introducing
  Space Fortress as a Benchmark
Challenges of Context and Time in Reinforcement Learning: Introducing Space Fortress as a Benchmark
Akshat Agarwal
Ryan Hope
Katia Sycara
OffRL
34
9
0
06 Sep 2018
ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience
  Replay
ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience Replay
Sameera Lanka
Tianfu Wu
66
30
0
06 Sep 2018
Sample-Efficient Imitation Learning via Generative Adversarial Nets
Sample-Efficient Imitation Learning via Generative Adversarial Nets
Lionel Blondé
Alexandros Kalousis
GAN
67
47
0
06 Sep 2018
Goal-oriented Dialogue Policy Learning from Failures
Goal-oriented Dialogue Policy Learning from Failures
Keting Lu
Shiqi Zhang
Xiaoping Chen
OffRL
46
29
0
20 Aug 2018
Reinforcement Learning for Autonomous Defence in Software-Defined
  Networking
Reinforcement Learning for Autonomous Defence in Software-Defined Networking
Yi Han
Benjamin I. P. Rubinstein
Tamas Abraham
T. Alpcan
O. Vel
S. Erfani
David Hubczenko
C. Leckie
Paul Montague
AAML
55
69
0
17 Aug 2018
Small Sample Learning in Big Data Era
Small Sample Learning in Big Data Era
Jun Shu
Zongben Xu
Deyu Meng
108
72
0
14 Aug 2018
A Survey of Machine and Deep Learning Methods for Internet of Things
  (IoT) Security
A Survey of Machine and Deep Learning Methods for Internet of Things (IoT) Security
M. Al-garadi
Amr M. Mohamed
A. Al-Ali
Xiaojiang Du
Mohsen Guizani
98
834
0
29 Jul 2018
FuzzerGym: A Competitive Framework for Fuzzing and Learning
FuzzerGym: A Competitive Framework for Fuzzing and Learning
W. Drozd
Michael D. Wagner
66
33
0
19 Jul 2018
Discrete linear-complexity reinforcement learning in continuous action
  spaces for Q-learning algorithms
Discrete linear-complexity reinforcement learning in continuous action spaces for Q-learning algorithms
P. Tavallali
G. Doran
L. Mandrake
28
0
0
16 Jul 2018
Remember and Forget for Experience Replay
Remember and Forget for Experience Replay
G. Novati
Petros Koumoutsakos
OffRL
108
92
0
16 Jul 2018
Temporal Difference Learning with Neural Networks - Study of the Leakage
  Propagation Problem
Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Hugo Penedones
Damien Vincent
Hartmut Maennel
Sylvain Gelly
Timothy A. Mann
André Barreto
AAML
38
7
0
09 Jul 2018
Memory Augmented Policy Optimization for Program Synthesis and Semantic
  Parsing
Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing
Chen Liang
Mohammad Norouzi
Jonathan Berant
Quoc V. Le
Ni Lao
134
134
0
06 Jul 2018
Goal-oriented Trajectories for Efficient Exploration
Goal-oriented Trajectories for Efficient Exploration
Fabio Pardo
Vitaly Levdik
Petar Kormushev
33
2
0
05 Jul 2018
Policy Optimization With Penalized Point Probability Distance: An
  Alternative To Proximal Policy Optimization
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization
Xiangxiang Chu
105
9
0
02 Jul 2018
Learning to Drive in a Day
Learning to Drive in a Day
Alex Kendall
Jeffrey Hawke
David Janz
Przemyslaw Mazur
Daniele Reda
John M. Allen
Vinh-Dieu Lam
Alex Bewley
Amar Shah
115
659
0
01 Jul 2018
Deictic Image Maps: An Abstraction For Learning Pose Invariant
  Manipulation Policies
Deictic Image Maps: An Abstraction For Learning Pose Invariant Manipulation Policies
Robert Platt
Colin Kohler
Marcus Gualtieri
93
11
0
26 Jun 2018
Q-DeckRec: A Fast Deck Recommendation System for Collectible Card Games
Q-DeckRec: A Fast Deck Recommendation System for Collectible Card Games
Zhengxing Chen
Chris Amato
Truong-Huy D. Nguyen
Seth Cooper
Yizhou Sun
M. S. El-Nasr
BDL
61
40
0
26 Jun 2018
Learning-to-Ask: Knowledge Acquisition via 20 Questions
Learning-to-Ask: Knowledge Acquisition via 20 Questions
Yihong Chen
B. Chen
Xuguang Duan
Jian-Guang Lou
Yue Wang
Wenwu Zhu
Yong Cao
54
15
0
22 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
130
222
0
20 Jun 2018
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation
J. Matas
Stephen James
Andrew J. Davison
AI4CE
77
361
0
20 Jun 2018
Evolving simple programs for playing Atari games
Evolving simple programs for playing Atari games
Dennis G. Wilson
Sylvain Cussat-Blanc
H. Luga
J. Miller
74
62
0
14 Jun 2018
Self-Imitation Learning
Self-Imitation Learning
Junhyuk Oh
Yijie Guo
Satinder Singh
Honglak Lee
SSL
88
251
0
14 Jun 2018
Implicit Quantile Networks for Distributional Reinforcement Learning
Implicit Quantile Networks for Distributional Reinforcement Learning
Will Dabney
Georg Ostrovski
David Silver
Rémi Munos
OffRL
174
535
0
14 Jun 2018
Qualitative Measurements of Policy Discrepancy for Return-Based Deep
  Q-Network
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network
Wenjia Meng
Qian Zheng
L. Yang
Pengfei Li
Gang Pan
45
21
0
14 Jun 2018
Organizing Experience: A Deeper Look at Replay Mechanisms for
  Sample-based Planning in Continuous State Domains
Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-based Planning in Continuous State Domains
Yangchen Pan
M. Zaheer
Adam White
Andrew Patterson
Martha White
80
46
0
12 Jun 2018
Multi-Agent Deep Reinforcement Learning with Human Strategies
Multi-Agent Deep Reinforcement Learning with Human Strategies
Thanh Nguyen
Ngoc Duy Nguyen
S. Nahavandi
82
12
0
12 Jun 2018
Deep Curiosity Loops in Social Environments
Deep Curiosity Loops in Social Environments
Jonatan Barkan
Goren Gordon
33
2
0
10 Jun 2018
Learning to Search in Long Documents Using Document Structure
Learning to Search in Long Documents Using Document Structure
Mor Geva
Jonathan Berant
RALM
80
15
0
09 Jun 2018
Randomized Prior Functions for Deep Reinforcement Learning
Randomized Prior Functions for Deep Reinforcement Learning
Ian Osband
John Aslanides
Albin Cassirer
UQCVBDL
97
380
0
08 Jun 2018
Program Synthesis Through Reinforcement Learning Guided Tree Search
Program Synthesis Through Reinforcement Learning Guided Tree Search
Riley Simmons-Edler
Anders Miltner
Sebastian Seung
131
11
0
08 Jun 2018
Between Progress and Potential Impact of AI: the Neglected Dimensions
Between Progress and Potential Impact of AI: the Neglected Dimensions
Fernando Martínez-Plumed
S. Avin
Miles Brundage
Allan Dafoe
Seán Ó hÉigeartaigh
José Hernández-Orallo
64
3
0
02 Jun 2018
Deep Curiosity Search: Intra-Life Exploration Can Improve Performance on
  Challenging Deep Reinforcement Learning Problems
Deep Curiosity Search: Intra-Life Exploration Can Improve Performance on Challenging Deep Reinforcement Learning Problems
C. Stanton
Jeff Clune
LRM
62
41
0
01 Jun 2018
Sequential Attacks on Agents for Long-Term Adversarial Goals
Sequential Attacks on Agents for Long-Term Adversarial Goals
E. Tretschk
Seong Joon Oh
Mario Fritz
OnRL
409
48
1
31 May 2018
Transfer Learning for Related Reinforcement Learning Tasks via
  Image-to-Image Translation
Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation
Shani Gamrian
Yoav Goldberg
124
108
0
31 May 2018
Sample-Efficient Deep Reinforcement Learning via Episodic Backward
  Update
Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update
Su Young Lee
Sung-Ik Choi
Sae-Young Chung
BDL
79
75
0
31 May 2018
Observe and Look Further: Achieving Consistent Performance on Atari
Observe and Look Further: Achieving Consistent Performance on Atari
Tobias Pohlen
Bilal Piot
Todd Hester
M. G. Azar
Dan Horgan
...
John Quan
Mel Vecerík
Matteo Hessel
Rémi Munos
Olivier Pietquin
68
121
0
29 May 2018
Bayesian Inference with Anchored Ensembles of Neural Networks, and
  Application to Exploration in Reinforcement Learning
Bayesian Inference with Anchored Ensembles of Neural Networks, and Application to Exploration in Reinforcement Learning
Tim Pearce
Nicolas Anastassacos
Mohamed H. Zaki
A. Neely
BDLUQCV
83
17
0
29 May 2018
Meta-Gradient Reinforcement Learning
Meta-Gradient Reinforcement Learning
Zhongwen Xu
H. V. Hasselt
David Silver
117
327
0
24 May 2018
Deep Reinforcement Learning For Sequence to Sequence Models
Deep Reinforcement Learning For Sequence to Sequence Models
Yaser Keneshloo
Tian Shi
Naren Ramakrishnan
Chandan K. Reddy
AIMat3DVOffRL
92
211
0
24 May 2018
Meta-Learning with Hessian-Free Approach in Deep Neural Nets Training
Meta-Learning with Hessian-Free Approach in Deep Neural Nets Training
Boyu Chen
Wenlian Lu
Ernest Fokoue
52
1
0
22 May 2018
Constrained Policy Improvement for Safe and Efficient Reinforcement
  Learning
Constrained Policy Improvement for Safe and Efficient Reinforcement Learning
Elad Sarafian
Aviv Tamar
Sarit Kraus
OffRL
60
11
0
20 May 2018
A Lyapunov-based Approach to Safe Reinforcement Learning
A Lyapunov-based Approach to Safe Reinforcement Learning
Yinlam Chow
Ofir Nachum
Edgar A. Duénez-Guzmán
Mohammad Ghavamzadeh
165
510
0
20 May 2018
Episodic Memory Deep Q-Networks
Episodic Memory Deep Q-Networks
Zichuan Lin
Tianqi Zhao
Guangwen Yang
Lintao Zhang
OffRL
61
87
0
19 May 2018
Learning Permutations with Sinkhorn Policy Gradient
Learning Permutations with Sinkhorn Policy Gradient
Patrick Emami
Sanjay Ranka
65
55
0
18 May 2018
Language Expansion In Text-Based Games
Language Expansion In Text-Based Games
Ghulam Ahmed Ansari
P. SagarJ.
A. Chandar
Balaraman Ravindran
LLMAG
37
8
0
17 May 2018
Learning Temporal Strategic Relationships using Generative Adversarial
  Imitation Learning
Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning
Tharindu Fernando
Simon Denman
Sridha Sridharan
Clinton Fookes
GAN
80
26
0
13 May 2018
Deep Hierarchical Reinforcement Learning Algorithm in Partially
  Observable Markov Decision Processes
Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes
T. P. Le
Ngo Anh Vien
Abu Layek
TaeChoong Chung
53
52
0
11 May 2018
Deep Reinforcement Learning for Optimal Control of Space Heating
Deep Reinforcement Learning for Optimal Control of Space Heating
Ádám Nagy
H. Kazmi
Farah Cheaib
Johan Driesen
AI4CE
52
45
0
10 May 2018
Previous
123...252627282930
Next