1511.05952
Cited By
Prioritized Experience Replay
18 November 2015
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
Papers citing
"Prioritized Experience Replay"
50 / 1,454 papers shown
Adaptive Behavior Generation for Autonomous Driving using Deep Reinforcement Learning with Compact Semantic States
Peter Wolf
Karl Kurzer
Tobias Wingert
Florian Kuhnt
Johann Marius Zöllner
60
56
0
10 Sep 2018
Neural Guided Constraint Logic Programming for Program Synthesis
Lisa Zhang
Gregory Rosenblatt
Ethan Fetaya
Renjie Liao
William E. Byrd
M. Might
R. Urtasun
R. Zemel
NAI
147
30
0
08 Sep 2018
Challenges of Context and Time in Reinforcement Learning: Introducing Space Fortress as a Benchmark
Akshat Agarwal
Ryan Hope
Katia Sycara
OffRL
34
9
0
06 Sep 2018
ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience Replay
Sameera Lanka
Tianfu Wu
66
30
0
06 Sep 2018
Sample-Efficient Imitation Learning via Generative Adversarial Nets
Lionel Blondé
Alexandros Kalousis
GAN
67
47
0
06 Sep 2018
Goal-oriented Dialogue Policy Learning from Failures
Keting Lu
Shiqi Zhang
Xiaoping Chen
OffRL
46
29
0
20 Aug 2018
Reinforcement Learning for Autonomous Defence in Software-Defined Networking
Yi Han
Benjamin I. P. Rubinstein
Tamas Abraham
T. Alpcan
O. Vel
S. Erfani
David Hubczenko
C. Leckie
Paul Montague
AAML
55
69
0
17 Aug 2018
Small Sample Learning in Big Data Era
Jun Shu
Zongben Xu
Deyu Meng
108
72
0
14 Aug 2018
A Survey of Machine and Deep Learning Methods for Internet of Things (IoT) Security
M. Al-garadi
Amr M. Mohamed
A. Al-Ali
Xiaojiang Du
Mohsen Guizani
98
834
0
29 Jul 2018
FuzzerGym: A Competitive Framework for Fuzzing and Learning
W. Drozd
Michael D. Wagner
66
33
0
19 Jul 2018
Discrete linear-complexity reinforcement learning in continuous action spaces for Q-learning algorithms
P. Tavallali
G. Doran
L. Mandrake
28
0
0
16 Jul 2018
Remember and Forget for Experience Replay
G. Novati
Petros Koumoutsakos
OffRL
108
92
0
16 Jul 2018
Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Hugo Penedones
Damien Vincent
Hartmut Maennel
Sylvain Gelly
Timothy A. Mann
André Barreto
AAML
38
7
0
09 Jul 2018
Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing
Chen Liang
Mohammad Norouzi
Jonathan Berant
Quoc V. Le
Ni Lao
134
134
0
06 Jul 2018
Goal-oriented Trajectories for Efficient Exploration
Fabio Pardo
Vitaly Levdik
Petar Kormushev
33
2
0
05 Jul 2018
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization
Xiangxiang Chu
105
9
0
02 Jul 2018
Learning to Drive in a Day
Alex Kendall
Jeffrey Hawke
David Janz
Przemyslaw Mazur
Daniele Reda
John M. Allen
Vinh-Dieu Lam
Alex Bewley
Amar Shah
115
659
0
01 Jul 2018
Deictic Image Maps: An Abstraction For Learning Pose Invariant Manipulation Policies
Robert Platt
Colin Kohler
Marcus Gualtieri
93
11
0
26 Jun 2018
Q-DeckRec: A Fast Deck Recommendation System for Collectible Card Games
Zhengxing Chen
Chris Amato
Truong-Huy D. Nguyen
Seth Cooper
Yizhou Sun
M. S. El-Nasr
BDL
61
40
0
26 Jun 2018
Learning-to-Ask: Knowledge Acquisition via 20 Questions
Yihong Chen
B. Chen
Xuguang Duan
Jian-Guang Lou
Yue Wang
Wenwu Zhu
Yong Cao
54
15
0
22 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
130
222
0
20 Jun 2018
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation
J. Matas
Stephen James
Andrew J. Davison
AI4CE
77
361
0
20 Jun 2018
Evolving simple programs for playing Atari games
Dennis G. Wilson
Sylvain Cussat-Blanc
H. Luga
J. Miller
74
62
0
14 Jun 2018
Self-Imitation Learning
Junhyuk Oh
Yijie Guo
Satinder Singh
Honglak Lee
SSL
88
251
0
14 Jun 2018
Implicit Quantile Networks for Distributional Reinforcement Learning
Will Dabney
Georg Ostrovski
David Silver
Rémi Munos
OffRL
174
535
0
14 Jun 2018
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network
Wenjia Meng
Qian Zheng
L. Yang
Pengfei Li
Gang Pan
45
21
0
14 Jun 2018
Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-based Planning in Continuous State Domains
Yangchen Pan
M. Zaheer
Adam White
Andrew Patterson
Martha White
80
46
0
12 Jun 2018
Multi-Agent Deep Reinforcement Learning with Human Strategies
Thanh Nguyen
Ngoc Duy Nguyen
S. Nahavandi
82
12
0
12 Jun 2018
Deep Curiosity Loops in Social Environments
Jonatan Barkan
Goren Gordon
33
2
0
10 Jun 2018
Learning to Search in Long Documents Using Document Structure
Mor Geva
Jonathan Berant
RALM
80
15
0
09 Jun 2018
Randomized Prior Functions for Deep Reinforcement Learning
Ian Osband
John Aslanides
Albin Cassirer
UQCV
BDL
97
380
0
08 Jun 2018
Program Synthesis Through Reinforcement Learning Guided Tree Search
Riley Simmons-Edler
Anders Miltner
Sebastian Seung
131
11
0
08 Jun 2018
Between Progress and Potential Impact of AI: the Neglected Dimensions
Fernando Martínez-Plumed
S. Avin
Miles Brundage
Allan Dafoe
Seán Ó hÉigeartaigh
José Hernández-Orallo
64
3
0
02 Jun 2018
Deep Curiosity Search: Intra-Life Exploration Can Improve Performance on Challenging Deep Reinforcement Learning Problems
C. Stanton
Jeff Clune
LRM
62
41
0
01 Jun 2018
Sequential Attacks on Agents for Long-Term Adversarial Goals
E. Tretschk
Seong Joon Oh
Mario Fritz
OnRL
409
48
1
31 May 2018
Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation
Shani Gamrian
Yoav Goldberg
124
108
0
31 May 2018
Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update
Su Young Lee
Sung-Ik Choi
Sae-Young Chung
BDL
79
75
0
31 May 2018
Observe and Look Further: Achieving Consistent Performance on Atari
Tobias Pohlen
Bilal Piot
Todd Hester
M. G. Azar
Dan Horgan
...
John Quan
Mel Vecerík
Matteo Hessel
Rémi Munos
Olivier Pietquin
68
121
0
29 May 2018
Bayesian Inference with Anchored Ensembles of Neural Networks, and Application to Exploration in Reinforcement Learning
Tim Pearce
Nicolas Anastassacos
Mohamed H. Zaki
A. Neely
BDL
UQCV
83
17
0
29 May 2018
Meta-Gradient Reinforcement Learning
Zhongwen Xu
H. V. Hasselt
David Silver
117
327
0
24 May 2018
Deep Reinforcement Learning For Sequence to Sequence Models
Yaser Keneshloo
Tian Shi
Naren Ramakrishnan
Chandan K. Reddy
AIMat
3DV
OffRL
92
211
0
24 May 2018
Meta-Learning with Hessian-Free Approach in Deep Neural Nets Training
Boyu Chen
Wenlian Lu
Ernest Fokoue
52
1
0
22 May 2018
Constrained Policy Improvement for Safe and Efficient Reinforcement Learning
Elad Sarafian
Aviv Tamar
Sarit Kraus
OffRL
60
11
0
20 May 2018
A Lyapunov-based Approach to Safe Reinforcement Learning
Yinlam Chow
Ofir Nachum
Edgar A. Duénez-Guzmán
Mohammad Ghavamzadeh
165
510
0
20 May 2018
Episodic Memory Deep Q-Networks
Zichuan Lin
Tianqi Zhao
Guangwen Yang
Lintao Zhang
OffRL
61
87
0
19 May 2018
Learning Permutations with Sinkhorn Policy Gradient
Patrick Emami
Sanjay Ranka
65
55
0
18 May 2018
Language Expansion In Text-Based Games
Ghulam Ahmed Ansari
P. SagarJ.
A. Chandar
Balaraman Ravindran
LLMAG
37
8
0
17 May 2018
Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning
Tharindu Fernando
Simon Denman
Sridha Sridharan
Clinton Fookes
GAN
80
26
0
13 May 2018
Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes
T. P. Le
Ngo Anh Vien
Abu Layek
TaeChoong Chung
53
52
0
11 May 2018
Deep Reinforcement Learning for Optimal Control of Space Heating
Ádám Nagy
H. Kazmi
Farah Cheaib
Johan Driesen
AI4CE
52
45
0
10 May 2018