1511.05952
Cited By
v4 (latest)
Prioritized Experience Replay
18 November 2015
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
Papers citing
"Prioritized Experience Replay"
50 / 1,454 papers shown
Title
Deep Exploration via Randomized Value Functions
Ian Osband
Benjamin Van Roy
Daniel Russo
Zheng Wen
129
307
0
22 Mar 2017
Sensor Fusion for Robot Control through Deep Reinforcement Learning
Steven Bohez
Tim Verbelen
E. D. Coninck
B. Vankeirsbilck
Pieter Simoens
Bart Dhoedt
SSL
65
29
0
13 Mar 2017
Reinforcement Learning for Transition-Based Mention Detection
Georgiana Dinu
Wael Hamza
Radu Florian
26
1
0
13 Mar 2017
Neural Episodic Control
Alexander Pritzel
Benigno Uria
Sriram Srinivasan
A. Badia
Oriol Vinyals
Demis Hassabis
Daan Wierstra
Charles Blundell
OffRL
BDL
115
346
0
06 Mar 2017
End-to-End Task-Completion Neural Dialogue Systems
Xiujun Li
Yun-Nung Chen
Lihong Li
Jianfeng Gao
Asli Celikyilmaz
96
371
0
03 Mar 2017
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
205
478
0
28 Feb 2017
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
Nantas Nardelli
Gregory Farquhar
Triantafyllos Afouras
Philip Torr
Pushmeet Kohli
Shimon Whiteson
OffRL
202
601
0
28 Feb 2017
Learning Control for Air Hockey Striking using Deep Reinforcement Learning
Ayal Taitler
N. Shimkin
57
10
0
26 Feb 2017
Towards a Common Implementation of Reinforcement Learning for Multiple Robotic Tasks
Angel Martínez-Tenor
Juan-Antonio Fernández-Madrigal
A. Cruz-Martín
Javier González Jiménez
OffRL
34
32
0
21 Feb 2017
Learning to Multi-Task by Active Sampling
Sahil Sharma
Ashutosh Jha
Parikshit Hegde
Balaraman Ravindran
151
21
0
20 Feb 2017
Understanding Deep Learning Performance through an Examination of Test Set Difficulty: A Psychometric Case Study
John P. Lalor
Hao Wu
Tsendsuren Munkhdalai
Hong-ye Yu
ELM
59
3
0
15 Feb 2017
Sigmoid-Weighted Linear Units for Neural Network Function Approximation in Reinforcement Learning
Stefan Elfwing
E. Uchibe
Kenji Doya
145
1,762
0
10 Feb 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
346
1,550
0
25 Jan 2017
A User Simulator for Task-Completion Dialogues
Xiujun Li
Zachary Chase Lipton
Bhuwan Dhingra
Lihong Li
Jianfeng Gao
Yun-Nung Chen
OffRL
93
167
0
17 Dec 2016
Deep Reinforcement Learning with Successor Features for Navigation across Similar Environments
Jingwei Zhang
Jost Tobias Springenberg
Joschka Boedecker
Wolfram Burgard
85
295
0
16 Dec 2016
Transfer Learning Across Patient Variations with Hidden Parameter Markov Decision Processes
Taylor W. Killian
George Konidaris
Finale Doshi-Velez
OOD
44
9
0
01 Dec 2016
Playing Doom with SLAM-Augmented Deep Reinforcement Learning
Shehroze Bhatti
Alban Desmaison
O. Mikšík
Nantas Nardelli
N. Siddharth
Philip Torr
OffRL
94
69
0
01 Dec 2016
Improving Policy Gradient by Exploring Under-appreciated Rewards
Ofir Nachum
Mohammad Norouzi
Dale Schuurmans
106
44
0
28 Nov 2016
Nonparametric General Reinforcement Learning
Jan Leike
OffRL
113
26
0
28 Nov 2016
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU
Mohammad Babaeizadeh
I. Frosio
Stephen Tyree
Jason Clemons
Jan Kautz
OffRL
84
259
0
18 Nov 2016
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
121
1,229
0
16 Nov 2016
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control
Natasha Jaques
S. Gu
Dzmitry Bahdanau
José Miguel Hernández-Lobato
Richard Turner
Douglas Eck
186
173
0
09 Nov 2016
Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning
Oron Anschel
Nir Baram
N. Shimkin
117
318
0
07 Nov 2016
Combining policy gradient and Q-learning
Brendan O'Donoghue
Rémi Munos
Koray Kavukcuoglu
Volodymyr Mnih
OffRL
OnRL
109
140
0
05 Nov 2016
Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening
Frank S. He
Yang Liu
Alex Schwing
Jian-wei Peng
91
84
0
05 Nov 2016
Sample Efficient Actor-Critic with Experience Replay
Ziyun Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
136
763
0
03 Nov 2016
Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics
J. M. Wong
79
11
0
01 Nov 2016
Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision
Chen Liang
Jonathan Berant
Quoc V. Le
Kenneth D. Forbus
Ni Lao
NAI
140
406
0
31 Oct 2016
Online Contrastive Divergence with Generative Replay: Experience Replay without Storing Data
Decebal Constantin Mocanu
M. T. Vega
Eric Eaton
Peter Stone
A. Liotta
OffRL
98
26
0
18 Oct 2016
Multi-Objective Deep Reinforcement Learning
Hossam Mossalam
Yannis Assael
D. Roijers
Shimon Whiteson
83
155
0
09 Oct 2016
Supervision via Competition: Robot Adversaries for Learning Tasks
Lerrel Pinto
James Davidson
Abhinav Gupta
SSL
94
82
0
05 Oct 2016
Playing FPS Games with Deep Reinforcement Learning
Guillaume Lample
Devendra Singh Chaplot
OffRL
EgoV
100
589
0
18 Sep 2016
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Zachary Chase Lipton
Xiujun Li
Jianfeng Gao
Lihong Li
Faisal Ahmed
Li Deng
97
6
0
17 Aug 2016
Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay
Ionel-Alexandru Hosu
Traian Rebedea
88
97
0
18 Jul 2016
Deep Reinforcement Learning With Macro-Actions
Ishan Durugkar
Clemens Rosenbaum
S. Dernbach
Sridhar Mahadevan
58
25
0
15 Jun 2016
Model-Free Episodic Control
Charles Blundell
Benigno Uria
Alexander Pritzel
Yazhe Li
Avraham Ruderman
Joel Z Leibo
Jack W. Rae
Daan Wierstra
Demis Hassabis
OffRL
BDL
59
250
0
14 Jun 2016
Safe and Efficient Off-Policy Reinforcement Learning
Rémi Munos
T. Stepleton
Anna Harutyunyan
Marc G. Bellemare
OffRL
177
619
0
08 Jun 2016
Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning
Tiancheng Zhao
M. Eskénazi
120
265
0
08 Jun 2016
Deep Successor Reinforcement Learning
Tejas D. Kulkarni
A. Saeedi
Simanta Gautam
S. Gershman
80
209
0
08 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
195
1,487
0
06 Jun 2016
Dynamic Frame skip Deep Q Network
A. Srinivas
Sahil Sharma
Balaraman Ravindran
78
23
0
17 May 2016
A Deep Hierarchical Approach to Lifelong Learning in Minecraft
Chen Tessler
Shahar Givony
Tom Zahavy
D. Mankowitz
Shie Mannor
CLL
181
381
0
25 Apr 2016
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation
Tejas D. Kulkarni
Karthik Narasimhan
A. Saeedi
J. Tenenbaum
135
1,144
0
20 Apr 2016
Continuous Deep Q-Learning with Model-based Acceleration
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine
127
1,013
0
02 Mar 2016
Learning values across many orders of magnitude
H. V. Hasselt
A. Guez
Matteo Hessel
Volodymyr Mnih
David Silver
88
170
0
24 Feb 2016
Deep Exploration via Bootstrapped DQN
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
138
1,315
0
15 Feb 2016
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks
Jakob N. Foerster
Yannis Assael
Nando de Freitas
Shimon Whiteson
85
147
0
08 Feb 2016
Graying the black box: Understanding DQNs
Tom Zahavy
Nir Ben-Zrihem
Shie Mannor
84
263
0
08 Feb 2016
Ensemble Robustness and Generalization of Stochastic Deep Learning Algorithms
Tom Zahavy
Bingyi Kang
Alex Sivak
Jiashi Feng
Huan Xu
Shie Mannor
OOD
AAML
101
12
0
07 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
389
8,901
0
04 Feb 2016