Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.06461
Cited By
v1
v2
v3 (latest)
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
41 / 2,291 papers shown
Title
Nonparametric General Reinforcement Learning
Jan Leike
OffRL
102
26
0
28 Nov 2016
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU
Mohammad Babaeizadeh
I. Frosio
Stephen Tyree
Jason Clemons
Jan Kautz
OffRL
80
259
0
18 Nov 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
143
776
0
15 Nov 2016
Playing SNES in the Retro Learning Environment
Nadav Bhonker
Shai Rozenberg
Itay Hubara
63
19
0
07 Nov 2016
Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning
Oron Anschel
Nir Baram
N. Shimkin
102
318
0
07 Nov 2016
Combining policy gradient and Q-learning
Brendan O'Donoghue
Rémi Munos
Koray Kavukcuoglu
Volodymyr Mnih
OffRL
OnRL
105
140
0
05 Nov 2016
Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening
Frank S. He
Yang Liu
Alex Schwing
Jian-wei Peng
91
84
0
05 Nov 2016
Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics
J. M. Wong
79
11
0
01 Nov 2016
Learning Runtime Parameters in Computer Systems with Delayed Experience Injection
Michael Schaarschmidt
Felix Gessert
Valentin Dalibard
Eiko Yoneki
30
9
0
31 Oct 2016
Particle Swarm Optimization for Generating Interpretable Fuzzy Reinforcement Learning Policies
D. Hein
A. Hentschel
Thomas Runkler
Steffen Udluft
OffRL
150
80
0
19 Oct 2016
Multi-Objective Deep Reinforcement Learning
Hossam Mossalam
Yannis Assael
D. Roijers
Shimon Whiteson
83
154
0
09 Oct 2016
Information-Theoretic Methods for Planning and Learning in Partially Observable Markov Decision Processes
Roy Fox
29
0
0
24 Sep 2016
Playing FPS Games with Deep Reinforcement Learning
Guillaume Lample
Devendra Singh Chaplot
OffRL
EgoV
100
588
0
18 Sep 2016
Interactive Spoken Content Retrieval by Deep Reinforcement Learning
Yen-Chen Wu
Tzu-Hsiang Lin
Pei-Hung Chung
Hung-yi Lee
Tsung-Hsien Wen
27
12
0
16 Sep 2016
Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks
Nicolas Usunier
Gabriel Synnaeve
Zeming Lin
Soumith Chintala
99
138
0
10 Sep 2016
Reward Augmented Maximum Likelihood for Neural Structured Prediction
Mohammad Norouzi
Samy Bengio
Zhiwen Chen
Navdeep Jaitly
M. Schuster
Yonghui Wu
Dale Schuurmans
118
253
0
01 Sep 2016
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Zachary Chase Lipton
Xiujun Li
Jianfeng Gao
Lihong Li
Faisal Ahmed
Li Deng
97
6
0
17 Aug 2016
Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay
Ionel-Alexandru Hosu
Traian Rebedea
88
97
0
18 Jul 2016
Deep Reinforcement Learning With Macro-Actions
Ishan Durugkar
Clemens Rosenbaum
S. Dernbach
Sridhar Mahadevan
56
25
0
15 Jun 2016
Policy Networks with Two-Stage Training for Dialogue Systems
Mehdi Fatemi
Layla El Asri
Hannes Schulz
Jing He
Kaheer Suleman
OffRL
88
108
0
10 Jun 2016
Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning
Tiancheng Zhao
M. Eskénazi
114
265
0
08 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
195
1,485
0
06 Jun 2016
Deep Reinforcement Learning Radio Control and Signal Detection with KeRLym, a Gym RL Agent
Tim O'Shea
T. Clancy
53
19
0
30 May 2016
Learning from the memory of Atari 2600
Jakub Sygnowski
Henryk Michalewski
116
12
0
04 May 2016
Classifying Options for Deep Reinforcement Learning
Kai Arulkumaran
Nat Dilokthanakul
Murray Shanahan
Anil Anthony Bharath
69
20
0
27 Apr 2016
A Deep Hierarchical Approach to Lifelong Learning in Minecraft
Chen Tessler
Shahar Givony
Tom Zahavy
D. Mankowitz
Shie Mannor
CLL
175
381
0
25 Apr 2016
Easy Monotonic Policy Iteration
Joshua Achiam
OffRL
49
0
0
29 Feb 2016
Learning values across many orders of magnitude
H. V. Hasselt
A. Guez
Matteo Hessel
Volodymyr Mnih
David Silver
88
170
0
24 Feb 2016
Deep Exploration via Bootstrapped DQN
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
127
1,315
0
15 Feb 2016
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks
Jakob N. Foerster
Yannis Assael
Nando de Freitas
Shimon Whiteson
85
147
0
08 Feb 2016
Graying the black box: Understanding DQNs
Tom Zahavy
Nir Ben-Zrihem
Shie Mannor
84
263
0
08 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
223
8,893
0
04 Feb 2016
Taming the Noise in Reinforcement Learning via Soft Updates
Roy Fox
Ari Pakman
Naftali Tishby
112
341
0
28 Dec 2015
Increasing the Action Gap: New Operators for Reinforcement Learning
Marc G. Bellemare
Georg Ostrovski
A. Guez
Philip S. Thomas
Rémi Munos
78
157
0
15 Dec 2015
How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies
Vincent François-Lavet
R. Fonteneau
D. Ernst
87
111
0
07 Dec 2015
Deep Attention Recurrent Q-Network
Ivan Sorokin
Alexey Seleznev
Mikhail Pavlov
A. Fedorov
Anastasiia Ignateva
75
152
0
05 Dec 2015
State of the Art Control of Atari Games Using Shallow Reinforcement Learning
Yitao Liang
Marlos C. Machado
Erik Talvitie
Michael Bowling
105
113
0
04 Dec 2015
Dueling Network Architectures for Deep Reinforcement Learning
Ziyun Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
112
3,780
0
20 Nov 2015
Policy Distillation
Andrei A. Rusu
Sergio Gomez Colmenarejo
Çağlar Gülçehre
Guillaume Desjardins
J. Kirkpatrick
Razvan Pascanu
Volodymyr Mnih
Koray Kavukcuoglu
R. Hadsell
137
698
0
19 Nov 2015
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
248
3,807
0
18 Nov 2015
Deep Reinforcement Learning in Parameterized Action Space
Matthew J. Hausknecht
Peter Stone
78
308
0
13 Nov 2015
Previous
1
2
3
...
44
45
46