Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.06461
Cited By
v1
v2
v3 (latest)
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
50 / 2,291 papers shown
Title
Deep Reinforcement Learning for Inquiry Dialog Policies with Logical Formula Embeddings
Takuya Hiraoka
Masaaki Tsuchida
Yotaro Watanabe
28
0
0
02 Aug 2017
A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
108
1,510
0
21 Jul 2017
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
79
231
0
17 Jul 2017
Lenient Multi-Agent Deep Reinforcement Learning
Gregory Palmer
K. Tuyls
D. Bloembergen
Rahul Savani
82
160
0
14 Jul 2017
Distral: Robust Multitask Reinforcement Learning
Yee Whye Teh
V. Bapst
Wojciech M. Czarnecki
John Quan
J. Kirkpatrick
R. Hadsell
N. Heess
Razvan Pascanu
216
553
0
13 Jul 2017
Learning Heuristic Search via Imitation
M. Bhardwaj
Sanjiban Choudhury
Sebastian Scherer
71
83
0
10 Jul 2017
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
89
107
0
06 Jul 2017
Hashing over Predicted Future Frames for Informed Exploration of Deep Reinforcement Learning
Haiyan Yin
Jianda Chen
Sinno Jialin Pan
51
5
0
03 Jul 2017
Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management
Pei-hao Su
Paweł Budzianowski
Stefan Ultes
Milica Gasic
S. Young
OffRL
136
130
0
01 Jul 2017
Noisy Networks for Exploration
Meire Fortunato
M. G. Azar
Bilal Piot
Jacob Menick
Ian Osband
...
Rémi Munos
Demis Hassabis
Olivier Pietquin
Charles Blundell
Shane Legg
116
897
0
30 Jun 2017
Towards Monocular Vision based Obstacle Avoidance through Deep Reinforcement Learning
Linhai Xie
Sen Wang
Andrew Markham
A. Trigoni
54
173
0
29 Jun 2017
Count-Based Exploration in Feature Space for Reinforcement Learning
Jarryd Martin
S. N. Sasikumar
Tom Everitt
Marcus Hutter
76
124
0
25 Jun 2017
Robust and Efficient Transfer Learning with Hidden-Parameter Markov Decision Processes
Taylor W. Killian
Samuel Daulton
George Konidaris
Finale Doshi-Velez
117
106
0
20 Jun 2017
Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning
Nick Erickson
Qi Zhao
CLL
OffRL
422
2
0
19 Jun 2017
Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics
Ken Kansky
Tom Silver
David A. Mély
Mohamed Eldawy
Miguel Lazaro-Gredilla
Xinghua Lou
N. Dorfman
Szymon Sidor
Scott Phoenix
Dileep George
AI4CE
122
236
0
14 Jun 2017
Hybrid Reward Architecture for Reinforcement Learning
H. V. Seijen
Mehdi Fatemi
Joshua Romoff
Romain Laroche
Tavian Barnes
Jeffrey Tsang
100
253
0
13 Jun 2017
UCB Exploration via Q-Ensembles
Richard Y. Chen
Szymon Sidor
Pieter Abbeel
John Schulman
OffRL
87
6
0
05 Jun 2017
The Atari Grand Challenge Dataset
Vitaly Kurin
Sebastian Nowozin
Katja Hofmann
Lucas Beyer
Bastian Leibe
OffRL
86
45
0
31 May 2017
Sequential Dynamic Decision Making with Deep Neural Nets on a Test-Time Budget
Henghui Zhu
Feng Nan
I. Paschalidis
Venkatesh Saligrama
11
2
0
31 May 2017
Continuous State-Space Models for Optimal Sepsis Treatment - a Deep Reinforcement Learning Approach
Aniruddh Raghu
Matthieu Komorowski
Leo Anthony Celi
Peter Szolovits
Marzyeh Ghassemi
OffRL
58
193
0
23 May 2017
Shallow Updates for Deep Reinforcement Learning
Nir Levine
Tom Zahavy
D. Mankowitz
Aviv Tamar
Shie Mannor
OffRL
72
48
0
21 May 2017
Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning
Sahil Sharma
A. Suresh
Rahul Ramesh
Balaraman Ravindran
OffRL
56
36
0
20 May 2017
Discrete Sequential Prediction of Continuous Actions for Deep RL
Luke Metz
Julian Ibarz
Navdeep Jaitly
James Davidson
BDL
OffRL
90
120
0
14 May 2017
Deep Episodic Value Iteration for Model-based Meta-Reinforcement Learning
Steven Hansen
OffRL
VLM
38
5
0
09 May 2017
Reinforcement Learning with External Knowledge and Two-Stage Q-functions for Predicting Popular Reddit Threads
Ji He
Mari Ostendorf
Xiaodong He
OffRL
LRM
45
10
0
20 Apr 2017
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
A. Gruslys
Will Dabney
M. G. Azar
Bilal Piot
Marc G. Bellemare
Rémi Munos
76
58
0
15 Apr 2017
Deep Q-learning from Demonstrations
Todd Hester
Matej Vecerík
Olivier Pietquin
Marc Lanctot
Tom Schaul
...
Gabriel Dulac-Arnold
Ian Osband
J. Agapiou
Joel Z Leibo
A. Gruslys
OffRL
94
157
0
12 Apr 2017
Deep Deterministic Policy Gradient for Urban Traffic Light Control
Noe Casas
79
168
0
27 Mar 2017
Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs
Michael Gygli
Mohammad Norouzi
A. Angelova
TDI
147
68
0
13 Mar 2017
Micro-Objective Learning : Accelerating Deep Reinforcement Learning through the Discovery of Continuous Subgoals
Sungtae Lee
Sang-Woo Lee
Jinyoung Choi
Donghyun Kwak
Byoung-Tak Zhang
54
2
0
11 Mar 2017
Deep Robust Kalman Filter
Shirli Di-Castro Shashua
Shie Mannor
BDL
79
28
0
07 Mar 2017
Neural Episodic Control
Alexander Pritzel
Benigno Uria
Sriram Srinivasan
A. Badia
Oriol Vinyals
Demis Hassabis
Daan Wierstra
Charles Blundell
OffRL
BDL
113
346
0
06 Mar 2017
Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Joshua Achiam
S. Shankar Sastry
80
238
0
06 Mar 2017
Count-Based Exploration with Neural Density Models
Georg Ostrovski
Marc G. Bellemare
Aaron van den Oord
Rémi Munos
102
626
0
03 Mar 2017
Scaffolding Networks: Incremental Learning and Teaching Through Questioning
Asli Celikyilmaz
Li Deng
Lihong Li
Chong-Jun Wang
LRM
CLL
48
2
0
28 Feb 2017
Learning Control for Air Hockey Striking using Deep Reinforcement Learning
Ayal Taitler
N. Shimkin
55
10
0
26 Feb 2017
Online Meta-learning by Parallel Algorithm Competition
Stefan Elfwing
E. Uchibe
Kenji Doya
73
22
0
24 Feb 2017
Beating the World's Best at Super Smash Bros. with Deep Reinforcement Learning
Vlad Firoiu
William F. Whitney
J. Tenenbaum
94
36
0
21 Feb 2017
Learning to Multi-Task by Active Sampling
Sahil Sharma
Ashutosh Jha
Parikshit Hegde
Balaraman Ravindran
151
21
0
20 Feb 2017
Sigmoid-Weighted Linear Units for Neural Network Function Approximation in Reinforcement Learning
Stefan Elfwing
E. Uchibe
Kenji Doya
145
1,761
0
10 Feb 2017
Deep Reinforcement Learning for Robotic Manipulation-The state of the art
S. Amarjyoti
53
65
0
31 Jan 2017
Learning Light Transport the Reinforced Way
Ken Dahm
A. Keller
75
64
0
25 Jan 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
346
1,549
0
25 Jan 2017
Reinforcement Learning via Recurrent Convolutional Neural Networks
Tanmay Shankar
S. K. Dwivedy
Prithwijit Guha
SSL
44
20
0
09 Jan 2017
Learning to predict where to look in interactive environments using deep recurrent q-learning
Seyed Sajad Mousavi
Michael Schukat
Enda Howley
Ali Borji
N. Mozayani
59
31
0
17 Dec 2016
Deep Reinforcement Learning with Successor Features for Navigation across Similar Environments
Jingwei Zhang
Jost Tobias Springenberg
Joschka Boedecker
Wolfram Burgard
85
295
0
16 Dec 2016
Deep Learning of Robotic Tasks without a Simulator using Strong and Weak Human Supervision
Bar Hilleli
Ran El-Yaniv
45
2
0
04 Dec 2016
Overcoming catastrophic forgetting in neural networks
J. Kirkpatrick
Razvan Pascanu
Neil C. Rabinowitz
J. Veness
Guillaume Desjardins
...
A. Grabska-Barwinska
Demis Hassabis
Claudia Clopath
D. Kumaran
R. Hadsell
CLL
389
7,619
0
02 Dec 2016
Transfer Learning Across Patient Variations with Hidden Parameter Markov Decision Processes
Taylor W. Killian
George Konidaris
Finale Doshi-Velez
OOD
44
9
0
01 Dec 2016
Improving Policy Gradient by Exploring Under-appreciated Rewards
Ofir Nachum
Mohammad Norouzi
Dale Schuurmans
106
44
0
28 Nov 2016
Previous
1
2
3
...
44
45
46
Next