Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.06461
Cited By
v1
v2
v3 (latest)
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
50 / 2,291 papers shown
Title
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
Arushi Jain
Khimya Khetarpal
Doina Precup
70
27
0
21 Jul 2018
FuzzerGym: A Competitive Framework for Fuzzing and Learning
W. Drozd
Michael D. Wagner
66
32
0
19 Jul 2018
Discrete linear-complexity reinforcement learning in continuous action spaces for Q-learning algorithms
P. Tavallali
G. Doran
L. Mandrake
28
0
0
16 Jul 2018
Online Robust Policy Learning in the Presence of Unknown Adversaries
Aaron J. Havens
Zhanhong Jiang
Soumik Sarkar
AAML
115
44
0
16 Jul 2018
Video Summarisation by Classification with Deep Reinforcement Learning
Kaiyang Zhou
Tao Xiang
Andrea Cavallaro
OffRL
72
36
0
09 Jul 2018
Using Reinforcement Learning with Partial Vehicle Detection for Intelligent Traffic Signal Control
Rusheng Zhang
A. Ishikawa
Wenli Wang
Benjamin Striner
Ozan Tonguz
74
104
0
04 Jul 2018
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization
Xiangxiang Chu
101
9
0
02 Jul 2018
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
Dmitry Kalashnikov
A. Irpan
P. Pastor
Julian Ibarz
Alexander Herzog
...
Deirdre Quillen
E. Holly
Mrinal Kalakrishnan
Vincent Vanhoucke
Sergey Levine
175
1,473
0
27 Jun 2018
Learning-to-Ask: Knowledge Acquisition via 20 Questions
Yihong Chen
B. Chen
Xuguang Duan
Jian-Guang Lou
Yue Wang
Wenwu Zhu
Yong Cao
54
15
0
22 Jun 2018
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang
Nicolas Ballas
Joelle Pineau
CLL
OffRL
113
180
0
20 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
130
222
0
20 Jun 2018
Surprising Negative Results for Generative Adversarial Tree Search
Kamyar Azizzadenesheli
Brandon Yang
Weitang Liu
Zachary Chase Lipton
Anima Anandkumar
91
13
0
15 Jun 2018
Evolving simple programs for playing Atari games
Dennis G. Wilson
Sylvain Cussat-Blanc
H. Luga
J. Miller
68
62
0
14 Jun 2018
Implicit Quantile Networks for Distributional Reinforcement Learning
Will Dabney
Georg Ostrovski
David Silver
Rémi Munos
OffRL
144
534
0
14 Jun 2018
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network
Wenjia Meng
Qian Zheng
L. Yang
Pengfei Li
Gang Pan
45
21
0
14 Jun 2018
Structured Variational Learning of Bayesian Neural Networks with Horseshoe Priors
S. Ghosh
Jiayu Yao
Finale Doshi-Velez
BDL
UQCV
70
78
0
13 Jun 2018
Context-Aware Policy Reuse
Siyuan Li
Fangda Gu
Guangxiang Zhu
Chongjie Zhang
OffRL
159
37
0
11 Jun 2018
Learning to Search in Long Documents Using Document Structure
Mor Geva
Jonathan Berant
RALM
80
15
0
09 Jun 2018
Randomized Prior Functions for Deep Reinforcement Learning
Ian Osband
John Aslanides
Albin Cassirer
UQCV
BDL
90
380
0
08 Jun 2018
Automatic View Planning with Multi-scale Deep Reinforcement Learning Agents
A. Alansary
Loic Le Folgoc
G. Vaillant
Ozan Oktay
Yuanwei Li
...
Benjamin Hou
Jingyu Sun
Ben Glocker
Bernhard Kainz
Daniel Rueckert
3DV
OffRL
51
54
0
08 Jun 2018
Program Synthesis Through Reinforcement Learning Guided Tree Search
Riley Simmons-Edler
Anders Miltner
Sebastian Seung
129
11
0
08 Jun 2018
Randomized Value Functions via Multiplicative Normalizing Flows
Ahmed Touati
Harsh Satija
Joshua Romoff
Joelle Pineau
Pascal Vincent
67
36
0
06 Jun 2018
Building Advanced Dialogue Managers for Goal-Oriented Dialogue Systems
V. Ilievski
OffRL
21
4
0
03 Jun 2018
Between Progress and Potential Impact of AI: the Neglected Dimensions
Fernando Martínez-Plumed
S. Avin
Miles Brundage
Allan Dafoe
Seán Ó hÉigeartaigh
José Hernández-Orallo
55
3
0
02 Jun 2018
Deep Curiosity Search: Intra-Life Exploration Can Improve Performance on Challenging Deep Reinforcement Learning Problems
C. Stanton
Jeff Clune
LRM
62
41
0
01 Jun 2018
Inference Aided Reinforcement Learning for Incentive Mechanism Design in Crowdsourcing
Zehong Hu
Yitao Liang
Yang Liu
Jie Zhang
OffRL
60
24
0
01 Jun 2018
Fast Exploration with Simplified Models and Approximately Optimistic Planning in Model Based Reinforcement Learning
Ramtin Keramati
Jay Whang
Patrick Cho
Emma Brunskill
OffRL
90
7
0
01 Jun 2018
Sequential Attacks on Agents for Long-Term Adversarial Goals
E. Tretschk
Seong Joon Oh
Mario Fritz
OnRL
399
48
1
31 May 2018
Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update
Su Young Lee
Sung-Ik Choi
Sae-Young Chung
BDL
77
75
0
31 May 2018
Depth and nonlinearity induce implicit exploration for RL
Justas Dauparas
Ryota Tomioka
Katja Hofmann
OffRL
33
4
0
29 May 2018
Observe and Look Further: Achieving Consistent Performance on Atari
Tobias Pohlen
Bilal Piot
Todd Hester
M. G. Azar
Dan Horgan
...
John Quan
Mel Vecerík
Matteo Hessel
Rémi Munos
Olivier Pietquin
68
121
0
29 May 2018
Meta-Gradient Reinforcement Learning
Zhongwen Xu
H. V. Hasselt
David Silver
117
327
0
24 May 2018
Resource Allocation for a Wireless Coexistence Management System Based on Reinforcement Learning
Philip Soeffker
Dimitri Block
Nico Wiebusch
U. Meier
22
4
0
24 May 2018
Deep Reinforcement Learning For Sequence to Sequence Models
Yaser Keneshloo
Tian Shi
Naren Ramakrishnan
Chandan K. Reddy
AIMat
3DV
OffRL
92
211
0
24 May 2018
Unsupervised Video Object Segmentation for Deep Reinforcement Learning
Vikrant Goel
James Weng
Pascal Poupart
OCL
73
66
0
20 May 2018
Nonlinear Distributional Gradient Temporal-Difference Learning
Chao Qu
Shie Mannor
Huan Xu
90
12
0
20 May 2018
Learning Sampling Policies for Domain Adaptation
Yash J. Patel
Kashyap Chitta
Bhavan A. Jasani
27
9
0
19 May 2018
Episodic Memory Deep Q-Networks
Zichuan Lin
Tianqi Zhao
Guangwen Yang
Lintao Zhang
OffRL
61
87
0
19 May 2018
Hierarchical Reinforcement Learning with Deep Nested Agents
Marc Brittain
Peng Wei
BDL
38
1
0
18 May 2018
Language Expansion In Text-Based Games
Ghulam Ahmed Ansari
P. SagarJ.
A. Chandar
Balaraman Ravindran
LLMAG
37
8
0
17 May 2018
Optimized Computation Offloading Performance in Virtual Edge Computing Systems via Deep Reinforcement Learning
Xianfu Chen
Honggang Zhang
Celimuge Wu
S. Mao
Yusheng Ji
Medhi Bennis
OffRL
86
485
0
16 May 2018
Graph Signal Sampling via Reinforcement Learning
Oleksii Abramenko
A. Jung
43
5
0
15 May 2018
Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes
T. P. Le
Ngo Anh Vien
Abu Layek
TaeChoong Chung
53
52
0
11 May 2018
Stochastic Approximation for Risk-aware Markov Decision Processes
Wenjie Huang
W. Haskell
55
18
0
11 May 2018
Metatrace Actor-Critic: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control
K. Young
Baoxiang Wang
Matthew E. Taylor
OffRL
84
15
0
10 May 2018
Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog
Jiaping Zhang
Tiancheng Zhao
Zhou Yu
57
40
0
08 May 2018
Image Retrieval with Mixed Initiative and Multimodal Feedback
Nils Murrugarra-Llerena
Adriana Kovashka
53
14
0
08 May 2018
Ranking for Relevance and Display Preferences in Complex Presentation Layouts
Harrie Oosterhuis
Maarten de Rijke
45
27
0
07 May 2018
Crawling in Rogue's dungeons with (partitioned) A3C
Andrea Asperti
Daniele Cortesi
Francesco Sovrano
67
12
0
23 Apr 2018
State Distribution-aware Sampling for Deep Q-learning
Weichao Li
Fuxian Huang
Xi Li
G. Pan
Leilei Gan
TTA
46
4
0
23 Apr 2018
Previous
1
2
3
...
41
42
43
44
45
46
Next