Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.06461
Cited By
v1
v2
v3 (latest)
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
50 / 2,291 papers shown
Title
Proximal Policy Optimization via Enhanced Exploration Efficiency
Junwei Zhang
Zhenghao Zhang
Shuai Han
Shuai Lu
131
44
0
11 Nov 2020
Continual Learning of Control Primitives: Skill Discovery via Reset-Games
Kelvin Xu
Siddharth Verma
Chelsea Finn
Sergey Levine
CLL
83
34
0
10 Nov 2020
Trajectory Planning for Autonomous Vehicles Using Hierarchical Reinforcement Learning
Kaleb Ben Naveed
Zhiqian Qiao
John M. Dolan
45
58
0
09 Nov 2020
Behavior Planning at Urban Intersections through Hierarchical Reinforcement Learning
Zhiqian Qiao
J. Schneider
John M. Dolan
40
23
0
09 Nov 2020
Deep reinforcement learning for RAN optimization and control
Yu Chen
Jie Chen
G. Krishnamurthi
Huijing Yang
Huahui Wang
Wenjie Zhao
39
1
0
09 Nov 2020
Multi-Agent Reinforcement Learning for Channel Assignment and Power Allocation in Platoon-Based C-V2X Systems
Hung V. Vu
Mohammad Farzanullah
Zheyu Liu
D. Nguyen
R. Morawski
T. Le-Ngoc
38
15
0
09 Nov 2020
Explaining Deep Graph Networks with Molecular Counterfactuals
Danilo Numeroso
D. Bacciu
57
10
0
09 Nov 2020
Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-person Simulated 3D Environment
Wilka Carvalho
Anthony Liang
Kimin Lee
Sungryull Sohn
Honglak Lee
Richard L. Lewis
Satinder Singh
OffRL
56
9
0
28 Oct 2020
Learning to Represent Action Values as a Hypergraph on the Action Vertices
Arash Tavakoli
Mehdi Fatemi
Petar Kormushev
81
23
0
28 Oct 2020
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls
Jeongho Kim
Jaeuk Shin
Insoon Yang
61
35
0
27 Oct 2020
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Anurag Ajay
Aviral Kumar
Pulkit Agrawal
Sergey Levine
Ofir Nachum
OffRL
OnRL
103
160
0
26 Oct 2020
Towards Scale-Invariant Graph-related Problem Solving by Iterative Homogeneous Graph Neural Networks
Hao Tang
Zhiao Huang
Jiayuan Gu
Bao-Liang Lu
Hao Su
AI4CE
70
9
0
26 Oct 2020
Multi-Graph Tensor Networks
Yao Xu
Kriton Konstantinidis
Danilo P. Mandic
54
7
0
25 Oct 2020
Multi-UAV Path Planning for Wireless Data Harvesting with Deep Reinforcement Learning
Harald Bayerlein
Mirco Theile
Marco Caccamo
David Gesbert
90
124
0
23 Oct 2020
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking
Hongda Shen
Eren Kurshan
72
21
0
21 Oct 2020
Improving Generalization in Reinforcement Learning with Mixture Regularization
Kaixin Wang
Bingyi Kang
Jie Shao
Jiashi Feng
188
120
0
21 Oct 2020
Iterative Amortized Policy Optimization
Joseph Marino
Alexandre Piché
Alessandro Davide Ialongo
Yisong Yue
OffRL
117
21
0
20 Oct 2020
Implicit recurrent networks: A novel approach to stationary input processing with recurrent neural networks in deep learning
Sebastian Sanokowski
17
1
0
20 Oct 2020
Chance-Constrained Control with Lexicographic Deep Reinforcement Learning
Alessandro Giuseppi
A. Pietrabissa
33
7
0
19 Oct 2020
DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees
R. Perera
Bastian Oetomo
Benjamin I. P. Rubinstein
Renata Borovica-Gajic
54
32
0
19 Oct 2020
Softmax Deep Double Deterministic Policy Gradients
Ling Pan
Qingpeng Cai
Longbo Huang
118
93
0
19 Oct 2020
Average-reward model-free reinforcement learning: a systematic review and literature mapping
Vektor Dewanto
George Dunn
A. Eshragh
M. Gallagher
Fred Roosta
83
30
0
18 Oct 2020
Understanding Information Processing in Human Brain by Interpreting Machine Learning Models
Ilya Kuzovkin
HAI
24
2
0
17 Oct 2020
DOOM: A Novel Adversarial-DRL-Based Op-Code Level Metamorphic Malware Obfuscator for the Enhancement of IDS
Mohit Sewak
S. K. Sahay
Hemant Rathore
40
18
0
16 Oct 2020
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning with Intrinsic-Extrinsic Modeling
Xin Ye
Yezhou Yang
76
15
0
16 Oct 2020
A Nesterov's Accelerated quasi-Newton method for Global Routing using Deep Reinforcement Learning
S. Indrapriyadarsini
Shahrzad Mahboubi
H. Ninomiya
T. Kamio
H. Asai
26
5
0
15 Oct 2020
UAV Path Planning using Global and Local Map Information with Deep Reinforcement Learning
Mirco Theile
Harald Bayerlein
R. Nai
David Gesbert
Marco Caccamo
105
54
0
14 Oct 2020
Human-centric Dialog Training via Offline Reinforcement Learning
Natasha Jaques
J. Shen
Asma Ghandeharioun
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
86
96
0
12 Oct 2020
Deep Echo State Q-Network (DEQN) and Its Application in Dynamic Spectrum Sharing for 5G and Beyond
Hao-Hsuan Chang
Lingjia Liu
Yuhao Yi
73
47
0
12 Oct 2020
A DRL-based Multiagent Cooperative Control Framework for CAV Networks: a Graphic Convolution Q Network
Jiqian Dong
Sikai Chen
P. Ha
Yujie Li
Samuel Labi
74
37
0
12 Oct 2020
Distributed Resource Allocation with Multi-Agent Deep Reinforcement Learning for 5G-V2V Communication
Alperen Gündogan
H. Gürsu
V. Pauli
W. Kellerer
20
30
0
11 Oct 2020
Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning
Navid Naderializadeh
Fan Hung
S. Soleyman
D. Khosla
58
28
0
09 Oct 2020
Parameterized Reinforcement Learning for Optical System Optimization
H. Wankerl
M. L. Stern
Ali Mahdavi
C. Eichler
E. Lang
111
23
0
09 Oct 2020
Learning Value Functions in Deep Policy Gradients using Residual Variance
Yannis Flet-Berliac
Reda Ouhamma
Odalric-Ambrym Maillard
Philippe Preux
OffRL
55
1
0
09 Oct 2020
UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Tarun Gupta
Anuj Mahajan
Bei Peng
Wendelin Bohmer
Shimon Whiteson
OffRL
120
50
0
06 Oct 2020
Reinforcement Learning with Random Delays
Simon Ramstedt
Yann Bouteiller
Giovanni Beltrame
C. Pal
Jonathan Binas
227
61
0
06 Oct 2020
Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning
Rodrigo Toro Icarte
Toryn Q. Klassen
Richard Valenzano
Sheila A. McIlraith
OffRL
148
222
0
06 Oct 2020
Temporal Difference Uncertainties as a Signal for Exploration
Sebastian Flennerhag
Jane X. Wang
Pablo Sprechmann
Francesco Visin
Alexandre Galashov
Steven Kapturowski
Diana Borsa
N. Heess
André Barreto
Razvan Pascanu
OffRL
49
14
0
05 Oct 2020
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
192
876
0
05 Oct 2020
The act of remembering: a study in partially observable reinforcement learning
Rodrigo Toro Icarte
Richard Valenzano
Toryn Q. Klassen
Phillip J. K. Christoffersen
Amir-massoud Farahmand
Sheila A. McIlraith
OffRL
40
11
0
05 Oct 2020
Policy Learning Using Weak Supervision
Jingkang Wang
Hongyi Guo
Zhaowei Zhu
Yang Liu
OffRL
74
15
0
05 Oct 2020
Test-Cost Sensitive Methods for Identifying Nearby Points
Seung Gyu Hyun
Christopher Leung
34
0
0
04 Oct 2020
Disentangling causal effects for hierarchical reinforcement learning
Oriol Corcoll
Raul Vicente
CML
79
9
0
03 Oct 2020
Student-Initiated Action Advising via Advice Novelty
Ercüment Ilhan
Jeremy Gow
Diego Perez
37
9
0
01 Oct 2020
Bayesian Meta-reinforcement Learning for Traffic Signal Control
Yayi Zou
Zhiwei Qin
BDL
36
3
0
01 Oct 2020
Facilitating Connected Autonomous Vehicle Operations Using Space-weighted Information Fusion and Deep Reinforcement Learning Based Control
Jiqian Dong
Sikai Chen
Yujie Li
Runjia Du
Aaron Steinfeld
Samuel Labi
66
11
0
30 Sep 2020
Toolpath design for additive manufacturing using deep reinforcement learning
M. Mozaffar
Ablodghani Ebrahimi
Jian Cao
AI4CE
40
7
0
30 Sep 2020
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep Q-Network
Xing Wang
A. Vinel
35
0
0
29 Sep 2020
Finite-Time Analysis for Double Q-learning
Huaqing Xiong
Linna Zhao
Yingbin Liang
Wei Zhang
71
31
0
29 Sep 2020
Learning to Play against Any Mixture of Opponents
Max O. Smith
Thomas W. Anthony
Yongzhao Wang
Michael P. Wellman
OffRL
75
9
0
29 Sep 2020
Previous
1
2
3
...
28
29
30
...
44
45
46
Next