Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.06461
Cited By
v1
v2
v3 (latest)
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
50 / 2,291 papers shown
Title
Coordinated Reinforcement Learning for Optimizing Mobile Networks
Maxime Bouton
Hasan Farooq
Julien Forgeat
Shruti Bothe
Meral Shirazipour
P. Karlsson
63
12
0
30 Sep 2021
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Romain Laroche
Rémi Tachet des Combes
91
8
0
29 Sep 2021
On the Estimation Bias in Double Q-Learning
Zhizhou Ren
Guangxiang Zhu
Haotian Hu
Beining Han
Jian-Hai Chen
Chongjie Zhang
78
17
0
29 Sep 2021
Deep Reinforcement Q-Learning for Intelligent Traffic Signal Control with Partial Detection
Romain Ducrocq
N. Farhi
55
15
0
29 Sep 2021
Exploratory State Representation Learning
Astrid Merckling
Nicolas Perrin-Gilbert
Alexandre Coninx
Stéphane Doncieux
OffRL
77
6
0
28 Sep 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
319
91
0
27 Sep 2021
Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms
Liyuan Zheng
Tanner Fiez
Zane Alumbaugh
Benjamin J. Chasnov
Lillian J. Ratliff
OffRL
99
42
0
25 Sep 2021
The
f
f
f
-Divergence Reinforcement Learning Framework
Chen Gong
Qiang He
Yunpeng Bai
Zhouyi Yang
Xiaoyu Chen
Xinwen Hou
Xianjie Zhang
Yu Liu
Guoliang Fan
68
3
0
24 Sep 2021
Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients
Baturay Saglam
Furkan B. Mutlu
Dogan C. Cicek
Suleyman S. Kozat
OffRL
53
3
0
24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
51
33
0
24 Sep 2021
ADVERSARIALuscator: An Adversarial-DRL Based Obfuscator and Metamorphic Malware SwarmGenerator
Mohit Sewak
S. K. Sahay
Hemant Rathore
AAML
51
8
0
23 Sep 2021
Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods
Baturay Saglam
Enes Duran
Dogan C. Cicek
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
73
12
0
22 Sep 2021
MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning
Qiang He
Yuxun Qu
Chen Gong
Xinwen Hou
OffRL
85
10
0
22 Sep 2021
Benchmarking Lane-changing Decision-making for Deep Reinforcement Learning
Junjie Wang
Qichao Zhang
Dongbin Zhao
OffRL
26
1
0
22 Sep 2021
Off-line approximate dynamic programming for the vehicle routing problem with a highly variable customer basis and stochastic demands
M. Dastpak
Fausto Errico
O. Jabali
57
6
0
21 Sep 2021
Generalization in Text-based Games via Hierarchical Reinforcement Learning
Yunqiu Xu
Meng Fang
Ling Chen
Yali Du
Chengqi Zhang
AI4CE
85
21
0
21 Sep 2021
A Survey of Text Games for Reinforcement Learning informed by Natural Language
P. Osborne
Heido Nomm
André Freitas
AI4CE
101
24
0
20 Sep 2021
Dual Behavior Regularized Reinforcement Learning
Chapman Siu
Jason M. Traish
R. Xu
OffRL
61
1
0
19 Sep 2021
RAPID-RL: A Reconfigurable Architecture with Preemptive-Exits for Efficient Deep-Reinforcement Learning
Adarsh Kosta
Malik Aqeel Anwar
Priyadarshini Panda
A. Raychowdhury
Kaushik Roy
30
4
0
16 Sep 2021
DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning
Daniel Seita
Abhinav Gopal
Zhao Mandi
John F. Canny
OffRL
OnRL
44
0
0
15 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
102
0
14 Sep 2021
An Empirical Comparison of Off-policy Prediction Learning Algorithms in the Four Rooms Environment
Sina Ghiassian
R. Sutton
AAML
OffRL
87
6
0
10 Sep 2021
Boosting Graph Search with Attention Network for Solving the General Orienteering Problem
Zongtao Liu
Jing Xu
Jintao Su
Tao Xiao
Yang Yang
35
1
0
10 Sep 2021
PowerGym: A Reinforcement Learning Environment for Volt-Var Control in Power Distribution Systems
Ting-Han Fan
Xian Yeow Lee
Yubo Wang
178
24
0
08 Sep 2021
A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions
Xiaocong Chen
L. Yao
Julian McAuley
Guanglin Zhou
Xianzhi Wang
AI4TS
79
62
0
08 Sep 2021
ADER:Adapting between Exploration and Robustness for Actor-Critic Methods
Bo Zhou
Kejiao Li
Hongsheng Zeng
Fan Wang
Hao Tian
OffRL
69
1
0
08 Sep 2021
Temporal Shift Reinforcement Learning
Deep Thomas
Tichakorn Wongpiromsarn
Ali Jannesari
OffRL
33
0
0
05 Sep 2021
An Exploration of Deep Learning Methods in Hungry Geese
Nikzad Khani
Matthew Kluska
27
0
0
05 Sep 2021
Event-Based Communication in Distributed Q-Learning
Daniel Jarne Ornia
M. Mazo
65
2
0
03 Sep 2021
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation
Tiantian Zhang
Xueqian Wang
Bin Liang
Bo Yuan
OffRL
80
18
0
01 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
100
84
0
01 Sep 2021
Phy-Q as a measure for physical reasoning intelligence
Cheng Xue
Vimukthini Pinto
C. Gamage
Ekaterina Nikonova
Peng Zhang
Jochen Renz
LRM
68
12
0
31 Aug 2021
A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning
Tianchi Cai
Wenpeng Zhang
Lihong Gu
Xiaodong Zeng
Jinjie Gu
21
0
0
29 Aug 2021
Autonomous Curiosity for Real-Time Training Onboard Robotic Agents
Ervin Teng
Bob Iannucci
50
6
0
29 Aug 2021
Flying Through a Narrow Gap Using End-to-end Deep Reinforcement Learning Augmented with Curriculum Learning and Sim2Real
Chenxi Xiao
Peng Lu
Qizhi He
33
35
0
29 Aug 2021
Reinforcement Learning based Condition-oriented Maintenance Scheduling for Flow Line Systems
Raphael Lamprecht
Ferdinand Wurst
Marco F. Huber
40
3
0
27 Aug 2021
WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving
Arjit Sharma
Sahil Sharma
25
3
0
27 Aug 2021
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges
Jiaju Qi
Qihao Zhou
Lei Lei
Kan Zheng
FedML
111
159
0
26 Aug 2021
Adversary agent reinforcement learning for pursuit-evasion
X. Huang
15
2
0
25 Aug 2021
No DBA? No regret! Multi-armed bandits for index tuning of analytical and HTAP workloads with provable guarantees
R. Perera
Bastian Oetomo
Benjamin I. P. Rubinstein
Renata Borovica-Gajic
48
8
0
23 Aug 2021
Personalized next-best action recommendation with multi-party interaction learning for automated decision-making
LongBing Cao
Chengzhang Zhu
34
9
0
19 Aug 2021
End-to-End Urban Driving by Imitating a Reinforcement Learning Coach
Zhejun Zhang
Alexander Liniger
Dengxin Dai
Feng Yu
Luc Van Gool
116
211
0
18 Aug 2021
Structured Outdoor Architecture Reconstruction by Exploration and Classification
Fuyang Zhang
Xiang Xu
Nelson Nauata
Yasutaka Furukawa
3DV
45
12
0
18 Aug 2021
Using Cyber Terrain in Reinforcement Learning for Penetration Testing
Rohit Gangupantulu
Tyler Cody
Paul Park
Abdul Rahman
Logan Eisenbeiser
Dan Radke
Ryan Clark
56
38
0
16 Aug 2021
Optimal Actor-Critic Policy with Optimized Training Datasets
C. Banerjee
Zhiyong Chen
N. Noman
M. Zamani
OffRL
66
7
0
16 Aug 2021
DQN Control Solution for KDD Cup 2021 City Brain Challenge
Yitian Chen
Kunlong Chen
Kunjin Chen
Lin Wang
31
0
0
14 Aug 2021
TDM: Trustworthy Decision-Making via Interpretability Enhancement
Daoming Lyu
Fangkai Yang
Hugh Kwon
Wen Dong
L. Yilmaz
Bo Liu
23
12
0
13 Aug 2021
DRQN-based 3D Obstacle Avoidance with a Limited Field of View
Yuán Chen
Guangda Chen
Lifan Pan
Jun Ma
Yu Zhang
Yanyong Zhang
Jianmin Ji
37
7
0
12 Aug 2021
Reinforcement Learning Approach to Active Learning for Image Classification
Thorben Werner
18
1
0
12 Aug 2021
Graph Attention Network-based Multi-agent Reinforcement Learning for Slicing Resource Management in Dense Cellular Network
Yan Shao
Rongpeng Li
Bing Hu
Yingxiao Wu
Zhifeng Zhao
Honggang Zhang
62
47
0
11 Aug 2021
Previous
1
2
3
...
22
23
24
...
44
45
46
Next