ResearchTrend.AI
Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown
Coordinated Reinforcement Learning for Optimizing Mobile Networks
Maxime Bouton
Hasan Farooq
Julien Forgeat
Shruti Bothe
Meral Shirazipour
P. Karlsson
63
12
0
30 Sep 2021
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Romain Laroche
Rémi Tachet des Combes
91
8
0
29 Sep 2021
On the Estimation Bias in Double Q-Learning
Zhizhou Ren
Guangxiang Zhu
Haotian Hu
Beining Han
Jian-Hai Chen
Chongjie Zhang
78
17
0
29 Sep 2021
Deep Reinforcement Q-Learning for Intelligent Traffic Signal Control with Partial Detection
Romain Ducrocq
N. Farhi
55
15
0
29 Sep 2021
Exploratory State Representation Learning
Astrid Merckling
Nicolas Perrin-Gilbert
Alexandre Coninx
Stéphane Doncieux
OffRL
77
6
0
28 Sep 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktäschel
OffRL
319
91
0
27 Sep 2021
Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms
Liyuan Zheng
Tanner Fiez
Zane Alumbaugh
Benjamin J. Chasnov
Lillian J. Ratliff
OffRL
99
42
0
25 Sep 2021
The $f$-Divergence Reinforcement Learning Framework
Chen Gong
Qiang He
Yunpeng Bai
Zhouyi Yang
Xiaoyu Chen
Xinwen Hou
Xianjie Zhang
Yu Liu
Guoliang Fan
68
3
0
24 Sep 2021
Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients
Baturay Saglam
Furkan B. Mutlu
Dogan C. Cicek
Suleyman S. Kozat
OffRL
53
3
0
24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
51
33
0
24 Sep 2021
ADVERSARIALuscator: An Adversarial-DRL Based Obfuscator and Metamorphic Malware SwarmGenerator
Mohit Sewak
S. K. Sahay
Hemant Rathore
AAML
51
8
0
23 Sep 2021
Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods
Baturay Saglam
Enes Duran
Dogan C. Cicek
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
73
12
0
22 Sep 2021
MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning
Qiang He
Yuxun Qu
Chen Gong
Xinwen Hou
OffRL
85
10
0
22 Sep 2021
Benchmarking Lane-changing Decision-making for Deep Reinforcement Learning
Junjie Wang
Qichao Zhang
Dongbin Zhao
OffRL
26
1
0
22 Sep 2021
Off-line approximate dynamic programming for the vehicle routing problem with a highly variable customer basis and stochastic demands
M. Dastpak
Fausto Errico
O. Jabali
57
6
0
21 Sep 2021
Generalization in Text-based Games via Hierarchical Reinforcement Learning
Yunqiu Xu
Meng Fang
Ling Chen
Yali Du
Chengqi Zhang
AI4CE
85
21
0
21 Sep 2021
A Survey of Text Games for Reinforcement Learning informed by Natural Language
P. Osborne
Heido Nomm
André Freitas
AI4CE
101
24
0
20 Sep 2021
Dual Behavior Regularized Reinforcement Learning
Chapman Siu
Jason M. Traish
R. Xu
OffRL
61
1
0
19 Sep 2021
RAPID-RL: A Reconfigurable Architecture with Preemptive-Exits for Efficient Deep-Reinforcement Learning
Adarsh Kosta
Malik Aqeel Anwar
Priyadarshini Panda
A. Raychowdhury
Kaushik Roy
30
4
0
16 Sep 2021
DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning
Daniel Seita
Abhinav Gopal
Zhao Mandi
John F. Canny
OffRL, OnRL
44
0
0
15 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
102
0
14 Sep 2021
An Empirical Comparison of Off-policy Prediction Learning Algorithms in the Four Rooms Environment
Sina Ghiassian
R. Sutton
AAML, OffRL
87
6
0
10 Sep 2021
Boosting Graph Search with Attention Network for Solving the General Orienteering Problem
Zongtao Liu
Jing Xu
Jintao Su
Tao Xiao
Yang Yang
35
1
0
10 Sep 2021
PowerGym: A Reinforcement Learning Environment for Volt-Var Control in Power Distribution Systems
Ting-Han Fan
Xian Yeow Lee
Yubo Wang
178
24
0
08 Sep 2021
A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions
Xiaocong Chen
L. Yao
Julian McAuley
Guanglin Zhou
Xianzhi Wang
AI4TS
79
62
0
08 Sep 2021
ADER: Adapting between Exploration and Robustness for Actor-Critic Methods
Bo Zhou
Kejiao Li
Hongsheng Zeng
Fan Wang
Hao Tian
OffRL
69
1
0
08 Sep 2021
Temporal Shift Reinforcement Learning
Deep Thomas
Tichakorn Wongpiromsarn
Ali Jannesari
OffRL
33
0
0
05 Sep 2021
An Exploration of Deep Learning Methods in Hungry Geese
Nikzad Khani
Matthew Kluska
27
0
0
05 Sep 2021
Event-Based Communication in Distributed Q-Learning
Daniel Jarne Ornia
M. Mazo
65
2
0
03 Sep 2021
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation
Tiantian Zhang
Xueqian Wang
Bin Liang
Bo Yuan
OffRL
80
18
0
01 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
100
84
0
01 Sep 2021
Phy-Q as a measure for physical reasoning intelligence
Cheng Xue
Vimukthini Pinto
C. Gamage
Ekaterina Nikonova
Peng Zhang
Jochen Renz
LRM
68
12
0
31 Aug 2021
A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning
Tianchi Cai
Wenpeng Zhang
Lihong Gu
Xiaodong Zeng
Jinjie Gu
21
0
0
29 Aug 2021
Autonomous Curiosity for Real-Time Training Onboard Robotic Agents
Ervin Teng
Bob Iannucci
50
6
0
29 Aug 2021
Flying Through a Narrow Gap Using End-to-end Deep Reinforcement Learning Augmented with Curriculum Learning and Sim2Real
Chenxi Xiao
Peng Lu
Qizhi He
33
35
0
29 Aug 2021
Reinforcement Learning based Condition-oriented Maintenance Scheduling for Flow Line Systems
Raphael Lamprecht
Ferdinand Wurst
Marco F. Huber
40
3
0
27 Aug 2021
WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving
Arjit Sharma
Sahil Sharma
25
3
0
27 Aug 2021
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges
Jiaju Qi
Qihao Zhou
Lei Lei
Kan Zheng
FedML
111
159
0
26 Aug 2021
Adversary agent reinforcement learning for pursuit-evasion
X. Huang
15
2
0
25 Aug 2021
No DBA? No regret! Multi-armed bandits for index tuning of analytical and HTAP workloads with provable guarantees
R. Perera
Bastian Oetomo
Benjamin I. P. Rubinstein
Renata Borovica-Gajic
48
8
0
23 Aug 2021
Personalized next-best action recommendation with multi-party interaction learning for automated decision-making
LongBing Cao
Chengzhang Zhu
34
9
0
19 Aug 2021
End-to-End Urban Driving by Imitating a Reinforcement Learning Coach
Zhejun Zhang
Alexander Liniger
Dengxin Dai
Feng Yu
Luc Van Gool
116
211
0
18 Aug 2021
Structured Outdoor Architecture Reconstruction by Exploration and Classification
Fuyang Zhang
Xiang Xu
Nelson Nauata
Yasutaka Furukawa
3DV
45
12
0
18 Aug 2021
Using Cyber Terrain in Reinforcement Learning for Penetration Testing
Rohit Gangupantulu
Tyler Cody
Paul Park
Abdul Rahman
Logan Eisenbeiser
Dan Radke
Ryan Clark
56
38
0
16 Aug 2021
Optimal Actor-Critic Policy with Optimized Training Datasets
C. Banerjee
Zhiyong Chen
N. Noman
M. Zamani
OffRL
66
7
0
16 Aug 2021
DQN Control Solution for KDD Cup 2021 City Brain Challenge
Yitian Chen
Kunlong Chen
Kunjin Chen
Lin Wang
31
0
0
14 Aug 2021
TDM: Trustworthy Decision-Making via Interpretability Enhancement
Daoming Lyu
Fangkai Yang
Hugh Kwon
Wen Dong
L. Yilmaz
Bo Liu
23
12
0
13 Aug 2021
DRQN-based 3D Obstacle Avoidance with a Limited Field of View
Yuán Chen
Guangda Chen
Lifan Pan
Jun Ma
Yu Zhang
Yanyong Zhang
Jianmin Ji
37
7
0
12 Aug 2021
Reinforcement Learning Approach to Active Learning for Image Classification
Thorben Werner
18
1
0
12 Aug 2021
Graph Attention Network-based Multi-agent Reinforcement Learning for Slicing Resource Management in Dense Cellular Network
Yan Shao
Rongpeng Li
Bing Hu
Yingxiao Wu
Zhifeng Zhao
Honggang Zhang
62
47
0
11 Aug 2021