arXiv: 1509.06461
Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown
Autonomous robotic nanofabrication with reinforcement learning
Philipp Leinen
Malte Esders
Kristof T. Schütt
C. Wagner
K. Müller
F. Tautz
39
55
0
27 Feb 2020
Using a thousand optimization tasks to learn hyperparameter search strategies
Luke Metz
Niru Maheswaranathan
Ruoxi Sun
C. Freeman
Ben Poole
Jascha Narain Sohl-Dickstein
119
46
0
27 Feb 2020
Review, Analysis and Design of a Comprehensive Deep Reinforcement Learning Framework
Ngoc Duy Nguyen
Thanh Thi Nguyen
Hai V. Nguyen
Doug Creighton
S. Nahavandi
166
4
0
27 Feb 2020
A Visual Communication Map for Multi-Agent Deep Reinforcement Learning
Ngoc Duy Nguyen
Thanh Thi Nguyen
Doug Creighton
S. Nahavandi
44
5
0
27 Feb 2020
Learning Scalable Multi-Agent Coordination by Spatial Differentiation for Traffic Signal Control
Junjia Liu
Huimin Zhang
Zhuang Fu
Yao Wang
35
2
0
27 Feb 2020
CybORG: An Autonomous Cyber Operations Research Gym
Callum Baillie
Maxwell Standen
Jonathon Schwartz
Michael Docking
David Bowman
Junae Kim
42
30
0
25 Feb 2020
A Double Q-Learning Approach for Navigation of Aerial Vehicles with Connectivity Constraint
Behzad Khamidehi
E. Sousa
28
16
0
24 Feb 2020
How Transferable are the Representations Learned by Deep Q Agents?
Jacob Tyo
Zachary Chase Lipton
OffRL
65
6
0
24 Feb 2020
Disentangling Controllable Object through Video Prediction Improves Visual Reinforcement Learning
Yuanyi Zhong
Alex Schwing
Jian Peng
DRL
121
5
0
21 Feb 2020
Learning Dynamic Belief Graphs to Generalize on Text-Based Games
Ashutosh Adhikari
Xingdi Yuan
Marc-Alexandre Côté
M. Zelinka
Marc-Antoine Rondeau
Romain Laroche
Pascal Poupart
Jian Tang
Adam Trischler
William L. Hamilton
AI4CE
94
81
0
21 Feb 2020
Value-driven Hindsight Modelling
A. Guez
Fabio Viola
T. Weber
Lars Buesing
Steven Kapturowski
Doina Precup
David Silver
N. Heess
OffRL
83
12
0
19 Feb 2020
Informative Path Planning for Mobile Sensing with Reinforcement Learning
Yongyong Wei
Rong Zheng
81
34
0
18 Feb 2020
Kalman meets Bellman: Improving Policy Evaluation through Value Tracking
Shirli Di-Castro Shashua
Shie Mannor
OffRL
64
12
0
17 Feb 2020
Reinforcement learning for the privacy preservation and manipulation of eye tracking data
Wolfgang Fuhl
Efe Bozkir
Enkelejda Kasneci
55
1
0
17 Feb 2020
Reinforced active learning for image segmentation
Arantxa Casanova
Pedro H. O. Pinheiro
Negar Rostamzadeh
C. Pal
85
109
0
16 Feb 2020
First Order Constrained Optimization in Policy Space
Yiming Zhang
Q. Vuong
George Andriopoulos
46
4
0
16 Feb 2020
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
Qingfeng Lan
Yangchen Pan
Alona Fyshe
Martha White
73
180
0
16 Feb 2020
Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning
Navid Naderializadeh
J. Sydir
M. Simsek
Hosein Nikopour
79
128
0
14 Feb 2020
Fast Reinforcement Learning for Anti-jamming Communications
P. Ye
Yuan-Gen Wang
Jin Li
Liang Xiao
62
5
0
13 Feb 2020
Regret Bounds for Discounted MDPs
Shuang Liu
H. Su
OffRL
71
19
0
12 Feb 2020
Robot Navigation with Map-Based Deep Reinforcement Learning
Guangda Chen
Lifan Pan
Yuán Chen
Pei Xu
Zhiqiang Wang
Peichen Wu
Jianmin Ji
Xiaoping Chen
79
29
0
11 Feb 2020
AI Online Filters to Real World Image Recognition
Hai Xiao
Jin Shang
Mengyuan Huang
15
0
0
11 Feb 2020
Learning Structured Communication for Multi-agent Reinforcement Learning
Junjie Sheng
Xiangfeng Wang
Bo Jin
Junchi Yan
Wenhao Li
Tsung-Hui Chang
Jun Wang
H. Zha
44
52
0
11 Feb 2020
Discrete Action On-Policy Learning with Action-Value Critic
Yuguang Yue
Yunhao Tang
Mingzhang Yin
Mingyuan Zhou
OffRL
78
5
0
10 Feb 2020
Autonomous quadrotor obstacle avoidance based on dueling double deep recurrent Q-learning with monocular vision
Jiajun Ou
Xiao Guo
Ming Zhu
Wenjie Lou
56
32
0
10 Feb 2020
Evolution of a Complex Predator-Prey Ecosystem on Large-scale Multi-Agent Deep Reinforcement Learning
Jun Yamada
John Shawe-Taylor
Zafeirios Fountas
30
9
0
09 Feb 2020
Multi-task Reinforcement Learning with a Planning Quasi-Metric
Vincent Micheli
Karthigan Sinnathamby
François Fleuret
70
2
0
08 Feb 2020
Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling
Mu Yuan
Lan Zhang
Xiangyang Li
Hui Xiong
VLM
56
17
0
08 Feb 2020
Adaptive Approximate Policy Iteration
Botao Hao
N. Lazić
Yasin Abbasi-Yadkori
Pooria Joulani
Csaba Szepesvári
92
14
0
08 Feb 2020
BRPO: Batch Residual Policy Optimization
Kentaro Kanamori
Yinlam Chow
Takuya Takagi
Hiroki Arimura
Honglak Lee
Ken Kobayashi
Craig Boutilier
OffRL
236
45
0
08 Feb 2020
Dynamic Energy Dispatch Based on Deep Reinforcement Learning in IoT-Driven Smart Isolated Microgrids
Lei Lei
Yue Tan
Glenn Dahlenburg
W. Xiang
K. Zheng
76
71
0
07 Feb 2020
Transfer Heterogeneous Knowledge Among Peer-to-Peer Teammates: A Model Distillation Approach
Zeyue Xue
Shuang Luo
Chao-Xiang Wu
Pan Zhou
Kaigui Bian
Wei Du
46
4
0
06 Feb 2020
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making
C. Shi
Runzhe Wan
R. Song
Wenbin Lu
Ling Leng
82
39
0
05 Feb 2020
Prophet: Proactive Candidate-Selection for Federated Learning by Predicting the Qualities of Training and Reporting Phases
Huawei Huang
Kangying Lin
Song Guo
Pan Zhou
Zibin Zheng
39
7
0
03 Feb 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
367
1,710
0
02 Feb 2020
Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles
S. Aradi
290
461
0
30 Jan 2020
Multiple Access in Dynamic Cell-Free Networks: Outage Performance and Deep Reinforcement Learning-Based Design
Yasser F. Al-Eryani
Mohamed Akrout
Ekram Hossain
13
2
0
29 Jan 2020
Data-driven control of micro-climate in buildings: an event-triggered reinforcement learning approach
A. H. Hosseinloo
Alexander Ryzhov
A. Bischi
H. Ouerdane
K. Turitsyn
M. Dahleh
AI4CE
30
42
0
28 Jan 2020
RIS Enhanced Massive Non-orthogonal Multiple Access Networks: Deployment and Passive Beamforming Design
Xinyu Liu
Yuanwei Liu
Yue Chen
H. Vincent Poor
47
158
0
28 Jan 2020
Challenges and Countermeasures for Adversarial Attacks on Deep Reinforcement Learning
Inaam Ilahi
Muhammad Usama
Junaid Qadir
M. Janjua
Ala I. Al-Fuqaha
D. Hoang
Dusit Niyato
AAML
147
137
0
27 Jan 2020
Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning
Xi Liu
Li Li
Ping-Chun Hsieh
Muhe Xie
Yong Ge
Rui Chen
OffRL
49
3
0
27 Jan 2020
Stacked Auto Encoder Based Deep Reinforcement Learning for Online Resource Scheduling in Large-Scale MEC Networks
Feibo Jiang
Kezhi Wang
Li Dong
Cunhua Pan
Kun Yang
OffRL
60
39
0
24 Jan 2020
Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning
Jianyu Chen
Shengbo Eben Li
Masayoshi Tomizuka
148
244
0
23 Jan 2020
Contract-connection:An efficient communication protocol for Distributed Ledger Technology
Yibin Xu
Yangyu Huang
10
2
0
20 Jan 2020
FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using Human Feedback
Baicen Xiao
Qifan Lu
Bhaskar Ramasubramanian
Andrew Clark
L. Bushnell
Radha Poovendran
73
25
0
19 Jan 2020
PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning
Dor Livne
Kobi Cohen
61
52
0
14 Jan 2020
Multi-Robot Formation Control Using Reinforcement Learning
Abhay Rawat
K. Karlapalem
36
4
0
13 Jan 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
90
185
0
09 Jan 2020
EEG-based Drowsiness Estimation for Driving Safety using Deep Q-Learning
Yurui Ming
Dongrui Wu
Yu-Kai Wang
Yuhui Shi
Chin-Teng Lin
31
18
0
08 Jan 2020
Perception and Navigation in Autonomous Systems in the Era of Learning: A Survey
Yang Tang
Chaoqiang Zhao
Jianrui Wang
Chongzhen Zhang
Qiyu Sun
Weixing Zheng
W. Du
Feng Qian
Jürgen Kurths
157
76
0
08 Jan 2020