Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL
arXiv:1509.06461

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 947 papers shown
Title
M$^2$DQN: A Robust Method for Accelerating Deep Q-learning Network
Zhe Zhang
Yukun Zou
Junjie Lai
Qinglong Xu
18
4
0
16 Sep 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
32
17
0
16 Sep 2022
On the Reuse Bias in Off-Policy Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Dong Yan
Jun Zhu
OffRL
45
3
0
15 Sep 2022
MIXRTs: Toward Interpretable Multi-Agent Reinforcement Learning via Mixing Recurrent Soft Decision Trees
Zichuan Liu
Zhi Wang
Yuanyang Zhu
Chunlin Chen
66
5
0
15 Sep 2022
Ask Before You Act: Generalising to Novel Environments by Asking Questions
Ross Murphy
S. Mosesov
Javier Leguina Peral
Thymo ter Doest
LRM
32
0
0
10 Sep 2022
Task-Agnostic Learning to Accomplish New Tasks
Xianqi Zhang
Xingtao Wang
Xu Liu
Wenrui Wang
Xiaopeng Fan
Debin Zhao
OffRL
91
0
0
09 Sep 2022
Reward Delay Attacks on Deep Reinforcement Learning
Anindya Sarkar
Jiarui Feng
Yevgeniy Vorobeychik
Christopher Gill
Ning Zhang
AAML
13
6
0
08 Sep 2022
Prediction Based Decision Making for Autonomous Highway Driving
Mustafa Yildirim
Sajjad Mozaffari
Lucy McCutcheon
M. Dianati
Alireza Tamaddoni-Nezhad
Saber Fallah
18
7
0
05 Sep 2022
Model-Free Deep Reinforcement Learning in Software-Defined Networks
Luke Borchjes
Clement N. Nyirenda
L. Leenen
29
1
0
03 Sep 2022
Reinforcement Learning with Prior Policy Guidance for Motion Planning of Dual-Arm Free-Floating Space Robot
Yu-wen Cao
Shengjie Wang
Xiang Zheng
Wen-Xuan Ma
Xinru Xie
Lei Liu
13
25
0
03 Sep 2022
Distributed Ensembles of Reinforcement Learning Agents for Electricity Control
Pierrick Pochelu
S. Petiton
B. Conche
AI4CE
33
2
0
30 Aug 2022
An intelligent algorithmic trading based on a risk-return reinforcement learning algorithm
Boyin Jin
24
1
0
23 Aug 2022
Quantum Multi-Agent Meta Reinforcement Learning
Won Joon Yun
Jihong Park
Joongheon Kim
32
37
0
22 Aug 2022
A Review of the Convergence of 5G/6G Architecture and Deep Learning
O. Odeyomi
Olubiyi O. Akintade
T. Olowu
G. Záruba
AILaw
3DV
AI4TS
28
1
0
16 Aug 2022
Implicit Two-Tower Policies
Yunfan Zhao
Qingkai Pan
K. Choromanski
Deepali Jain
Vikas Sindhwani
OffRL
36
3
0
02 Aug 2022
DRL-M4MR: An Intelligent Multicast Routing Approach Based on DQN Deep Reinforcement Learning in SDN
Chenwei Zhao
Miao Ye
Xingsi Xue
Jianhui Lv
Qiuxiang Jiang
Yong Wang
27
17
0
31 Jul 2022
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Masatoshi Uehara
Haruka Kiyohara
Andrew Bennett
Victor Chernozhukov
Nan Jiang
Nathan Kallus
C. Shi
Wen Sun
OffRL
34
16
0
26 Jul 2022
Adaptive Decision Making at the Intersection for Autonomous Vehicles Based on Skill Discovery
Xianqi He
Ling Yang
Chao Lu
Zirui Li
Jian-wei Gong
29
1
0
24 Jul 2022
Graph-Structured Policy Learning for Multi-Goal Manipulation Tasks
David Klee
Ondrej Biza
Robert Platt
OffRL
32
1
0
22 Jul 2022
Romanus: Robust Task Offloading in Modular Multi-Sensor Autonomous Driving Systems
Luke Chen
Mohanad Odema
Mohammad Abdullah Al Faruque
32
4
0
18 Jul 2022
Robust AI Driving Strategy for Autonomous Vehicles
S. Nageshrao
Yousaf Rahman
V. Ivanovic
M. Janković
E. Tseng
M. Hafner
Dimitar Filev
44
4
0
16 Jul 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Stephen Marcus McAleer
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
Tuomas Sandholm
40
18
0
13 Jul 2022
Hindsight Learning for MDPs with Exogenous Inputs
Sean R. Sinclair
Felipe Vieira Frujeri
Ching-An Cheng
Luke Marshall
Hugo Barbalho
Jingling Li
Jennifer Neville
Ishai Menache
Adith Swaminathan
18
23
0
13 Jul 2022
Multi-objective Optimization of Notifications Using Offline Reinforcement Learning
Prakruthi Prabhakar
Yiping Yuan
Guangyu Yang
Wensheng Sun
A. Muralidharan
OffRL
28
6
0
07 Jul 2022
Offline RL Policies Should be Trained to be Adaptive
Dibya Ghosh
Anurag Ajay
Pulkit Agrawal
Sergey Levine
OffRL
35
45
0
05 Jul 2022
Robust Reinforcement Learning in Continuous Control Tasks with Uncertainty Set Regularization
Yuan Zhang
Jianhong Wang
Joschka Boedecker
43
3
0
05 Jul 2022
Asynchronous Curriculum Experience Replay: A Deep Reinforcement Learning Approach for UAV Autonomous Motion Control in Unknown Dynamic Environments
Zijian Hu
Xiao-guang Gao
Kaifang Wan
Qianglong Wang
Yiwei Zhai
45
10
0
04 Jul 2022
A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers
Julio A. Placed
Jared Strader
Henry Carrillo
Nikolay Atanasov
Vadim Indelman
Luca Carlone
J. A. Castellanos
35
178
0
01 Jul 2022
Conditionally Elicitable Dynamic Risk Measures for Deep Reinforcement Learning
Anthony Coache
S. Jaimungal
Á. Cartea
30
13
0
29 Jun 2022
Imitate then Transcend: Multi-Agent Optimal Execution with Dual-Window Denoise PPO
Jin Fang
Jiacheng Weng
Yi Xiang
Xinwen Zhang
OffRL
29
2
0
21 Jun 2022
Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration
Wenhui Huang
Cong Zhang
Jingda Wu
Xiangkun He
Jie Zhang
Chengqi Lv
32
8
0
20 Jun 2022
Cooperative Edge Caching via Multi Agent Reinforcement Learning in Fog Radio Access Networks
Qi Chang
Yanxiang Jiang
F. Zheng
M. Bennis
X. You
11
7
0
20 Jun 2022
A Universal Adversarial Policy for Text Classifiers
Gallil Maimon
Lior Rokach
AAML
19
10
0
19 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
27
69
0
16 Jun 2022
A Search-Based Testing Approach for Deep Reinforcement Learning Agents
Amirhossein Zolfagharian
Manel Abdellatif
Lionel C. Briand
M. Bagherzadeh
Ramesh S
55
27
0
15 Jun 2022
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Xiang Li
Jinghuan Shang
Srijan Das
Michael S. Ryoo
SSL
40
32
0
10 Jun 2022
AAM-Gym: Artificial Intelligence Testbed for Advanced Air Mobility
Marc Brittain
Luis E. Alvarez
Kara Breeden
Ian Jessen
28
8
0
09 Jun 2022
Goal-Space Planning with Subgoal Models
Chun-Ping Lo
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Gábor Mihucz
Farzane Aminmansour
Martha White
29
5
0
06 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
37
63
0
03 Jun 2022
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Kevin Esslinger
Robert Platt
Chris Amato
OffRL
35
36
0
02 Jun 2022
Watch Out for the Safety-Threatening Actors: Proactively Mitigating Safety Hazards
Saurabh Jha
Shengkun Cui
Zbigniew T. Kalbarczyk
Ravishankar Iyer
LLMSV
24
1
0
02 Jun 2022
Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning
Yinglun Xu
Qi Zeng
Gagandeep Singh
AAML
42
6
0
30 May 2022
Group-wise Reinforcement Feature Generation for Optimal and Explainable Representation Space Reconstruction
Dongjie Wang
Yanjie Fu
Kunpeng Liu
Xiaolin Li
Yan Solihin
37
30
0
28 May 2022
Reinforcement Learning for Branch-and-Bound Optimisation using Retrospective Trajectories
Christopher W. F. Parsonson
Alexandre Laterre
Thomas D. Barrett
34
19
0
28 May 2022
Automated Dynamic Algorithm Configuration
Steven Adriaensen
André Biedenkapp
Gresa Shala
Noor H. Awad
Theresa Eimer
Marius Lindauer
Frank Hutter
44
37
0
27 May 2022
Verifying Learning-Based Robotic Navigation Systems
Guy Amir
Davide Corsi
Raz Yerushalmi
Luca Marzari
D. Harel
Alessandro Farinelli
Guy Katz
94
37
0
26 May 2022
An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation
Shuyu Yin
Yaoyu Zhang
Peilin Liu
Z. Xu
31
2
0
25 May 2022
Deep Reinforcement Learning for Multi-class Imbalanced Training
Jenny Yang
Rasheed el-Bouri
Odhran O'Donoghue
Alexander S. Lachapelle
A. Soltan
David Clifton
OffRL
AI4CE
24
10
0
24 May 2022
MetaSlicing: A Novel Resource Allocation Framework for Metaverse
N. Chu
D. Hoang
Diep N. Nguyen
Khoa T. Phan
E. Dutkiewicz
Dusit Niyato
Tao Shu
55
46
0
23 May 2022
Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation
Qingfeng Lan
Yangchen Pan
Jun Luo
A. R. Mahmood
OffRL
39
8
0
22 May 2022