ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.06461
  4. Cited By
Deep Reinforcement Learning with Double Q-learning
v1v2v3 (latest)

Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown
Title
Proximal Policy Optimization via Enhanced Exploration Efficiency
Proximal Policy Optimization via Enhanced Exploration Efficiency
Junwei Zhang
Zhenghao Zhang
Shuai Han
Shuai Lu
131
44
0
11 Nov 2020
Continual Learning of Control Primitives: Skill Discovery via
  Reset-Games
Continual Learning of Control Primitives: Skill Discovery via Reset-Games
Kelvin Xu
Siddharth Verma
Chelsea Finn
Sergey Levine
CLL
83
34
0
10 Nov 2020
Trajectory Planning for Autonomous Vehicles Using Hierarchical
  Reinforcement Learning
Trajectory Planning for Autonomous Vehicles Using Hierarchical Reinforcement Learning
Kaleb Ben Naveed
Zhiqian Qiao
John M. Dolan
45
58
0
09 Nov 2020
Behavior Planning at Urban Intersections through Hierarchical
  Reinforcement Learning
Behavior Planning at Urban Intersections through Hierarchical Reinforcement Learning
Zhiqian Qiao
J. Schneider
John M. Dolan
40
23
0
09 Nov 2020
Deep reinforcement learning for RAN optimization and control
Deep reinforcement learning for RAN optimization and control
Yu Chen
Jie Chen
G. Krishnamurthi
Huijing Yang
Huahui Wang
Wenjie Zhao
39
1
0
09 Nov 2020
Multi-Agent Reinforcement Learning for Channel Assignment and Power
  Allocation in Platoon-Based C-V2X Systems
Multi-Agent Reinforcement Learning for Channel Assignment and Power Allocation in Platoon-Based C-V2X Systems
Hung V. Vu
Mohammad Farzanullah
Zheyu Liu
D. Nguyen
R. Morawski
T. Le-Ngoc
34
15
0
09 Nov 2020
Explaining Deep Graph Networks with Molecular Counterfactuals
Explaining Deep Graph Networks with Molecular Counterfactuals
Danilo Numeroso
D. Bacciu
57
10
0
09 Nov 2020
Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a
  First-person Simulated 3D Environment
Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-person Simulated 3D Environment
Wilka Carvalho
Anthony Liang
Kimin Lee
Sungryull Sohn
Honglak Lee
Richard L. Lewis
Satinder Singh
OffRL
56
9
0
28 Oct 2020
Learning to Represent Action Values as a Hypergraph on the Action
  Vertices
Learning to Represent Action Values as a Hypergraph on the Action Vertices
Arash Tavakoli
Mehdi Fatemi
Petar Kormushev
81
23
0
28 Oct 2020
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time
  Systems with Lipschitz Continuous Controls
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls
Jeongho Kim
Jaeuk Shin
Insoon Yang
61
35
0
27 Oct 2020
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement
  Learning
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Anurag Ajay
Aviral Kumar
Pulkit Agrawal
Sergey Levine
Ofir Nachum
OffRLOnRL
103
160
0
26 Oct 2020
Towards Scale-Invariant Graph-related Problem Solving by Iterative
  Homogeneous Graph Neural Networks
Towards Scale-Invariant Graph-related Problem Solving by Iterative Homogeneous Graph Neural Networks
Hao Tang
Zhiao Huang
Jiayuan Gu
Bao-Liang Lu
Hao Su
AI4CE
70
9
0
26 Oct 2020
Multi-Graph Tensor Networks
Multi-Graph Tensor Networks
Yao Xu
Kriton Konstantinidis
Danilo P. Mandic
54
7
0
25 Oct 2020
Multi-UAV Path Planning for Wireless Data Harvesting with Deep
  Reinforcement Learning
Multi-UAV Path Planning for Wireless Data Harvesting with Deep Reinforcement Learning
Harald Bayerlein
Mirco Theile
Marco Caccamo
David Gesbert
90
124
0
23 Oct 2020
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for
  Payment Fraud Systems in Retail Banking
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking
Hongda Shen
Eren Kurshan
72
21
0
21 Oct 2020
Improving Generalization in Reinforcement Learning with Mixture
  Regularization
Improving Generalization in Reinforcement Learning with Mixture Regularization
Kaixin Wang
Bingyi Kang
Jie Shao
Jiashi Feng
188
120
0
21 Oct 2020
Iterative Amortized Policy Optimization
Iterative Amortized Policy Optimization
Joseph Marino
Alexandre Piché
Alessandro Davide Ialongo
Yisong Yue
OffRL
117
21
0
20 Oct 2020
Implicit recurrent networks: A novel approach to stationary input
  processing with recurrent neural networks in deep learning
Implicit recurrent networks: A novel approach to stationary input processing with recurrent neural networks in deep learning
Sebastian Sanokowski
13
1
0
20 Oct 2020
Chance-Constrained Control with Lexicographic Deep Reinforcement
  Learning
Chance-Constrained Control with Lexicographic Deep Reinforcement Learning
Alessandro Giuseppi
A. Pietrabissa
33
7
0
19 Oct 2020
DBA bandits: Self-driving index tuning under ad-hoc, analytical
  workloads with safety guarantees
DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees
R. Perera
Bastian Oetomo
Benjamin I. P. Rubinstein
Renata Borovica-Gajic
54
32
0
19 Oct 2020
Softmax Deep Double Deterministic Policy Gradients
Softmax Deep Double Deterministic Policy Gradients
Ling Pan
Qingpeng Cai
Longbo Huang
118
93
0
19 Oct 2020
Average-reward model-free reinforcement learning: a systematic review
  and literature mapping
Average-reward model-free reinforcement learning: a systematic review and literature mapping
Vektor Dewanto
George Dunn
A. Eshragh
M. Gallagher
Fred Roosta
83
30
0
18 Oct 2020
Understanding Information Processing in Human Brain by Interpreting
  Machine Learning Models
Understanding Information Processing in Human Brain by Interpreting Machine Learning Models
Ilya Kuzovkin
HAI
24
2
0
17 Oct 2020
DOOM: A Novel Adversarial-DRL-Based Op-Code Level Metamorphic Malware
  Obfuscator for the Enhancement of IDS
DOOM: A Novel Adversarial-DRL-Based Op-Code Level Metamorphic Malware Obfuscator for the Enhancement of IDS
Mohit Sewak
S. K. Sahay
Hemant Rathore
36
18
0
16 Oct 2020
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning
  with Intrinsic-Extrinsic Modeling
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning with Intrinsic-Extrinsic Modeling
Xin Ye
Yezhou Yang
76
15
0
16 Oct 2020
A Nesterov's Accelerated quasi-Newton method for Global Routing using
  Deep Reinforcement Learning
A Nesterov's Accelerated quasi-Newton method for Global Routing using Deep Reinforcement Learning
S. Indrapriyadarsini
Shahrzad Mahboubi
H. Ninomiya
T. Kamio
H. Asai
26
5
0
15 Oct 2020
UAV Path Planning using Global and Local Map Information with Deep
  Reinforcement Learning
UAV Path Planning using Global and Local Map Information with Deep Reinforcement Learning
Mirco Theile
Harald Bayerlein
R. Nai
David Gesbert
Marco Caccamo
105
54
0
14 Oct 2020
Human-centric Dialog Training via Offline Reinforcement Learning
Human-centric Dialog Training via Offline Reinforcement Learning
Natasha Jaques
J. Shen
Asma Ghandeharioun
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
86
96
0
12 Oct 2020
Deep Echo State Q-Network (DEQN) and Its Application in Dynamic Spectrum
  Sharing for 5G and Beyond
Deep Echo State Q-Network (DEQN) and Its Application in Dynamic Spectrum Sharing for 5G and Beyond
Hao-Hsuan Chang
Lingjia Liu
Yuhao Yi
73
47
0
12 Oct 2020
A DRL-based Multiagent Cooperative Control Framework for CAV Networks: a
  Graphic Convolution Q Network
A DRL-based Multiagent Cooperative Control Framework for CAV Networks: a Graphic Convolution Q Network
Jiqian Dong
Sikai Chen
P. Ha
Yujie Li
Samuel Labi
74
37
0
12 Oct 2020
Distributed Resource Allocation with Multi-Agent Deep Reinforcement
  Learning for 5G-V2V Communication
Distributed Resource Allocation with Multi-Agent Deep Reinforcement Learning for 5G-V2V Communication
Alperen Gündogan
H. Gürsu
V. Pauli
W. Kellerer
16
30
0
11 Oct 2020
Graph Convolutional Value Decomposition in Multi-Agent Reinforcement
  Learning
Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning
Navid Naderializadeh
Fan Hung
S. Soleyman
D. Khosla
58
28
0
09 Oct 2020
Parameterized Reinforcement Learning for Optical System Optimization
Parameterized Reinforcement Learning for Optical System Optimization
H. Wankerl
M. L. Stern
Ali Mahdavi
C. Eichler
E. Lang
111
23
0
09 Oct 2020
Learning Value Functions in Deep Policy Gradients using Residual
  Variance
Learning Value Functions in Deep Policy Gradients using Residual Variance
Yannis Flet-Berliac
Reda Ouhamma
Odalric-Ambrym Maillard
Philippe Preux
OffRL
51
1
0
09 Oct 2020
UneVEn: Universal Value Exploration for Multi-Agent Reinforcement
  Learning
UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Tarun Gupta
Anuj Mahajan
Bei Peng
Wendelin Bohmer
Shimon Whiteson
OffRL
120
50
0
06 Oct 2020
Reinforcement Learning with Random Delays
Reinforcement Learning with Random Delays
Simon Ramstedt
Yann Bouteiller
Giovanni Beltrame
C. Pal
Jonathan Binas
227
61
0
06 Oct 2020
Reward Machines: Exploiting Reward Function Structure in Reinforcement
  Learning
Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning
Rodrigo Toro Icarte
Toryn Q. Klassen
Richard Valenzano
Sheila A. McIlraith
OffRL
148
222
0
06 Oct 2020
Temporal Difference Uncertainties as a Signal for Exploration
Temporal Difference Uncertainties as a Signal for Exploration
Sebastian Flennerhag
Jane X. Wang
Pablo Sprechmann
Francesco Visin
Alexandre Galashov
Steven Kapturowski
Diana Borsa
N. Heess
André Barreto
Razvan Pascanu
OffRL
49
14
0
05 Oct 2020
Mastering Atari with Discrete World Models
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
192
875
0
05 Oct 2020
The act of remembering: a study in partially observable reinforcement
  learning
The act of remembering: a study in partially observable reinforcement learning
Rodrigo Toro Icarte
Richard Valenzano
Toryn Q. Klassen
Phillip J. K. Christoffersen
Amir-massoud Farahmand
Sheila A. McIlraith
OffRL
40
11
0
05 Oct 2020
Policy Learning Using Weak Supervision
Policy Learning Using Weak Supervision
Jingkang Wang
Hongyi Guo
Zhaowei Zhu
Yang Liu
OffRL
74
15
0
05 Oct 2020
Test-Cost Sensitive Methods for Identifying Nearby Points
Test-Cost Sensitive Methods for Identifying Nearby Points
Seung Gyu Hyun
Christopher Leung
34
0
0
04 Oct 2020
Disentangling causal effects for hierarchical reinforcement learning
Disentangling causal effects for hierarchical reinforcement learning
Oriol Corcoll
Raul Vicente
CML
79
9
0
03 Oct 2020
Student-Initiated Action Advising via Advice Novelty
Student-Initiated Action Advising via Advice Novelty
Ercüment Ilhan
Jeremy Gow
Diego Perez
37
9
0
01 Oct 2020
Bayesian Meta-reinforcement Learning for Traffic Signal Control
Bayesian Meta-reinforcement Learning for Traffic Signal Control
Yayi Zou
Zhiwei Qin
BDL
32
3
0
01 Oct 2020
Facilitating Connected Autonomous Vehicle Operations Using
  Space-weighted Information Fusion and Deep Reinforcement Learning Based
  Control
Facilitating Connected Autonomous Vehicle Operations Using Space-weighted Information Fusion and Deep Reinforcement Learning Based Control
Jiqian Dong
Sikai Chen
Yujie Li
Runjia Du
Aaron Steinfeld
Samuel Labi
66
11
0
30 Sep 2020
Toolpath design for additive manufacturing using deep reinforcement
  learning
Toolpath design for additive manufacturing using deep reinforcement learning
M. Mozaffar
Ablodghani Ebrahimi
Jian Cao
AI4CE
40
7
0
30 Sep 2020
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep
  Q-Network
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep Q-Network
Xing Wang
A. Vinel
35
0
0
29 Sep 2020
Finite-Time Analysis for Double Q-learning
Finite-Time Analysis for Double Q-learning
Huaqing Xiong
Linna Zhao
Yingbin Liang
Wei Zhang
71
31
0
29 Sep 2020
Learning to Play against Any Mixture of Opponents
Learning to Play against Any Mixture of Opponents
Max O. Smith
Thomas W. Anthony
Yongzhao Wang
Michael P. Wellman
OffRL
75
9
0
29 Sep 2020
Previous
123...282930...444546
Next