ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.06461
  4. Cited By
Deep Reinforcement Learning with Double Q-learning

Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL
ArXivPDFHTML

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 933 papers shown
Title
Reinforcement Learning for Sampling on Temporal Medical Imaging
  Sequences
Reinforcement Learning for Sampling on Temporal Medical Imaging Sequences
Zhishen Huang
36
1
0
28 Aug 2023
Learning Cyber Defence Tactics from Scratch with Multi-Agent
  Reinforcement Learning
Learning Cyber Defence Tactics from Scratch with Multi-Agent Reinforcement Learning
Jacob Wiebe
Ranwa Al Mallah
Li Li
AAML
38
3
0
25 Aug 2023
CoMIX: A Multi-agent Reinforcement Learning Training Architecture for
  Efficient Decentralized Coordination and Independent Decision-Making
CoMIX: A Multi-agent Reinforcement Learning Training Architecture for Efficient Decentralized Coordination and Independent Decision-Making
Giovanni Minelli
Mirco Musolesi
40
0
0
21 Aug 2023
Deep Reinforcement Learning for Artificial Upwelling Energy Management
Deep Reinforcement Learning for Artificial Upwelling Energy Management
Yiyuan Zhang
Wei Fan
20
3
0
20 Aug 2023
SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning
  Agents
SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning Agents
Amirhossein Zolfagharian
Manel Abdellatif
Lionel C. Briand
S. Ramesh
37
5
0
03 Aug 2023
Computation Offloading with Multiple Agents in Edge-Computing-Supported
  IoT
Computation Offloading with Multiple Agents in Edge-Computing-Supported IoT
Shihao Shen
Yiwen Han
Xiaofei Wang
Yan Wang
OffRL
23
79
0
01 Aug 2023
Reinforcement learning guided fuzz testing for a browser's HTML
  rendering engine
Reinforcement learning guided fuzz testing for a browser's HTML rendering engine
Martin Sablotny
B. S. Jensen
Jeremy Singer
21
0
0
27 Jul 2023
JoinGym: An Efficient Query Optimization Environment for Reinforcement
  Learning
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Kaiwen Wang
Junxiong Wang
Yueying Li
Nathan Kallus
Immanuel Trummer
Wen Sun
GP
60
2
0
21 Jul 2023
Machine Learning for SAT: Restricted Heuristics and New Graph
  Representations
Machine Learning for SAT: Restricted Heuristics and New Graph Representations
Mikhail Shirokikh
Ilya Shenbin
Anton M. Alekseev
Sergey I. Nikolenko
NAI
23
0
0
18 Jul 2023
Cramer Type Distances for Learning Gaussian Mixture Models by Gradient
  Descent
Cramer Type Distances for Learning Gaussian Mixture Models by Gradient Descent
Ruichong Zhang
30
0
0
13 Jul 2023
Diffusion Policies for Out-of-Distribution Generalization in Offline
  Reinforcement Learning
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
S. E. Ada
Erhan Öztop
Emre Ugur
OffRL
56
15
0
10 Jul 2023
A User Study on Explainable Online Reinforcement Learning for Adaptive
  Systems
A User Study on Explainable Online Reinforcement Learning for Adaptive Systems
Andreas Metzger
Jan Laufer
Felix Feit
Klaus Pohl
OffRL
OnRL
24
1
0
09 Jul 2023
Deep Attention Q-Network for Personalized Treatment Recommendation
Deep Attention Q-Network for Personalized Treatment Recommendation
Simin Ma
Junghwan Lee
N. Serban
Shihao Yang
OffRL
40
5
0
04 Jul 2023
Human-like Decision-making at Unsignalized Intersection using Social
  Value Orientation
Human-like Decision-making at Unsignalized Intersection using Social Value Orientation
Yan Tong
Licheng Wen
Pinlong Cai
Daocheng Fu
Song Mao
Yikang Li
37
2
0
30 Jun 2023
Decentralized Multi-Agent Reinforcement Learning with Global State
  Prediction
Decentralized Multi-Agent Reinforcement Learning with Global State Prediction
Josh Bloom
Pranjal Paliwal
Apratim Mukherjee
Carlo Pinciroli
35
3
0
22 Jun 2023
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error
  Feedback
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback
Hang Wang
Sen Lin
Junshan Zhang
29
19
0
20 Jun 2023
Enhancing variational quantum state diagonalization using reinforcement
  learning techniques
Enhancing variational quantum state diagonalization using reinforcement learning techniques
Akash Kundu
Przemyslaw Bedelek
M. Ostaszewski
Onur Danaci
Yash J. Patel
Vedran Dunjko
Jaroslaw Adam Miszczak
42
8
0
19 Jun 2023
Cooperative Multi-Objective Reinforcement Learning for Traffic Signal
  Control and Carbon Emission Reduction
Cooperative Multi-Objective Reinforcement Learning for Traffic Signal Control and Carbon Emission Reduction
Cheng Ruei Tang
J. Hsieh
Shin-You Teng
21
0
0
16 Jun 2023
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum
  Markov Games: Switching System Approach
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach
Dong-hwan Lee
33
2
0
09 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Rameswar Panda
Marc G. Bellemare
Rishabh Agarwal
Pablo Samuel Castro
OffRL
54
87
0
30 May 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning
  via Langevin Monte Carlo
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Haque Ishfaq
Qingfeng Lan
Pan Xu
A. R. Mahmood
Doina Precup
Anima Anandkumar
Kamyar Azizzadenesheli
BDL
OffRL
33
20
0
29 May 2023
VA-learning as a more efficient alternative to Q-learning
VA-learning as a more efficient alternative to Q-learning
Yunhao Tang
Rémi Munos
Mark Rowland
Michal Valko
OffRL
21
6
0
29 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
38
9
0
29 May 2023
On the Value of Myopic Behavior in Policy Reuse
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
41
1
0
28 May 2023
A Comprehensive Overview and Comparative Analysis on Deep Learning Models: CNN, RNN, LSTM, GRU
A Comprehensive Overview and Comparative Analysis on Deep Learning Models: CNN, RNN, LSTM, GRU
Farhad Shiri
Thinagaran Perumal
N. Mustapha
Raihani Mohamed
AILaw
SyDa
ELM
AI4TS
60
130
0
27 May 2023
Accelerating Value Iteration with Anchoring
Accelerating Value Iteration with Anchoring
Jongmin Lee
Ernest K. Ryu
26
7
0
26 May 2023
Adaptive action supervision in reinforcement learning from real-world
  multi-agent demonstrations
Adaptive action supervision in reinforcement learning from real-world multi-agent demonstrations
Keisuke Fujii
Kazushi Tsutsui
Atom Scott
Hiroshi Nakahara
Naoya Takeishi
Yoshinobu Kawahara
34
6
0
22 May 2023
Deep PackGen: A Deep Reinforcement Learning Framework for Adversarial
  Network Packet Generation
Deep PackGen: A Deep Reinforcement Learning Framework for Adversarial Network Packet Generation
Soumyadeep Hore
Jalal Ghadermazi
Diwas Paudel
Ankit Shah
Tapas K. Das
Nathaniel D. Bastian
AAML
27
13
0
18 May 2023
Black-Box Targeted Reward Poisoning Attack Against Online Deep
  Reinforcement Learning
Black-Box Targeted Reward Poisoning Attack Against Online Deep Reinforcement Learning
Yinglun Xu
Gagandeep Singh
OffRL
AAML
34
3
0
18 May 2023
What Matters in Reinforcement Learning for Tractography
What Matters in Reinforcement Learning for Tractography
Antoine Théberge
Christian Desrosiers
Maxime Descoteaux
Pierre-Marc Jodoin
OffRL
29
2
0
15 May 2023
Task-Oriented Communication Design at Scale
Task-Oriented Communication Design at Scale
Arsham Mostaani
T. Vu
Hamed Habibi
Symeon Chatzinotas
Björn E. Ottersten
31
3
0
15 May 2023
Extracting Diagnosis Pathways from Electronic Health Records Using Deep
  Reinforcement Learning
Extracting Diagnosis Pathways from Electronic Health Records Using Deep Reinforcement Learning
Lillian Muyama
A. Neuraz
Adrien Coulet
18
0
0
10 May 2023
Rescue Conversations from Dead-ends: Efficient Exploration for
  Task-oriented Dialogue Policy Optimization
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization
Yangyang Zhao
Zhenyu Wang
Mehdi Dastani
Shihan Wang
24
0
0
05 May 2023
Bridging Declarative, Procedural, and Conditional Metacognitive
  Knowledge Gap Using Deep Reinforcement Learning
Bridging Declarative, Procedural, and Conditional Metacognitive Knowledge Gap Using Deep Reinforcement Learning
Mark Abdelshiheed
J. Hostetter
Tiffany Barnes
Min Chi
19
4
0
23 Apr 2023
Towards Effective and Interpretable Human-Agent Collaboration in MOBA
  Games: A Communication Perspective
Towards Effective and Interpretable Human-Agent Collaboration in MOBA Games: A Communication Perspective
Yiming Gao
Feiyu Liu
Liang Wang
Zhenjie Lian
Weixuan Wang
...
Jiawei Wang
Qiang Fu
Wei Yang
Lanxiao Huang
Wei Liu
45
7
0
23 Apr 2023
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Qiyang Li
Aviral Kumar
Ilya Kostrikov
Sergey Levine
OffRL
37
32
0
20 Apr 2023
MDDL: A Framework for Reinforcement Learning-based Position Allocation
  in Multi-Channel Feed
MDDL: A Framework for Reinforcement Learning-based Position Allocation in Multi-Channel Feed
Xiaowen Shi
Zehua Wang
Yuanying Cai
Xiaoxu Wu
Fan Yang
Guogang Liao
Yongkang Wang
Xingxing Wang
Dong Wang
OffRL
35
1
0
17 Apr 2023
Continual Semantic Segmentation with Automatic Memory Sample Selection
Continual Semantic Segmentation with Automatic Memory Sample Selection
Lanyun Zhu
Tianrun Chen
Jianxiong Yin
Simon See
Jing Liu
CLL
VLM
29
44
0
11 Apr 2023
Effective control of two-dimensional Rayleigh--Bénard convection:
  invariant multi-agent reinforcement learning is all you need
Effective control of two-dimensional Rayleigh--Bénard convection: invariant multi-agent reinforcement learning is all you need
Colin Vignon
Jean Rabault
Joel Vasanth
Francisco Alcántara-Ávila
M. Mortensen
Ricardo Vinuesa
AI4CE
33
40
0
05 Apr 2023
Mastering Pair Trading with Risk-Aware Recurrent Reinforcement Learning
Mastering Pair Trading with Risk-Aware Recurrent Reinforcement Learning
Weiguang Han
Jimin Huang
Qianqian Xie
Boyi Zhang
Yanzhao Lai
Min Peng
38
4
0
01 Apr 2023
Multi-Flow Transmission in Wireless Interference Networks: A Convergent
  Graph Learning Approach
Multi-Flow Transmission in Wireless Interference Networks: A Convergent Graph Learning Approach
Raz Paul
Kobi Cohen
Gil Kedar
39
5
0
27 Mar 2023
A Survey of Machine Learning-Based Ride-Hailing Planning
A Survey of Machine Learning-Based Ride-Hailing Planning
Dacheng Wen
Yupeng Li
F. Lau
29
4
0
26 Mar 2023
Deep Q-Network Based Decision Making for Autonomous Driving
Deep Q-Network Based Decision Making for Autonomous Driving
M. Ronecker
Yuan-xian Zhu
27
32
0
21 Mar 2023
Boundary-aware Supervoxel-level Iteratively Refined Interactive 3D Image
  Segmentation with Multi-agent Reinforcement Learning
Boundary-aware Supervoxel-level Iteratively Refined Interactive 3D Image Segmentation with Multi-agent Reinforcement Learning
Chaofan Ma
Qisen Xu
Xiangfeng Wang
Bo Jin
Xiaoyun Zhang
Yanfeng Wang
Ya Zhang
42
22
0
19 Mar 2023
Conversational Tree Search: A New Hybrid Dialog Task
Conversational Tree Search: A New Hybrid Dialog Task
Dirk Vath
Lindsey Vanderlyn
Ngoc Thang Vu
53
7
0
17 Mar 2023
Schrödinger's Camera: First Steps Towards a Quantum-Based Privacy
  Preserving Camera
Schrödinger's Camera: First Steps Towards a Quantum-Based Privacy Preserving Camera
Hannah Kirkland
S. Koppal
29
1
0
13 Mar 2023
Twice Regularized Markov Decision Processes: The Equivalence between
  Robustness and Regularization
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization
E. Derman
Yevgeniy Men
M. Geist
Shie Mannor
47
1
0
12 Mar 2023
Reinforcement Learning Based Self-play and State Stacking Techniques for
  Noisy Air Combat Environment
Reinforcement Learning Based Self-play and State Stacking Techniques for Noisy Air Combat Environment
A. Tasbas
S. O. Sahin
N. K. Üre
23
3
0
06 Mar 2023
Double A3C: Deep Reinforcement Learning on OpenAI Gym Games
Double A3C: Deep Reinforcement Learning on OpenAI Gym Games
Yangxin Zhong
Jiajie He
Ling-Xue Kong
OffRL
17
2
0
04 Mar 2023
Understanding plasticity in neural networks
Understanding plasticity in neural networks
Clare Lyle
Zeyu Zheng
Evgenii Nikishin
Bernardo Avila-Pires
Razvan Pascanu
Will Dabney
AI4CE
45
98
0
02 Mar 2023
Previous
12345...171819
Next