Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.06461
Cited By
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
50 / 933 papers shown
Title
Reinforcement Learning for Sampling on Temporal Medical Imaging Sequences
Zhishen Huang
36
1
0
28 Aug 2023
Learning Cyber Defence Tactics from Scratch with Multi-Agent Reinforcement Learning
Jacob Wiebe
Ranwa Al Mallah
Li Li
AAML
38
3
0
25 Aug 2023
CoMIX: A Multi-agent Reinforcement Learning Training Architecture for Efficient Decentralized Coordination and Independent Decision-Making
Giovanni Minelli
Mirco Musolesi
40
0
0
21 Aug 2023
Deep Reinforcement Learning for Artificial Upwelling Energy Management
Yiyuan Zhang
Wei Fan
20
3
0
20 Aug 2023
SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning Agents
Amirhossein Zolfagharian
Manel Abdellatif
Lionel C. Briand
S. Ramesh
37
5
0
03 Aug 2023
Computation Offloading with Multiple Agents in Edge-Computing-Supported IoT
Shihao Shen
Yiwen Han
Xiaofei Wang
Yan Wang
OffRL
23
79
0
01 Aug 2023
Reinforcement learning guided fuzz testing for a browser's HTML rendering engine
Martin Sablotny
B. S. Jensen
Jeremy Singer
21
0
0
27 Jul 2023
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Kaiwen Wang
Junxiong Wang
Yueying Li
Nathan Kallus
Immanuel Trummer
Wen Sun
GP
60
2
0
21 Jul 2023
Machine Learning for SAT: Restricted Heuristics and New Graph Representations
Mikhail Shirokikh
Ilya Shenbin
Anton M. Alekseev
Sergey I. Nikolenko
NAI
23
0
0
18 Jul 2023
Cramer Type Distances for Learning Gaussian Mixture Models by Gradient Descent
Ruichong Zhang
30
0
0
13 Jul 2023
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
S. E. Ada
Erhan Öztop
Emre Ugur
OffRL
56
15
0
10 Jul 2023
A User Study on Explainable Online Reinforcement Learning for Adaptive Systems
Andreas Metzger
Jan Laufer
Felix Feit
Klaus Pohl
OffRL
OnRL
24
1
0
09 Jul 2023
Deep Attention Q-Network for Personalized Treatment Recommendation
Simin Ma
Junghwan Lee
N. Serban
Shihao Yang
OffRL
40
5
0
04 Jul 2023
Human-like Decision-making at Unsignalized Intersection using Social Value Orientation
Yan Tong
Licheng Wen
Pinlong Cai
Daocheng Fu
Song Mao
Yikang Li
37
2
0
30 Jun 2023
Decentralized Multi-Agent Reinforcement Learning with Global State Prediction
Josh Bloom
Pranjal Paliwal
Apratim Mukherjee
Carlo Pinciroli
35
3
0
22 Jun 2023
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback
Hang Wang
Sen Lin
Junshan Zhang
29
19
0
20 Jun 2023
Enhancing variational quantum state diagonalization using reinforcement learning techniques
Akash Kundu
Przemyslaw Bedelek
M. Ostaszewski
Onur Danaci
Yash J. Patel
Vedran Dunjko
Jaroslaw Adam Miszczak
42
8
0
19 Jun 2023
Cooperative Multi-Objective Reinforcement Learning for Traffic Signal Control and Carbon Emission Reduction
Cheng Ruei Tang
J. Hsieh
Shin-You Teng
21
0
0
16 Jun 2023
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach
Dong-hwan Lee
33
2
0
09 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Rameswar Panda
Marc G. Bellemare
Rishabh Agarwal
Pablo Samuel Castro
OffRL
54
87
0
30 May 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Haque Ishfaq
Qingfeng Lan
Pan Xu
A. R. Mahmood
Doina Precup
Anima Anandkumar
Kamyar Azizzadenesheli
BDL
OffRL
33
20
0
29 May 2023
VA-learning as a more efficient alternative to Q-learning
Yunhao Tang
Rémi Munos
Mark Rowland
Michal Valko
OffRL
21
6
0
29 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
38
9
0
29 May 2023
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
41
1
0
28 May 2023
A Comprehensive Overview and Comparative Analysis on Deep Learning Models: CNN, RNN, LSTM, GRU
Farhad Shiri
Thinagaran Perumal
N. Mustapha
Raihani Mohamed
AILaw
SyDa
ELM
AI4TS
60
130
0
27 May 2023
Accelerating Value Iteration with Anchoring
Jongmin Lee
Ernest K. Ryu
26
7
0
26 May 2023
Adaptive action supervision in reinforcement learning from real-world multi-agent demonstrations
Keisuke Fujii
Kazushi Tsutsui
Atom Scott
Hiroshi Nakahara
Naoya Takeishi
Yoshinobu Kawahara
34
6
0
22 May 2023
Deep PackGen: A Deep Reinforcement Learning Framework for Adversarial Network Packet Generation
Soumyadeep Hore
Jalal Ghadermazi
Diwas Paudel
Ankit Shah
Tapas K. Das
Nathaniel D. Bastian
AAML
27
13
0
18 May 2023
Black-Box Targeted Reward Poisoning Attack Against Online Deep Reinforcement Learning
Yinglun Xu
Gagandeep Singh
OffRL
AAML
34
3
0
18 May 2023
What Matters in Reinforcement Learning for Tractography
Antoine Théberge
Christian Desrosiers
Maxime Descoteaux
Pierre-Marc Jodoin
OffRL
29
2
0
15 May 2023
Task-Oriented Communication Design at Scale
Arsham Mostaani
T. Vu
Hamed Habibi
Symeon Chatzinotas
Björn E. Ottersten
31
3
0
15 May 2023
Extracting Diagnosis Pathways from Electronic Health Records Using Deep Reinforcement Learning
Lillian Muyama
A. Neuraz
Adrien Coulet
18
0
0
10 May 2023
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization
Yangyang Zhao
Zhenyu Wang
Mehdi Dastani
Shihan Wang
24
0
0
05 May 2023
Bridging Declarative, Procedural, and Conditional Metacognitive Knowledge Gap Using Deep Reinforcement Learning
Mark Abdelshiheed
J. Hostetter
Tiffany Barnes
Min Chi
19
4
0
23 Apr 2023
Towards Effective and Interpretable Human-Agent Collaboration in MOBA Games: A Communication Perspective
Yiming Gao
Feiyu Liu
Liang Wang
Zhenjie Lian
Weixuan Wang
...
Jiawei Wang
Qiang Fu
Wei Yang
Lanxiao Huang
Wei Liu
45
7
0
23 Apr 2023
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Qiyang Li
Aviral Kumar
Ilya Kostrikov
Sergey Levine
OffRL
37
32
0
20 Apr 2023
MDDL: A Framework for Reinforcement Learning-based Position Allocation in Multi-Channel Feed
Xiaowen Shi
Zehua Wang
Yuanying Cai
Xiaoxu Wu
Fan Yang
Guogang Liao
Yongkang Wang
Xingxing Wang
Dong Wang
OffRL
35
1
0
17 Apr 2023
Continual Semantic Segmentation with Automatic Memory Sample Selection
Lanyun Zhu
Tianrun Chen
Jianxiong Yin
Simon See
Jing Liu
CLL
VLM
29
44
0
11 Apr 2023
Effective control of two-dimensional Rayleigh--Bénard convection: invariant multi-agent reinforcement learning is all you need
Colin Vignon
Jean Rabault
Joel Vasanth
Francisco Alcántara-Ávila
M. Mortensen
Ricardo Vinuesa
AI4CE
33
40
0
05 Apr 2023
Mastering Pair Trading with Risk-Aware Recurrent Reinforcement Learning
Weiguang Han
Jimin Huang
Qianqian Xie
Boyi Zhang
Yanzhao Lai
Min Peng
38
4
0
01 Apr 2023
Multi-Flow Transmission in Wireless Interference Networks: A Convergent Graph Learning Approach
Raz Paul
Kobi Cohen
Gil Kedar
39
5
0
27 Mar 2023
A Survey of Machine Learning-Based Ride-Hailing Planning
Dacheng Wen
Yupeng Li
F. Lau
29
4
0
26 Mar 2023
Deep Q-Network Based Decision Making for Autonomous Driving
M. Ronecker
Yuan-xian Zhu
27
32
0
21 Mar 2023
Boundary-aware Supervoxel-level Iteratively Refined Interactive 3D Image Segmentation with Multi-agent Reinforcement Learning
Chaofan Ma
Qisen Xu
Xiangfeng Wang
Bo Jin
Xiaoyun Zhang
Yanfeng Wang
Ya Zhang
42
22
0
19 Mar 2023
Conversational Tree Search: A New Hybrid Dialog Task
Dirk Vath
Lindsey Vanderlyn
Ngoc Thang Vu
53
7
0
17 Mar 2023
Schrödinger's Camera: First Steps Towards a Quantum-Based Privacy Preserving Camera
Hannah Kirkland
S. Koppal
29
1
0
13 Mar 2023
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization
E. Derman
Yevgeniy Men
M. Geist
Shie Mannor
47
1
0
12 Mar 2023
Reinforcement Learning Based Self-play and State Stacking Techniques for Noisy Air Combat Environment
A. Tasbas
S. O. Sahin
N. K. Üre
23
3
0
06 Mar 2023
Double A3C: Deep Reinforcement Learning on OpenAI Gym Games
Yangxin Zhong
Jiajie He
Ling-Xue Kong
OffRL
17
2
0
04 Mar 2023
Understanding plasticity in neural networks
Clare Lyle
Zeyu Zheng
Evgenii Nikishin
Bernardo Avila-Pires
Razvan Pascanu
Will Dabney
AI4CE
45
98
0
02 Mar 2023
Previous
1
2
3
4
5
...
17
18
19
Next