ResearchTrend.AI
Deep Reinforcement Learning with Double Q-learning


22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 953 papers shown
Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation
Qingfeng Lan
Yangchen Pan
Jun Luo
A. R. Mahmood
OffRL
45
8
0
22 May 2022
Reinforced Pedestrian Attribute Recognition with Group Optimization Reward
Zhong Ji
Zhenfei Hu
Yaodong Wang
Shengjia Li
24
5
0
21 May 2022
Distributed Multi-Agent Deep Reinforcement Learning for Robust Coordination against Noise
Yoshinari Motokawa
T. Sugawara
30
2
0
19 May 2022
Robust Losses for Learning Value Functions
Andrew Patterson
Victor Liao
Martha White
33
12
0
17 May 2022
The Primacy Bias in Deep Reinforcement Learning
Evgenii Nikishin
Max Schwarzer
P. D'Oro
Pierre-Luc Bacon
Rameswar Panda
OnRL
96
183
0
16 May 2022
From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
D. Tiapkin
Denis Belomestny
Eric Moulines
A. Naumov
S. Samsonov
Yunhao Tang
Michal Valko
Pierre Menard
34
17
0
16 May 2022
PrefixRL: Optimization of Parallel Prefix Circuits using Deep Reinforcement Learning
Rajarshi Roy
Jonathan Raiman
Neel Kant
Ilyas Elkin
Robert M. Kirby
Michael Siu
S. Oberman
Saad Godil
Bryan Catanzaro
40
38
0
14 May 2022
Learning to Solve Vehicle Routing Problems: A Survey
Aigerim Bogyrbayeva
Meraryslan Meraliyev
Taukekhan Mustakhov
Bissenbay Dauletbayev
38
24
0
05 May 2022
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning
Chenyu Sun
Hangwei Qian
Chunyan Miao
OffRL
36
12
0
02 May 2022
Evolutionary Approach to Security Games with Signaling
A. Żychowski
Jacek Mańdziuk
Elizabeth Bondi-Kelly
Aravind Venugopal
Milind Tambe
Balaraman Ravindran
24
4
0
29 Apr 2022
BATS: Best Action Trajectory Stitching
I. Char
Viraj Mehta
Adam R. Villaflor
John M. Dolan
J. Schneider
OffRL
38
8
0
26 Apr 2022
Skill-based Meta-Reinforcement Learning
Taewook Nam
Shao-Hua Sun
Karl Pertsch
Sung Ju Hwang
Joseph J. Lim
OffRL
34
45
0
25 Apr 2022
Adaptive actuation of magnetic soft robots using deep reinforcement learning
Jianpeng Yao
Quanliang Cao
Yuwei Ju
Yuxuan Sun
Ruiqi Liu
Xiaotao Han
Liang Li
AI4CE
30
23
0
25 Apr 2022
Joint Learning of Reward Machines and Policies in Environments with Partially Known Semantics
Christos K. Verginis
Cevahir Köprülü
Sandeep Chinchali
Ufuk Topcu
35
10
0
20 Apr 2022
Understanding and Preventing Capacity Loss in Reinforcement Learning
Clare Lyle
Mark Rowland
Will Dabney
CLL
41
110
0
20 Apr 2022
Safer Autonomous Driving in a Stochastic, Partially-Observable Environment by Hierarchical Contingency Planning
Ugo Lecerf
Christelle Yemdji Tchassi
Pietro Michiardi
30
1
0
13 Apr 2022
Reinforcement learning on graphs: A survey
Mingshuo Nie
Dongming Chen
Dongqi Wang
49
46
0
13 Apr 2022
Optimizing the Long-Term Behaviour of Deep Reinforcement Learning for Pushing and Grasping
Rodrigo Chau
33
0
0
07 Apr 2022
Robust Event-Driven Interactions in Cooperative Multi-Agent Learning
Daniel Jarne Ornia
M. Mazo
42
1
0
07 Apr 2022
Safe Reinforcement Learning via Shielding under Partial Observability
Steven Carr
N. Jansen
Sebastian Junges
Ufuk Topcu
19
45
0
02 Apr 2022
Robust Fuzzy Q-Learning-Based Strictly Negative Imaginary Tracking Controllers for the Uncertain Quadrotor Systems
V. Tran
M. A. Mabrok
S. Anavatti
Matthew A. Garratt
I. Petersen
17
15
0
26 Mar 2022
Deep reinforcement learning guided graph neural networks for brain network analysis
Xusheng Zhao
Jia Wu
Hao Peng
Amin Beheshti
Jessica J. M. Monaghan
...
Mark Dras
Qiong Dai
Yangyang Li
Philip S. Yu
Lifang He
GNN
35
45
0
18 Mar 2022
How to Learn from Risk: Explicit Risk-Utility Reinforcement Learning for Efficient and Safe Driving Strategies
Lukas M. Schmidt
Sebastian Rietsch
Axel Plinge
Bjoern M. Eskofier
Christopher Mutschler
OffRL
40
5
0
16 Mar 2022
Orchestrated Value Mapping for Reinforcement Learning
Mehdi Fatemi
Arash Tavakoli
27
8
0
14 Mar 2022
Active Phase-Encode Selection for Slice-Specific Fast MR Scanning Using a Transformer-Based Deep Reinforcement Learning Framework
Yiming Liu
Yanwei Pang
Ruiqi Jin
Zhenchang Wang
MedIm
21
2
0
11 Mar 2022
Artificial Intelligence in Vehicular Wireless Networks: A Case Study Using ns-3
Matteo Drago
Tommaso Zugno
Federico Mason
M. Giordani
Mate Boban
M. Zorzi
32
7
0
10 Mar 2022
Near-optimal Deep Reinforcement Learning Policies from Data for Zone Temperature Control
L. D. Natale
B. Svetozarevic
Philipp Heer
Colin N. Jones
OffRL
AI4CE
40
6
0
10 Mar 2022
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
45
229
0
09 Mar 2022
Deep Q-network using reservoir computing with multi-layered readout
Toshitaka Matsuki
OffRL
26
2
0
03 Mar 2022
Improving the Diversity of Bootstrapped DQN by Replacing Priors With Noise
Li Meng
Morten Goodwin
Anis Yazidi
P. Engelstad
24
4
0
02 Mar 2022
Avalanche RL: a Continual Reinforcement Learning Library
Nicolo Lucchesi
Antonio Carta
Vincenzo Lomonaco
Davide Bacciu
42
6
0
28 Feb 2022
Learning to Liquidate Forex: Optimal Stopping via Adaptive Top-K Regression
Diksha Garg
Pankaj Malhotra
Anil Bhatia
Sanjay Bhat
L. Vig
Gautam M. Shroff
37
0
0
25 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
16
15
0
23 Feb 2022
Coordinate-Aligned Multi-Camera Collaboration for Active Multi-Object Tracking
Zeyu Fang
Jian Zhao
Mingyu Yang
Wen-gang Zhou
Zhenbo Lu
Houqiang Li
39
10
0
22 Feb 2022
Retrieval-Augmented Reinforcement Learning
Anirudh Goyal
A. Friesen
Andrea Banino
T. Weber
Nan Rosemary Ke
...
Michal Valko
Simon Osindero
Timothy Lillicrap
N. Heess
Charles Blundell
OffRL
37
53
0
17 Feb 2022
Domain Adaptive Fake News Detection via Reinforcement Learning
Ahmadreza Mosallanezhad
Mansooreh Karami
Kai Shu
M. Mancenido
Huan Liu
20
71
0
16 Feb 2022
Sequential Bayesian experimental designs via reinforcement learning
Hikaru Asano
OffRL
18
0
0
14 Feb 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Maja Franz
Lucas Wolf
Maniraman Periyasamy
Christian Ufrecht
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Wolfgang Mauerer
41
29
0
10 Feb 2022
Precision Radiotherapy via Information Integration of Expert Human Knowledge and AI Recommendation to Optimize Clinical Decision Making
Wenbo Sun
D. Niraula
Issam El-Naqa
R. T. Haken
I. Dinov
K. Cuneo
J. Jin
10
15
0
09 Feb 2022
Theory-inspired Parameter Control Benchmarks for Dynamic Algorithm Configuration
André Biedenkapp
Nguyen Dang
Martin S. Krejca
Frank Hutter
Carola Doerr
39
8
0
07 Feb 2022
Reinforcement learning for multi-item retrieval in the puzzle-based storage system
Jingxu He
Xinglu Liu
Qiyao Duan
Wai Kin Victor Chan
Mingyao Qi
20
15
0
05 Feb 2022
Offline Reinforcement Learning for Mobile Notifications
Yiping Yuan
A. Muralidharan
Preetam Nandy
Miao Cheng
Prakruthi Prabhakar
OffRL
36
9
0
04 Feb 2022
Robustness and Adaptability of Reinforcement Learning based Cooperative Autonomous Driving in Mixed-autonomy Traffic
Rodolfo Valiente
Behrad Toghi
Ramtin Pedarsani
Y. P. Fallah
74
56
0
02 Feb 2022
Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent Decision-Making in Mixed Traffic Environments
Qi Liu
Zirui Li
Xueyuan Li
Jingda Wu
Shihua Yuan
AI4CE
44
8
0
30 Jan 2022
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes
Chao Qu
Jue Chen
Siqiao Xue
Xiaoming Shi
James Y. Zhang
Hongyuan Mei
OffRL
35
17
0
29 Jan 2022
Mask-based Latent Reconstruction for Reinforcement Learning
Tao Yu
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
29
44
0
28 Jan 2022
Generative Adversarial Exploration for Reinforcement Learning
Weijun Hong
Menghui Zhu
Minghuan Liu
Weinan Zhang
Ming Zhou
Yong Yu
Peng Sun
OnRL
39
7
0
27 Jan 2022
Multi-Agent Adversarial Attacks for Multi-Channel Communications
Juncheng Dong
Suya Wu
Mohammadreza Soltani
Vahid Tarokh
AAML
46
3
0
22 Jan 2022
A Prescriptive Dirichlet Power Allocation Policy with Deep Reinforcement Learning
Yuan Tian
Minghao Han
Chetan S. Kulkarni
Olga Fink
25
13
0
20 Jan 2022
Anytime PSRO for Two-Player Zero-Sum Games
Stephen Marcus McAleer
Kevin A. Wang
John Lanier
Marc Lanctot
Pierre Baldi
Tuomas Sandholm
Roy Fox
24
12
0
19 Jan 2022