ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.06461
  4. Cited By
Deep Reinforcement Learning with Double Q-learning
v1v2v3 (latest)

Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown
Title
Scheduling Drone and Mobile Charger via Hybrid-Action Deep Reinforcement
  Learning
Scheduling Drone and Mobile Charger via Hybrid-Action Deep Reinforcement Learning
Jizhe Dou
Haotian Zhang
Guodong Sun
89
0
0
16 Mar 2024
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
Nicholas Zolman
Urban Fasel
J. Nathan Kutz
Steven L. Brunton
AI4CE
84
11
0
14 Mar 2024
Symmetric Q-learning: Reducing Skewness of Bellman Error in Online
  Reinforcement Learning
Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning
Motoki Omura
Takayuki Osa
Yusuke Mukuta
Tatsuya Harada
OffRL
51
0
0
12 Mar 2024
An Improved Strategy for Blood Glucose Control Using Multi-Step Deep
  Reinforcement Learning
An Improved Strategy for Blood Glucose Control Using Multi-Step Deep Reinforcement Learning
Weiwei Gu
Senquan Wang
65
7
0
12 Mar 2024
Ensembling Prioritized Hybrid Policies for Multi-agent Pathfinding
Ensembling Prioritized Hybrid Policies for Multi-agent Pathfinding
Huijie Tang
Federico Berto
Jinkyoo Park
106
4
0
12 Mar 2024
RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic
  Manipulations With Large Language Models
RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic Manipulations With Large Language Models
Liangliang Chen
Yutian Lei
Shiyu Jin
Ying Zhang
Liangjun Zhang
LM&Ro
105
12
0
11 Mar 2024
Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach
Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach
Narim Jeong
Donghwan Lee
27
1
0
11 Mar 2024
Fish-inspired tracking of underwater turbulent plumes
Fish-inspired tracking of underwater turbulent plumes
Peter Gunnarson
J. Dabiri
64
4
0
10 Mar 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
100
3
0
09 Mar 2024
Conservative DDPG -- Pessimistic RL without Ensemble
Conservative DDPG -- Pessimistic RL without Ensemble
Nitsan Soffair
Shie Mannor
OffRL
47
0
0
08 Mar 2024
A Survey on Applications of Reinforcement Learning in Spatial Resource
  Allocation
A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation
Di Zhang
Moyang Wang
Joseph D Mango
Xiang Li
Xianrui Xu
105
1
0
06 Mar 2024
Koopman-Assisted Reinforcement Learning
Koopman-Assisted Reinforcement Learning
Preston Rozwood
Edward Mehrez
Ludger Paehler
Wen Sun
Steven L. Brunton
109
10
0
04 Mar 2024
Deep Reinforcement Learning for Dynamic Algorithm Selection: A
  Proof-of-Principle Study on Differential Evolution
Deep Reinforcement Learning for Dynamic Algorithm Selection: A Proof-of-Principle Study on Differential Evolution
Hongshu Guo
Yining Ma
Zeyuan Ma
Jiacheng Chen
Xinglin Zhang
Zhiguang Cao
Jun Zhang
Yue-Jiao Gong
100
23
0
04 Mar 2024
A Case for Validation Buffer in Pessimistic Actor-Critic
A Case for Validation Buffer in Pessimistic Actor-Critic
Michal Nauman
M. Ostaszewski
Marek Cygan
68
0
0
01 Mar 2024
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
Yifei Zhou
Andrea Zanette
Jiayi Pan
Sergey Levine
Aviral Kumar
146
79
0
29 Feb 2024
EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation
EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation
Yuqiao Wen
Behzad Shayegh
Chenyang Huang
Yanshuai Cao
Lili Mou
141
5
0
29 Feb 2024
Multistatic-Radar RCS-Signature Recognition of Aerial Vehicles: A
  Bayesian Fusion Approach
Multistatic-Radar RCS-Signature Recognition of Aerial Vehicles: A Bayesian Fusion Approach
Owen Howell
M. Akçakaya
Marius Necsoiu
G. Schirner
Deniz Erdogmus
Tales Imbiriba
104
2
0
28 Feb 2024
Think2Drive: Efficient Reinforcement Learning by Thinking in Latent
  World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2)
Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2)
Qifeng Li
Xiaosong Jia
Shaobo Wang
Junchi Yan
124
34
0
26 Feb 2024
Reward Design for Justifiable Sequential Decision-Making
Reward Design for Justifiable Sequential Decision-Making
A. Sukovic
Goran Radanović
52
0
0
24 Feb 2024
Discretionary Lane-Change Decision and Control via Parameterized Soft
  Actor-Critic for Hybrid Action Space
Discretionary Lane-Change Decision and Control via Parameterized Soft Actor-Critic for Hybrid Action Space
Yuan Lin
Xiao Liu
Zishun Zheng
58
5
0
24 Feb 2024
Analysis of Off-Policy Multi-Step TD-Learning with Linear Function
  Approximation
Analysis of Off-Policy Multi-Step TD-Learning with Linear Function Approximation
Donghwan Lee
81
0
0
24 Feb 2024
A priori Estimates for Deep Residual Network in Continuous-time
  Reinforcement Learning
A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning
Shuyu Yin
Qixuan Zhou
Fei Wen
Tao Luo
74
0
0
24 Feb 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy
  Regularization
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji
Yongyuan Liang
Yan Zeng
Yu-Juan Luo
Guowei Xu
Jiawei Guo
Ruijie Zheng
Furong Huang
Gang Hua
Huazhe Xu
CML
111
12
0
22 Feb 2024
Automated Design and Optimization of Distributed Filtering Circuits via
  Reinforcement Learning
Automated Design and Optimization of Distributed Filtering Circuits via Reinforcement Learning
Peng Gao
Tao Yu
Fei Wang
Ruyue Yuan
32
1
0
22 Feb 2024
Computation Offloading for Multi-server Multi-access Edge Vehicular
  Networks: A DDQN-based Method
Computation Offloading for Multi-server Multi-access Edge Vehicular Networks: A DDQN-based Method
Siyu Wang
Bo Yang
Zhiwen Yu
Xuelin Cao
Yan Zhang
Chau Yuen
26
0
0
21 Feb 2024
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
Anya Sims
Cong Lu
Yee Whye Teh
OffRL
96
4
0
19 Feb 2024
In value-based deep reinforcement learning, a pruned network is a good
  network
In value-based deep reinforcement learning, a pruned network is a good network
J. Obando-Ceron
Rameswar Panda
Pablo Samuel Castro
OffRL
119
26
0
19 Feb 2024
Reinforcement learning to maximise wind turbine energy generation
Reinforcement learning to maximise wind turbine energy generation
Daniel Soler
O. Marino
D. Huergo
Martín de Frutos
Esteban Ferrer
50
0
0
17 Feb 2024
SINR-Aware Deep Reinforcement Learning for Distributed Dynamic Channel
  Allocation in Cognitive Interference Networks
SINR-Aware Deep Reinforcement Learning for Distributed Dynamic Channel Allocation in Cognitive Interference Networks
Yaniv Cohen
Tomer Gafni
Ronen Greenberg
Kobi Cohen
36
5
0
17 Feb 2024
Symmetry-Breaking Augmentations for Ad Hoc Teamwork
Symmetry-Breaking Augmentations for Ad Hoc Teamwork
Ravi Hammond
Dustin Craggs
Mingyu Guo
Jakob Foerster
Ian Reid
92
2
0
15 Feb 2024
Learning Interpretable Policies in Hindsight-Observable POMDPs through
  Partially Supervised Reinforcement Learning
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Michael Lanier
Ying Xu
Nathan Jacobs
Chongjie Zhang
Yevgeniy Vorobeychik
64
2
0
14 Feb 2024
FGeo-DRL: Deductive Reasoning for Geometric Problems through Deep
  Reinforcement Learning
FGeo-DRL: Deductive Reasoning for Geometric Problems through Deep Reinforcement Learning
Jia Zou
Xiaokai Zhang
Yiming He
Na Zhu
Tuo Leng
AIMatAI4CELRM
102
4
0
14 Feb 2024
Deep Reinforcement Learning for Controlled Traversing of the Attractor Landscape of Boolean Models in the Context of Cellular Reprogramming
Deep Reinforcement Learning for Controlled Traversing of the Attractor Landscape of Boolean Models in the Context of Cellular Reprogramming
Andrzej Mizera
Jakub Zarzycki
69
1
0
13 Feb 2024
Scaling Intelligent Agents in Combat Simulations for Wargaming
Scaling Intelligent Agents in Combat Simulations for Wargaming
Scotty Black
Christian J. Darken
20
1
0
08 Feb 2024
Scaling Artificial Intelligence for Digital Wargaming in Support of
  Decision-Making
Scaling Artificial Intelligence for Digital Wargaming in Support of Decision-Making
Scotty Black
Christian J. Darken
21
2
0
08 Feb 2024
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy
  Optimization
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization
Talha Bozkus
Urbashi Mitra
OffRL
72
5
0
08 Feb 2024
Analyzing Adversarial Inputs in Deep Reinforcement Learning
Analyzing Adversarial Inputs in Deep Reinforcement Learning
Davide Corsi
Guy Amir
Guy Katz
Alessandro Farinelli
AAML
63
7
0
07 Feb 2024
Pedestrian crossing decisions can be explained by bounded optimal
  decision-making under noisy visual perception
Pedestrian crossing decisions can be explained by bounded optimal decision-making under noisy visual perception
Yueyang Wang
Aravinda Ramakrishnan Srinivasan
Jussi P. P. Jokinen
Antti Oulasvirta
Gustav Markkula
50
2
0
06 Feb 2024
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement
  Learning Using Unique Experiences
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences
Nikhil Kumar Singh
Indranil Saha
OffRL
35
0
0
05 Feb 2024
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice
  via HyperAgent
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Yingru Li
Jiawei Xu
Lei Han
Zhi-Quan Luo
BDLOffRL
101
5
0
05 Feb 2024
Evading Deep Learning-Based Malware Detectors via Obfuscation: A Deep
  Reinforcement Learning Approach
Evading Deep Learning-Based Malware Detectors via Obfuscation: A Deep Reinforcement Learning Approach
Brian Etter
Junjie Hu
Mohammedreza Ebrahimi
Weifeng Li
Xin Li
Hsinchun Chen
87
1
0
04 Feb 2024
Device Scheduling and Assignment in Hierarchical Federated Learning for
  Internet of Things
Device Scheduling and Assignment in Hierarchical Federated Learning for Internet of Things
Tinghao Zhang
Kwok-Yan Lam
Jun Zhao
83
10
0
04 Feb 2024
NetLLM: Adapting Large Language Models for Networking
NetLLM: Adapting Large Language Models for Networking
Duo Wu
Xianda Wang
Yaqi Qiao
Zhi Wang
Junchen Jiang
Shuguang Cui
Fangxin Wang
94
48
0
04 Feb 2024
Towards Optimal Adversarial Robust Q-learning with Bellman
  Infinity-error
Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error
Haoran Li
Zicheng Zhang
Wang Luo
Congying Han
Yudong Hu
Tiande Guo
Shichen Liao
AAML
135
2
0
03 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement
  Learning and Large Language Models
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
80
10
0
02 Feb 2024
Training Adversarial yet Safe Agent to Characterize Safety Performance
  of Highly Automated Vehicles
Training Adversarial yet Safe Agent to Characterize Safety Performance of Highly Automated Vehicles
Minghao Zhu
Anmol Sidhu
Keith A. Redmill
53
0
0
02 Feb 2024
Neural Style Transfer with Twin-Delayed DDPG for Shared Control of
  Robotic Manipulators
Neural Style Transfer with Twin-Delayed DDPG for Shared Control of Robotic Manipulators
R. Fernandez-Fernandez
Marco Aggravi
P. Giordano
J. Victores
C. Pacchierotti
130
4
0
01 Feb 2024
Transferring human emotions to robot motions using Neural Policy Style
  Transfer
Transferring human emotions to robot motions using Neural Policy Style Transfer
R. Fernandez-Fernandez
Bartek Łukawski
J. Victores
C. Pacchierotti
71
22
0
01 Feb 2024
Control in Stochastic Environment with Delays: A Model-based
  Reinforcement Learning Approach
Control in Stochastic Environment with Delays: A Model-based Reinforcement Learning Approach
Zhiyuan Yao
Ionuţ Florescu
Chihoon Lee
OffRL
43
2
0
01 Feb 2024
SwarmBrain: Embodied agent for real-time strategy game StarCraft II via
  large language models
SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language models
Xiao Shao
Weifu Jiang
Fei Zuo
Mengqing Liu
LLMAG
95
7
0
31 Jan 2024
Previous
123...678...444546
Next