ResearchTrend.AI
arXiv: 1509.06461
Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown
Title
Robust Q-Learning for finite ambiguity sets
Cécile Decker
Julian Sester
97
1
0
05 Jul 2024
DRLQ: A Deep Reinforcement Learning-based Task Placement for Quantum Cloud Computing
H. T. Nguyen
Muhammad Usman
Rajkumar Buyya
56
0
0
03 Jul 2024
Hybrid RAG-empowered Multi-modal LLM for Secure Healthcare Data Management: A Diffusion-based Contract Theory Approach
Cheng Su
Jinbo Wen
Jiawen Kang
Yonghua Wang
Hudan Pan
M. S. Hossain
MedIm
45
0
0
01 Jul 2024
Mental Modeling of Reinforcement Learning Agents by Language Models
Wenhao Lu
Xufeng Zhao
Josua Spisak
Jae Hee Lee
Stefan Wermter
LLMAG LRM LM&Ro
68
3
0
26 Jun 2024
When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning
C. Voelcker
Tyler Kastner
Igor Gilitschenski
Amir-massoud Farahmand
SSL
90
6
0
25 Jun 2024
KANQAS: Kolmogorov-Arnold Network for Quantum Architecture Search
Akash Kundu
Aritra Sarkar
Abhishek Sadhu
94
31
0
25 Jun 2024
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
J. Obando-Ceron
J. G. Araújo
Rameswar Panda
Pablo Samuel Castro
120
9
0
25 Jun 2024
OCALM: Object-Centric Assessment with Language Models
Timo Kaufmann
Johannes Czech
Antonia Wüst
Quentin Delfosse
Kristian Kersting
Eyke Hüllermeier
LM&Ro LRM
88
1
0
24 Jun 2024
Understanding and Diagnosing Deep Reinforcement Learning
Ezgi Korkmaz
68
3
0
23 Jun 2024
Learning Abstract World Model for Value-preserving Planning with Options
Rafael Rodríguez-Sánchez
George Konidaris
86
1
0
22 Jun 2024
Learning to Select Goals in Automated Planning with Deep-Q Learning
Carlos Núñez-Molina
Juan Fernández-Olivares
Raúl Pérez
69
10
0
20 Jun 2024
Graph Neural Networks for Job Shop Scheduling Problems: A Survey
Igor G. Smit
Jianan Zhou
Robbert Reijnen
Yaoxin Wu
Jian Chen
Cong Zhang
Zaharah Bukhsh
Wim P. M. Nuijten
Yingqian Zhang
GNN AI4CE
117
11
0
20 Jun 2024
Do Not Wait: Learning Re-Ranking Model Without User Feedback At Serving Time in E-Commerce
Yuan Wang
Zhiyu Li
Changshuo Zhang
Sirui Chen
Xiao Zhang
Jun Xu
Quan Lin
64
1
0
20 Jun 2024
Discovering Minimal Reinforcement Learning Environments
Jarek Liesen
Chris Xiaoxuan Lu
Andrei Lupu
Jakob N. Foerster
Henning Sprekeler
R. T. Lange
OffRL
92
4
0
18 Jun 2024
More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling
Haque Ishfaq
Yixin Tan
Yu Yang
Qingfeng Lan
Jianfeng Lu
A. Rupam Mahmood
Doina Precup
Pan Xu
89
5
0
18 Jun 2024
Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions
Noah Golowich
Ankur Moitra
OffRL
68
1
0
17 Jun 2024
Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction
Zepeng Ding
Ruiyang Ke
Wenhao Huang
Guochao Jiang
Yanda Li
Deqing Yang
Jiaqing Liang
89
1
0
17 Jun 2024
An Imitative Reinforcement Learning Framework for Autonomous Dogfight
Siyuan Li
Rongchang Zuo
Peng Liu
Yingnan Zhao
112
1
0
17 Jun 2024
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning
Hao Bai
Yifei Zhou
Mert Cemri
Jiayi Pan
Alane Suhr
Sergey Levine
Aviral Kumar
OffRL
111
65
0
14 Jun 2024
Finite-Time Analysis of Simultaneous Double Q-learning
Hyunjun Na
Donghwan Lee
68
0
0
14 Jun 2024
Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning
Xiaojun Bi
Mingjie He
Yiwen Sun
52
1
0
14 Jun 2024
CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms
Arda Sarp Yenicesu
Furkan B. Mutlu
Suleyman S. Kozat
Ozgur S. Oguz
29
1
0
13 Jun 2024
Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectors
Zhenglong Luo
Zhiyong Chen
James Welsh
37
1
0
12 Jun 2024
Bootstrapping Expectiles in Reinforcement Learning
Pierre Clavier
Emmanuel Rachelson
E. L. Pennec
Matthieu Geist
OffRL
81
0
0
06 Jun 2024
Exploring Pessimism and Optimism Dynamics in Deep Reinforcement Learning
Bahareh Tasdighi
Nicklas Werge
Yi-Shan Wu
M. Kandemir
30
0
0
06 Jun 2024
Quality-Diversity with Limited Resources
Ren-Jian Wang
Ke Xue
Cong Guan
Chao Qian
84
3
0
06 Jun 2024
Reflective Policy Optimization
Yaozhong Gan
Renye Yan
Zhe Wu
Junliang Xing
84
1
0
06 Jun 2024
Tabular and Deep Learning for the Whittle Index
Francisco Robledo Relaño
Vivek Borkar
U. Ayesta
Konstantin Avrachenkov
55
2
0
04 Jun 2024
Verifying the Generalization of Deep Learning to Out-of-Distribution Domains
Guy Amir
Osher Maayan
Tom Zelazny
Guy Katz
Michael Schapira
AAML
63
1
0
04 Jun 2024
Learning the Target Network in Function Space
Kavosh Asadi
Yao Liu
Shoham Sabach
Ming Yin
Rasool Fakoor
119
0
0
03 Jun 2024
Federated Learning-based Collaborative Wideband Spectrum Sensing and Scheduling for UAVs in UTM Systems
Sravan Reddy Chintareddy
Keenan Roach
Kenny Cheung
Morteza Hashemi
44
2
0
03 Jun 2024
A New View on Planning in Online Reinforcement Learning
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Martha White
OffRL
97
0
0
03 Jun 2024
Learning-based legged locomotion; state of the art and future perspectives
Sehoon Ha
Joonho Lee
M. van de Panne
Zhaoming Xie
Wenhao Yu
Majid Khadiv
144
20
0
03 Jun 2024
Deep reinforcement learning for weakly coupled MDP's with continuous actions
Francisco Robledo
U. Ayesta
Konstantin Avrachenkov
53
0
0
03 Jun 2024
Value Improved Actor Critic Algorithms
Yaniv Oren
Moritz A. Zanger
Pascal R. van der Vaart
M. Spaan
Wendelin Bohmer
OffRL
89
0
0
03 Jun 2024
REvolve: Reward Evolution with Large Language Models using Human Feedback
Rishi Hazra
Alkis Sygkounas
Andreas Persson
Amy Loutfi
Pedro Zuidberg Dos Martires
101
3
0
03 Jun 2024
Deep Reinforcement Learning for Sim-to-Real Policy Transfer of VTOL-UAVs Offshore Docking Operations
A. M. Ali
Aryaman Gupta
Hashim A. Hashim
OffRL
65
7
0
02 Jun 2024
Shared-unique Features and Task-aware Prioritized Sampling on Multi-task Reinforcement Learning
Po-Shao Lin
Jia-Fong Yeh
Yi-Ting Chen
Winston H. Hsu
85
0
0
02 Jun 2024
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
Zechu Li
Rickmer Krohn
Tao Chen
Anurag Ajay
Pulkit Agrawal
Georgia Chalvatzaki
DiffM
130
18
0
02 Jun 2024
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
Hyungho Na
IL-Chul Moon
67
1
0
30 May 2024
Safety through Permissibility: Shield Construction for Fast and Safe Reinforcement Learning
A. Politowicz
Sahisnu Mazumder
Bing-Quan Liu
58
0
0
29 May 2024
FDQN: A Flexible Deep Q-Network Framework for Game Automation
Prabhath Reddy Gujavarthy
OffRL
26
0
0
29 May 2024
DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime
Zhiyao Luo
Mingcheng Zhu
Fenglin Liu
Jiali Li
Yangchen Pan
Jiandong Zhou
Tingting Zhu
OffRL
62
3
0
28 May 2024
Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL
Yu-Juan Luo
Tianying Ji
Gang Hua
Jianwei Zhang
Huazhe Xu
Xianyuan Zhan
OffRL OnRL
108
3
0
28 May 2024
Highway Reinforcement Learning
Yuhui Wang
M. Strupl
Francesco Faccio
Qingyuan Wu
Haozhe Liu
Michal Grudzień
Xiaoyang Tan
Jürgen Schmidhuber
OffRL
73
4
0
28 May 2024
Mollification Effects of Policy Gradient Methods
Tao Wang
Sylvia Herbert
Sicun Gao
96
1
0
28 May 2024
Interpretable DRL-based Maneuver Decision of UCAV Dogfight
H. Han
Jian Cheng
Maolong Lv
65
1
0
28 May 2024
Rethinking Transformers in Solving POMDPs
Chenhao Lu
Ruizhe Shi
Yuyao Liu
Kaizhe Hu
Simon S. Du
Huazhe Xu
AI4CE
117
3
0
27 May 2024
Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales
Ju-Seung Byun
Andrew Perrault
57
1
0
27 May 2024
Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space
Bangzheng Li
Ningshan Ma
Zifan Wang
44
0
1
26 May 2024