Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. van Hasselt
A. Guez
David Silver
    OffRL
arXiv: 1509.06461

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 933 papers shown
Title
HAVA: Hybrid Approach to Value-Alignment through Reward Weighing for Reinforcement Learning
Kryspin Varys
Federico Cerutti
Adam Sobey
Timothy J. Norman
28
0
0
21 May 2025
SAINT: Attention-Based Modeling of Sub-Action Dependencies in Multi-Action Policies
Matthew Landers
Taylor W. Killian
Thomas Hartvigsen
Afsaneh Doryab
32
0
0
17 May 2025
Scalability of Reinforcement Learning Methods for Dispatching in Semiconductor Frontend Fabs: A Comparison of Open-Source Models with Real Industry Datasets
Patrick Stöckermann
Henning Südfeld
Alessandro Immordino
Thomas Altenmüller
Marc Wegmann
Martin Gebser
Konstantin Schekotihin
Georg Seidel
Chew Wye Chan
Fei Fei Zhang
OffRL
22
0
0
16 May 2025
Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM
Thang Duong
Minglai Yang
Chicheng Zhang
OffRL
29
0
0
16 May 2025
Electric Bus Charging Schedules Relying on Real Data-Driven Targets Based on Hierarchical Deep Reinforcement Learning
Jiaju Qi
Lei Lei
Thorsteinn Jonsson
L. Hanzo
18
0
0
15 May 2025
Scaling Multi Agent Reinforcement Learning for Underwater Acoustic Tracking via Autonomous Vehicles
Matteo Gallici
Ivan Masmitja
Mario Martin
OffRL
29
0
0
13 May 2025
Multi-source Plume Tracing via Multi-Agent Reinforcement Learning
Pedro Antonio Alarcon Granadeno
Theodore Chambers
J. Cleland-Huang
AI4CE
23
0
0
12 May 2025
Reinforcement Learning for Game-Theoretic Resource Allocation on Graphs
Zijian An
Lifeng Zhou
36
0
0
08 May 2025
Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks: The GATTACA Framework
Andrzej Mizera
Jakub Zarzycki
GNN
AI4CE
45
0
0
05 May 2025
Integration of Multi-Mode Preference into Home Energy Management System Using Deep Reinforcement Learning
Mohammed Sumayli
Olugbenga Moses Anubi
26
0
0
02 May 2025
Uncertainty-aware Latent Safety Filters for Avoiding Out-of-Distribution Failures
Junwon Seo
Kensuke Nakamura
Andrea V. Bajcsy
56
0
0
01 May 2025
Approximation to Deep Q-Network by Stochastic Delay Differential Equations
Jianya Lu
Yingjun Mo
38
0
0
01 May 2025
Q-function Decomposition with Intervention Semantics with Factored Action Spaces
Junkyu Lee
Tian Gao
Elliot Nelson
Miao Liu
D. Bhattacharjya
Songtao Lu
OffRL
55
0
0
30 Apr 2025
Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A Survey
Mohamad Abdul Hady
Siyi Hu
Mahardhika Pratama
Jimmy Cao
Ryszard Kowalczyk
29
0
0
29 Apr 2025
Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving
Alkis Sygkounas
Ioannis Athanasiadis
A. Persson
Michael Felsberg
Amy Loutfi
OffRL
38
0
0
28 Apr 2025
Cracking the Code of Action: a Generative Approach to Affordances for Reinforcement Learning
Lynn Cherif
Flemming Kondrup
David Venuto
Ankit Anand
Doina Precup
Khimya Khetarpal
LM&Ro
54
0
0
24 Apr 2025
HF4Rec: Human-Like Feedback-Driven Optimization Framework for Explainable Recommendation
Jiakai Tang
Jingsen Zhang
Zihang Tian
Xueyang Feng
Lei Wang
Xu Chen
OffRL
243
0
0
19 Apr 2025
Deep Reinforcement Learning Algorithms for Option Hedging
Andrei Neagu
Frédéric Godin
Leila Kosseim
28
0
0
07 Apr 2025
Handover and SINR-Aware Path Optimization in 5G-UAV mmWave Communication using DRL
Achilles Machumilane
A. Gotta
P. Cassará
33
0
0
03 Apr 2025
RL2Grid: Benchmarking Reinforcement Learning in Power Grid Operations
Enrico Marchesini
Benjamin Donnot
Constance Crozier
Ian Dytham
Christian Merz
Lars Schewe
Nico Westerbeck
Cathy Wu
Antoine Marot
P. Donti
OffRL
59
1
0
29 Mar 2025
Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound
Yuhao Huang
Ao Chang
Haoran Dou
X. Tao
Xinrui Zhou
...
Ruobing Huang
Alejandro F Frangi
Lingyun Bao
Xin Yang
Dong Ni
96
1
0
26 Mar 2025
A Generalist Hanabi Agent
Arjun Vaithilingam Sudhakar
Hadi Nekoei
Mathieu Reymond
Miao Liu
Janarthanan Rajendran
Sarath Chandar
271
0
0
17 Mar 2025
Flexible Prefrontal Control over Hippocampal Episodic Memory for Goal-Directed Generalization
Yicong Zheng
Nora Wolf
Charan Ranganath
R. C. O'Reilly
Kevin L McKee
42
0
0
04 Mar 2025
HWC-Loco: A Hierarchical Whole-Body Control Approach to Robust Humanoid Locomotion
Sixu Lin
Guanren Qiao
Yunxin Tai
Ang Li
Kui Jia
Guiliang Liu
41
0
0
02 Mar 2025
Reinforcement learning with combinatorial actions for coupled restless bandits
Lily Xu
Bryan Wilder
Elias B. Khalil
Milind Tambe
75
1
0
01 Mar 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
V. Cevher
95
0
0
27 Feb 2025
Shield Synthesis for LTL Modulo Theories
Andoni Rodríguez
Guy Amir
Davide Corsi
César Sánchez
Guy Katz
84
6
0
17 Feb 2025
Spatial-aware decision-making with ring attractors in reinforcement learning systems
Marcos Negre Saura
Richard Allmendinger
Theodore Papamarkou
Wei Pan
252
0
0
17 Feb 2025
Task Offloading in Vehicular Edge Computing using Deep Reinforcement Learning: A Survey
Ashab Uddin
Ahmed Hamdi Sakr
Ning Zhang
OffRL
62
0
0
10 Feb 2025
DECAF: Learning to be Fair in Multi-agent Resource Allocation
Ashwin Kumar
William Yeoh
92
1
0
06 Feb 2025
RLOMM: An Efficient and Robust Online Map Matching Framework with Reinforcement Learning
Minxiao Chen
Haitao Yuan
Nan Jiang
Zhihan Zheng
Sai Wu
Ao Zhou
Shuaiqiang Wang
53
0
0
05 Feb 2025
Process-Supervised Reinforcement Learning for Code Generation
Yufan Ye
Ting Zhang
Wenbin Jiang
Hua Huang
OffRL
LRM
SyDa
65
1
0
03 Feb 2025
Optimizing Job Allocation using Reinforcement Learning with Graph Neural Networks
Lars C.P.M. Quaedvlieg
68
0
0
31 Jan 2025
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
92
1
0
28 Jan 2025
Divergence-Augmented Policy Optimization
Qing Wang
Yingru Li
Jiechao Xiong
Tong Zhang
OffRL
55
16
0
28 Jan 2025
RLER-TTE: An Efficient and Effective Framework for En Route Travel Time Estimation with Reinforcement Learning
Zhihan Zheng
Haitao Yuan
Minxiao Chen
Shangguang Wang
AI4TS
86
1
0
28 Jan 2025
Extensive Exploration in Complex Traffic Scenarios using Hierarchical Reinforcement Learning
Zhihao Zhang
Ekim Yurtsever
Keith A. Redmill
38
0
0
28 Jan 2025
Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework
Yulong Hu
Tingting Dong
Sen Li
OffRL
OnRL
67
0
0
24 Jan 2025
Revisiting Ensemble Methods for Stock Trading and Crypto Trading Tasks at ACM ICAIF FinRL Contest 2023-2024
Nikolaus Holzer
Keyi Wang
Kairong Xiao
Xiao-Yang Liu Yanglet
AIFin
35
1
0
18 Jan 2025
Risk-averse policies for natural gas futures trading using distributional reinforcement learning
Félicien Hêche
Biagio Nigro
Oussama Barakat
Stephan Robert-Nicoud
OffRL
49
0
0
08 Jan 2025
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Han Zhang
Xiaoman Pan
Hongwei Wang
Kaixin Ma
Wenhao Yu
Dong Yu
LLMAG
72
3
0
03 Jan 2025
Multi-Agent Q-Learning for Real-Time Load Balancing User Association and Handover in Mobile Networks
Alireza Alizadeh
Byungju Lim
Mai Vu
37
4
0
31 Dec 2024
Adaptive Reward Design for Reinforcement Learning
Minjae Kwon
Ingy Elsayed-Aly
Lu Feng
80
2
0
14 Dec 2024
JPPO: Joint Power and Prompt Optimization for Accelerated Large Language Model Services
Feiran You
Hongyang Du
Kaibin Huang
Abbas Jamalipour
89
2
0
27 Nov 2024
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
Shiron Thalagala
Pak Kin Wong
Xiaozheng Wang
Tianang Sun
OffRL
81
0
0
24 Nov 2024
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
43
0
0
06 Nov 2024
Toward Automated Algorithm Design: A Survey and Practical Guide to Meta-Black-Box-Optimization
Zeyuan Ma
Hongshu Guo
Yue-jiao Gong
Jun Zhang
Kay Chen Tan
128
2
0
01 Nov 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
46
0
0
27 Oct 2024
Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach
Henrique Donâncio
Antoine Barrier
Leah F. South
Florence Forbes
33
0
0
16 Oct 2024
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making
Stelios Triantafyllou
A. Sukovic
Yasaman Zolfimoselo
Goran Radanović
CML
40
0
0
16 Oct 2024