1509.06461
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Papers citing "Deep Reinforcement Learning with Double Q-learning"
50 / 928 papers shown
Title
HAVA: Hybrid Approach to Value-Alignment through Reward Weighing for Reinforcement Learning
Kryspin Varys
Federico Cerutti
Adam Sobey
Timothy J. Norman
28
0
0
21 May 2025
SAINT: Attention-Based Modeling of Sub-Action Dependencies in Multi-Action Policies
Matthew Landers
Taylor W. Killian
Thomas Hartvigsen
Afsaneh Doryab
32
0
0
17 May 2025
Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM
Thang Duong
Minglai Yang
Chicheng Zhang
OffRL
29
0
0
16 May 2025
Scalability of Reinforcement Learning Methods for Dispatching in Semiconductor Frontend Fabs: A Comparison of Open-Source Models with Real Industry Datasets
Patrick Stöckermann
Henning Südfeld
Alessandro Immordino
Thomas Altenmüller
Marc Wegmann
Martin Gebser
Konstantin Schekotihin
Georg Seidel
Chew Wye Chan
Fei Fei Zhang
OffRL
22
0
0
16 May 2025
Electric Bus Charging Schedules Relying on Real Data-Driven Targets Based on Hierarchical Deep Reinforcement Learning
Jiaju Qi
Lei Lei
Thorsteinn Jonsson
L. Hanzo
18
0
0
15 May 2025
Scaling Multi Agent Reinforcement Learning for Underwater Acoustic Tracking via Autonomous Vehicles
Matteo Gallici
Ivan Masmitja
Mario Martin
OffRL
29
0
0
13 May 2025
Multi-source Plume Tracing via Multi-Agent Reinforcement Learning
Pedro Antonio Alarcon Granadeno
Theodore Chambers
J. Cleland-Huang
AI4CE
23
0
0
12 May 2025
Reinforcement Learning for Game-Theoretic Resource Allocation on Graphs
Zijian An
Lifeng Zhou
36
0
0
08 May 2025
Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks: The GATTACA Framework
Andrzej Mizera
Jakub Zarzycki
GNN
AI4CE
45
0
0
05 May 2025
Integration of Multi-Mode Preference into Home Energy Management System Using Deep Reinforcement Learning
Mohammed Sumayli
Olugbenga Moses Anubi
24
0
0
02 May 2025
Approximation to Deep Q-Network by Stochastic Delay Differential Equations
Jianya Lu
Yingjun Mo
38
0
0
01 May 2025
Uncertainty-aware Latent Safety Filters for Avoiding Out-of-Distribution Failures
Junwon Seo
Kensuke Nakamura
Andrea V. Bajcsy
56
0
0
01 May 2025
Q-function Decomposition with Intervention Semantics with Factored Action Spaces
Junkyu Lee
Tian Gao
Elliot Nelson
Miao Liu
D. Bhattacharjya
Songtao Lu
OffRL
55
0
0
30 Apr 2025
Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A Survey
Mohamad Abdul Hady
Siyi Hu
Mahardhika Pratama
Jimmy Cao
Ryszard Kowalczyk
29
0
0
29 Apr 2025
Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving
Alkis Sygkounas
Ioannis Athanasiadis
A. Persson
Michael Felsberg
Amy Loutfi
OffRL
38
0
0
28 Apr 2025
Cracking the Code of Action: a Generative Approach to Affordances for Reinforcement Learning
Lynn Cherif
Flemming Kondrup
David Venuto
Ankit Anand
Doina Precup
Khimya Khetarpal
LM&Ro
54
0
0
24 Apr 2025
HF4Rec: Human-Like Feedback-Driven Optimization Framework for Explainable Recommendation
Jiakai Tang
Jingsen Zhang
Zihang Tian
Xueyang Feng
Lei Wang
Xu Chen
OffRL
243
0
0
19 Apr 2025
Deep Reinforcement Learning Algorithms for Option Hedging
Andrei Neagu
Frédéric Godin
Leila Kosseim
28
0
0
07 Apr 2025
Handover and SINR-Aware Path Optimization in 5G-UAV mmWave Communication using DRL
Achilles Machumilane
A. Gotta
P. Cassará
33
0
0
03 Apr 2025
RL2Grid: Benchmarking Reinforcement Learning in Power Grid Operations
Enrico Marchesini
Benjamin Donnot
Constance Crozier
Ian Dytham
Christian Merz
Lars Schewe
Nico Westerbeck
Cathy Wu
Antoine Marot
P. Donti
OffRL
59
1
0
29 Mar 2025
Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound
Yuhao Huang
Ao Chang
Haoran Dou
X. Tao
Xinrui Zhou
...
Ruobing Huang
Alejandro F Frangi
Lingyun Bao
Xin Yang
Dong Ni
96
1
0
26 Mar 2025
A Generalist Hanabi Agent
Arjun Vaithilingam Sudhakar
Hadi Nekoei
Mathieu Reymond
Miao Liu
Janarthanan Rajendran
Sarath Chandar
271
0
0
17 Mar 2025
Flexible Prefrontal Control over Hippocampal Episodic Memory for Goal-Directed Generalization
Yicong Zheng
Nora Wolf
Charan Ranganath
R. C. O'Reilly
Kevin L McKee
42
0
0
04 Mar 2025
HWC-Loco: A Hierarchical Whole-Body Control Approach to Robust Humanoid Locomotion
Sixu Lin
Guanren Qiao
Yunxin Tai
Ang Li
Kui Jia
Guiliang Liu
41
0
0
02 Mar 2025
Reinforcement learning with combinatorial actions for coupled restless bandits
Lily Xu
Bryan Wilder
Elias B. Khalil
Milind Tambe
75
1
0
01 Mar 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
V. Cevher
95
0
0
27 Feb 2025
Shield Synthesis for LTL Modulo Theories
Andoni Rodríguez
Guy Amir
Davide Corsi
César Sánchez
Guy Katz
84
6
0
17 Feb 2025
Spatial-aware decision-making with ring attractors in reinforcement learning systems
Marcos Negre Saura
Richard Allmendinger
Theodore Papamarkou
Wei Pan
252
0
0
17 Feb 2025
Task Offloading in Vehicular Edge Computing using Deep Reinforcement Learning: A Survey
Ashab Uddin
Ahmed Hamdi Sakr
Ning Zhang
OffRL
62
0
0
10 Feb 2025
DECAF: Learning to be Fair in Multi-agent Resource Allocation
Ashwin Kumar
William Yeoh
92
1
0
06 Feb 2025
RLOMM: An Efficient and Robust Online Map Matching Framework with Reinforcement Learning
Minxiao Chen
Haitao Yuan
Nan Jiang
Zhihan Zheng
Sai Wu
Ao Zhou
Shuaiqiang Wang
53
0
0
05 Feb 2025
Process-Supervised Reinforcement Learning for Code Generation
Yufan Ye
Ting Zhang
Wenbin Jiang
Hua Huang
OffRL
LRM
SyDa
65
1
0
03 Feb 2025
Optimizing Job Allocation using Reinforcement Learning with Graph Neural Networks
Lars C.P.M. Quaedvlieg
68
0
0
31 Jan 2025
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
92
1
0
28 Jan 2025
RLER-TTE: An Efficient and Effective Framework for En Route Travel Time Estimation with Reinforcement Learning
Zhihan Zheng
Haitao Yuan
Minxiao Chen
Shangguang Wang
AI4TS
86
1
0
28 Jan 2025
Divergence-Augmented Policy Optimization
Qing Wang
Yingru Li
Jiechao Xiong
Tong Zhang
OffRL
55
16
0
28 Jan 2025
Extensive Exploration in Complex Traffic Scenarios using Hierarchical Reinforcement Learning
Zhihao Zhang
Ekim Yurtsever
Keith A. Redmill
38
0
0
28 Jan 2025
Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework
Yulong Hu
Tingting Dong
Sen Li
OffRL
OnRL
67
0
0
24 Jan 2025
Revisiting Ensemble Methods for Stock Trading and Crypto Trading Tasks at ACM ICAIF FinRL Contest 2023-2024
Nikolaus Holzer
Keyi Wang
Kairong Xiao
Xiao-Yang Liu Yanglet
AIFin
35
1
0
18 Jan 2025
Risk-averse policies for natural gas futures trading using distributional reinforcement learning
Félicien Hêche
Biagio Nigro
Oussama Barakat
Stephan Robert-Nicoud
OffRL
49
0
0
08 Jan 2025
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Han Zhang
Xiaoman Pan
Hongwei Wang
Kaixin Ma
Wenhao Yu
Dong Yu
LLMAG
72
3
0
03 Jan 2025
Multi-Agent Q-Learning for Real-Time Load Balancing User Association and Handover in Mobile Networks
Alireza Alizadeh
Byungju Lim
Mai Vu
37
4
0
31 Dec 2024
Adaptive Reward Design for Reinforcement Learning
Minjae Kwon
Ingy Elsayed-Aly
Lu Feng
80
2
0
14 Dec 2024
JPPO: Joint Power and Prompt Optimization for Accelerated Large Language Model Services
Feiran You
Hongyang Du
Kaibin Huang
Abbas Jamalipour
89
2
0
27 Nov 2024
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
Shiron Thalagala
Pak Kin Wong
Xiaozheng Wang
Tianang Sun
OffRL
81
0
0
24 Nov 2024
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
43
0
0
06 Nov 2024
Toward Automated Algorithm Design: A Survey and Practical Guide to Meta-Black-Box-Optimization
Zeyuan Ma
Hongshu Guo
Yue-jiao Gong
Jun Zhang
Kay Chen Tan
128
2
0
01 Nov 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
44
0
0
27 Oct 2024
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making
Stelios Triantafyllou
A. Sukovic
Yasaman Zolfimoselo
Goran Radanović
CML
40
0
0
16 Oct 2024
Counterfactual Generative Modeling with Variational Causal Inference
Yulun Wu
Louie McConnell
Claudia Iriondo
CML
BDL
27
0
0
16 Oct 2024