Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.06461
Cited By
v1
v2
v3 (latest)
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
50 / 2,291 papers shown
Title
HF4Rec: Human-Like Feedback-Driven Optimization Framework for Explainable Recommendation
Jiakai Tang
Jingsen Zhang
Zihang Tian
Xueyang Feng
Lei Wang
Xu Chen
OffRL
406
0
0
19 Apr 2025
Dueling Deep Reinforcement Learning for Financial Time Series
Bruno Giorgio
AIFin
AI4TS
56
0
0
15 Apr 2025
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Ukjo Hwang
Songnam Hong
OffRL
78
0
0
14 Apr 2025
Pay Attention to What and Where? Interpretable Feature Extractor in Vision-based Deep Reinforcement Learning
Tien Pham
Angelo Cangelosi
67
1
0
14 Apr 2025
Supervised Optimism Correction: Be Confident When LLMs Are Sure
Jing Zhang
Rushuai Yang
Shunyu Liu
Ting-En Lin
Fei Huang
Yi Chen
Yongqian Li
Dacheng Tao
OffRL
91
0
0
10 Apr 2025
Deep Reinforcement Learning Algorithms for Option Hedging
Andrei Neagu
Frédéric Godin
Leila Kosseim
79
0
0
07 Apr 2025
NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices
Zhe Wang
Yifei Zhu
100
0
0
04 Apr 2025
Handover and SINR-Aware Path Optimization in 5G-UAV mmWave Communication using DRL
Achilles Machumilane
A. Gotta
P. Cassará
71
0
0
03 Apr 2025
FastFlow: Early Yet Robust Network Flow Classification using the Minimal Number of Time-Series Packets
Rushi Jayeshkumar Babaria
Minzhao Lyu
Gustavo E. A. P. A. Batista
V. Sivaraman
AI4TS
58
0
0
02 Apr 2025
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective
Zhuoren Li
Guizhe Jin
Ran Yu
Zhiwen Chen
Nan I. Li
...
Lu Xiong
Bo Leng
Jia Hu
Ilya Kolmanovsky
Dimitar Filev
106
0
0
31 Mar 2025
RL2Grid: Benchmarking Reinforcement Learning in Power Grid Operations
Enrico Marchesini
Benjamin Donnot
Constance Crozier
Ian Dytham
Christian Merz
Lars Schewe
Nico Westerbeck
Cathy Wu
Antoine Marot
P. Donti
OffRL
86
1
0
29 Mar 2025
On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations
Rajdeep Singh Hundal
Yan Xiao
Xiaochun Cao
Jin Song Dong
Manuel Rigger
134
0
0
28 Mar 2025
Controlling Large Language Model with Latent Actions
Chengxing Jia
Ziniu Li
Pengyuan Wang
Yi-Chen Li
Zhenyu Hou
Yuxiao Dong
Y. Yu
117
1
0
27 Mar 2025
Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound
Yuhao Huang
Ao Chang
Haoran Dou
X. Tao
Xinrui Zhou
...
Ruobing Huang
Alejandro F Frangi
Lingyun Bao
Xin Yang
Dong Ni
123
1
0
26 Mar 2025
FastFT: Accelerating Reinforced Feature Transformation via Advanced Exploration Strategies
Tianqi He
Xiaohan Huang
Yi Du
Qingqing Long
Ziyue Qiao
Min-Ying Wu
Yanjie Fu
Yuanchun Zhou
Meng Xiao
OffRL
154
3
0
26 Mar 2025
Reinforcement Learning-based Self-adaptive Differential Evolution through Automated Landscape Feature Learning
Hongshu Guo
Sijie Ma
Zechuan Huang
Yuzhi Hu
Zeyuan Ma
Xinglin Zhang
Yue-Jiao Gong
96
4
0
23 Mar 2025
Time After Time: Deep-Q Effect Estimation for Interventions on When and What to do
Yoav Wald
M. Goldstein
Yonathan Efroni
Wouter A. C. van Amsterdam
Rajesh Ranganath
CML
179
0
0
20 Mar 2025
A Generalist Hanabi Agent
Arjun Vaithilingam Sudhakar
Hadi Nekoei
Mathieu Reymond
Miao Liu
Janarthanan Rajendran
Sarath Chandar
481
0
0
17 Mar 2025
Automation and Feature Selection Enhancement with Reinforcement Learning (RL)
Sumana Sanyasipura Nagaraju
115
0
0
15 Mar 2025
Hierarchical Reinforcement Learning for Safe Mapless Navigation with Congestion Estimation
Jianqi Gao
Xizheng Pang
Qi Liu
Yanjie Li
101
0
0
15 Mar 2025
Rule-Guided Reinforcement Learning Policy Evaluation and Improvement
Martin Tappler
Ignacio D. Lopez-Miguel
Sebastian Tschiatschek
Ezio Bartocci
109
0
0
13 Mar 2025
Impoola: The Power of Average Pooling for Image-Based Deep Reinforcement Learning
Raphael Trumpp
Ansgar Schäfftlein
Mirco Theile
Marco Caccamo
102
1
0
07 Mar 2025
Boosting Offline Optimizers with Surrogate Sensitivity
Manh Cuong Dao
Phi Le Nguyen
Thao Nguyen Truong
Trong Nghia Hoang
OffRL
102
0
0
06 Mar 2025
Flexible Prefrontal Control over Hippocampal Episodic Memory for Goal-Directed Generalization
Yicong Zheng
Nora Wolf
Charan Ranganath
R. C. O'Reilly
Kevin L McKee
81
0
0
04 Mar 2025
A2Perf: Real-World Autonomous Agents Benchmark
Ikechukwu Uchendu
Jason J. Jabbour
Korneel Van den Berghe
Joel Runevic
Matthew P. Stewart
...
S. Guadarrama
Jie Tan
Jordan K. Terry
Aleksandra Faust
Vijay Janapa Reddi
91
0
0
04 Mar 2025
Enhancing Deep Reinforcement Learning-based Robot Navigation Generalization through Scenario Augmentation
Shanze Wang
Mingao Tan
Zhiyong Yang
Xinyu Wang
Xiaoyu Shen
Hailong Huang
Wei Zhang
102
0
0
03 Mar 2025
HWC-Loco: A Hierarchical Whole-Body Control Approach to Robust Humanoid Locomotion
Sixu Lin
Guanren Qiao
Yunxin Tai
Ang Li
Kui Jia
Guiliang Liu
120
2
0
02 Mar 2025
Reinforcement learning with combinatorial actions for coupled restless bandits
Lily Xu
Bryan Wilder
Elias B. Khalil
Milind Tambe
107
1
0
01 Mar 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
Volkan Cevher
190
1
0
27 Feb 2025
ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic Environments
Jinghao Xin
Zhichao Liang
Zihuan Zhang
Peng Wang
Ning Li
92
0
0
27 Feb 2025
Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit Assignment
Kartik Nagpal
Dayi Dong
Jean-Baptiste Bouvier
Negar Mehr
LLMAG
78
1
0
24 Feb 2025
Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition
Chuanguang Yang
Xinqiang Yu
Han Yang
Zhulin An
Chengqing Yu
Libo Huang
Yongjun Xu
108
3
0
22 Feb 2025
Fighter Jet Navigation and Combat using Deep Reinforcement Learning with Explainable AI
Swati Kar
Soumyabrata Dey
Mahesh K Banavar
Shahnewaz Karim Sakib
118
0
0
19 Feb 2025
Shield Synthesis for LTL Modulo Theories
Andoni Rodríguez
Guy Amir
Davide Corsi
César Sánchez
Guy Katz
124
7
0
17 Feb 2025
Spatial-aware decision-making with ring attractors in reinforcement learning systems
Marcos Negre Saura
Richard Allmendinger
Theodore Papamarkou
Wei Pan
467
0
0
17 Feb 2025
Intelligent Offloading in Vehicular Edge Computing: A Comprehensive Review of Deep Reinforcement Learning Approaches and Architectures
Ashab Uddin
Ahmed Hamdi Sakr
Ning Zhang
OffRL
104
0
0
10 Feb 2025
Deep Reinforcement Learning based Triggering Function for Early Classifiers of Time Series
Aurélien Renault
A. Bondu
Antoine Cornuéjols
Vincent Lemaire
81
0
0
10 Feb 2025
LRA-GNN: Latent Relation-Aware Graph Neural Network with Initial and Dynamic Residual for Facial Age Estimation
Yiping Zhang
Yuntao Shou
Wei Ai
Tao Meng
Keqin Li
CVBM
120
1
0
08 Feb 2025
Leveraging Constraint Violation Signals For Action-Constrained Reinforcement Learning
J. Brahmanage
Jiajing Ling
Akshat Kumar
151
0
0
08 Feb 2025
DECAF: Learning to be Fair in Multi-agent Resource Allocation
Ashwin Kumar
William Yeoh
159
1
0
06 Feb 2025
RLOMM: An Efficient and Robust Online Map Matching Framework with Reinforcement Learning
Minxiao Chen
Haitao Yuan
Nan Jiang
Zhihan Zheng
Sai Wu
Ao Zhou
Shuaiqiang Wang
217
0
0
05 Feb 2025
Process-Supervised Reinforcement Learning for Code Generation
Yufan Ye
Ting Zhang
Wenbin Jiang
Hua Huang
OffRL
LRM
SyDa
114
1
0
03 Feb 2025
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network
Jijia Liu
Feng Gao
Q. Liao
Chao Yu
Yu Wang
OffRL
174
0
0
01 Feb 2025
Optimizing Job Allocation using Reinforcement Learning with Graph Neural Networks
Lars C.P.M. Quaedvlieg
105
0
0
31 Jan 2025
Divergence-Augmented Policy Optimization
Qing Wang
Yingru Li
Jiechao Xiong
Tong Zhang
OffRL
174
16
0
28 Jan 2025
RLER-TTE: An Efficient and Effective Framework for En Route Travel Time Estimation with Reinforcement Learning
Zhihan Zheng
Haitao Yuan
Minxiao Chen
Shangguang Wang
AI4TS
127
2
0
28 Jan 2025
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
198
2
0
28 Jan 2025
Extensive Exploration in Complex Traffic Scenarios using Hierarchical Reinforcement Learning
Zhihao Zhang
Ekim Yurtsever
Keith A. Redmill
86
0
0
28 Jan 2025
Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework
Yulong Hu
Tingting Dong
Sen Li
OffRL
OnRL
114
1
0
24 Jan 2025
Revisiting Ensemble Methods for Stock Trading and Crypto Trading Tasks at ACM ICAIF FinRL Contest 2023-2024
Nikolaus Holzer
Keyi Wang
Kairong Xiao
Xiao-Yang Liu Yanglet
AIFin
84
1
0
18 Jan 2025
Previous
1
2
3
4
5
...
44
45
46
Next