Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.06461
Cited By
v1
v2
v3 (latest)
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
50 / 2,291 papers shown
Title
Dual-Objective Reinforcement Learning with Novel Hamilton-Jacobi-Bellman Formulations
William Sharpless
Dylan Hirsch
S. Tonkens
Nikhil Shinde
Sylvia Herbert
15
0
0
19 Jun 2025
Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning
Roger Creus Castanyer
J. Obando-Ceron
Lu Li
Pierre-Luc Bacon
Glen Berseth
Aaron Courville
Pablo Samuel Castro
22
0
0
18 Jun 2025
Inverse design of the transmission matrix in a random system using Reinforcement Learning
Yuhao Kang
15
0
0
16 Jun 2025
Learning The Minimum Action Distance
Lorenzo Steccanella
Joshua B. Evans
Özgür Simsek
Anders Jonsson
23
0
0
10 Jun 2025
Intention-Conditioned Flow Occupancy Models
Chongyi Zheng
S. Park
Sergey Levine
Benjamin Eysenbach
AI4TS
OffRL
AI4CE
34
0
0
10 Jun 2025
CARoL: Context-aware Adaptation for Robot Learning
Zechen Hu
Tong Xu
Xuesu Xiao
Xuan Wang
25
0
0
08 Jun 2025
Reliable Critics: Monotonic Improvement and Convergence Guarantees for Reinforcement Learning
Eshwar S. R.
Gugan Thoppe
Aditya Gopalan
Gal Dalal
18
0
0
08 Jun 2025
A Stable Whitening Optimizer for Efficient Neural Network Training
Kevin Frans
Sergey Levine
Pieter Abbeel
35
0
0
08 Jun 2025
Ensemble Elastic DQN: A novel multi-step ensemble approach to address overestimation in deep value-based reinforcement learning
Adrian Ly
Richard Dazeley
Peter Vamplew
F. Cruz
Sunil Aryal
32
0
0
06 Jun 2025
Horizon Reduction Makes RL Scalable
Seohong Park
Kevin Frans
Deepinder Mann
Benjamin Eysenbach
Aviral Kumar
Sergey Levine
OffRL
92
0
0
04 Jun 2025
Maximizing the Promptness of Metaverse Systems using Edge Computing by Deep Reinforcement Learning
Tam Ninh Thi-Thanh
Trinh Van Chien
Hung Tran
Nguyen Hoai Son
Van Nhan Vo
OffRL
70
0
0
03 Jun 2025
Improving Performance of Spike-based Deep Q-Learning using Ternary Neurons
Aref Ghoreishee
Abhishek Mishra
John Walsh
Anup Das
Nagarajan Kandasamy
23
0
0
03 Jun 2025
The Actor-Critic Update Order Matters for PPO in Federated Reinforcement Learning
Zhijie Xie
Shenghui Song
53
0
0
02 Jun 2025
Learning Abstract World Models with a Group-Structured Latent Space
Thomas Delliaux
Nguyen-Khanh Vu
Vincent François-Lavet
Elise van der Pol
Emmanuel Rachelson
DRL
61
0
0
02 Jun 2025
Optimistic critics can empower small actors
Olya Mastikhina
Dhruv Sreenivas
Pablo Samuel Castro
56
0
0
01 Jun 2025
Q-learning with Posterior Sampling
Priyank Agrawal
Shipra Agrawal
Azmat Azati
OffRL
GP
32
1
0
01 Jun 2025
Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn
Hongyao Tang
J. Obando-Ceron
Pablo Samuel Castro
Aaron Courville
Glen Berseth
38
0
0
31 May 2025
Enhanced DACER Algorithm with High Diffusion Efficiency
Yinuo Wang
Mining Tan
Wenjun Zou
Haotian Lin
Xujie Song
...
Guojian Zhan
Tianze Zhu
Shiqi Liu
Jingliang Duan
Shengbo Eben Li
DiffM
76
0
0
29 May 2025
Reinforcement Learning-based Sequential Route Recommendation for System-Optimal Traffic Assignment
Leizhen Wang
Peibo Duan
Cheng Lyu
Zhenliang Ma
45
1
0
27 May 2025
Decision Flow Policy Optimization
Jifeng Hu
Sili Huang
Siyuan Guo
Zhaogeng Liu
Li Shen
Lichao Sun
Hechang Chen
Yi-Ju Chang
Dacheng Tao
68
0
0
26 May 2025
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Leander Diaz-Bone
Marco Bagatella
Jonas Hübotter
Andreas Krause
OffRL
77
0
0
26 May 2025
Distributionally Robust Deep Q-Learning
Chung I Lu
Julian Sester
Aijia Zhang
OOD
82
0
0
25 May 2025
Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Tao Wang
Ruipeng Zhang
Sicun Gao
OffRL
53
0
0
25 May 2025
Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models
Haoyuan Sun
Jiaqi Wu
Bo Xia
Yifu Luo
Yifei Zhao
Kai Qin
Xufei Lv
Tiantian Zhang
Yongzhe Chang
Xueqian Wang
OffRL
LRM
209
0
0
24 May 2025
LLM-Powered AI Agent Systems and Their Applications in Industry
Guannan Liang
Qianqian Tong
LLMAG
LM&Ro
83
3
0
22 May 2025
Hadamax Encoding: Elevating Performance in Model-Free Atari
Jacob E. Kooi
Zhao Yang
Vincent François-Lavet
81
1
0
21 May 2025
Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning
Yurun Yuan
Fan Chen
Zeyu Jia
Alexander Rakhlin
Tengyang Xie
OffRL
133
1
0
21 May 2025
GCNT: Graph-Based Transformer Policies for Morphology-Agnostic Reinforcement Learning
Yingbo Luo
Meibao Yao
Xueming Xiao
75
0
0
21 May 2025
HAVA: Hybrid Approach to Value-Alignment through Reward Weighing for Reinforcement Learning
Kryspin Varys
Federico Cerutti
Adam Sobey
Timothy J. Norman
54
0
0
21 May 2025
Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets
Idriss Malek
Abhijit Sharma
Salem Lahlou
89
1
0
21 May 2025
Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction
Yuanbo Wang
Zhaoxuan Zhang
Jiajin Qiu
Dilong Sun
Zhengyu Meng
Xiaopeng Wei
Xin Yang
87
0
0
19 May 2025
SAINT: Attention-Based Modeling of Sub-Action Dependencies in Multi-Action Policies
Matthew Landers
Taylor W. Killian
Thomas Hartvigsen
Afsaneh Doryab
61
0
0
17 May 2025
Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM
Thang Duong
Minglai Yang
Chicheng Zhang
OffRL
69
0
0
16 May 2025
Scalability of Reinforcement Learning Methods for Dispatching in Semiconductor Frontend Fabs: A Comparison of Open-Source Models with Real Industry Datasets
Patrick Stöckermann
Henning Südfeld
Alessandro Immordino
Thomas Altenmüller
Marc Wegmann
Martin Gebser
Konstantin Schekotihin
Georg Seidel
Chew Wye Chan
Fei Fei Zhang
OffRL
31
0
0
16 May 2025
Electric Bus Charging Schedules Relying on Real Data-Driven Targets Based on Hierarchical Deep Reinforcement Learning
Jiaju Qi
Lei Lei
Thorsteinn Jonsson
L. Hanzo
93
1
0
15 May 2025
Scaling Multi Agent Reinforcement Learning for Underwater Acoustic Tracking via Autonomous Vehicles
Matteo Gallici
Ivan Masmitja
Mario Martin
OffRL
65
0
0
13 May 2025
Multi-source Plume Tracing via Multi-Agent Reinforcement Learning
Pedro Antonio Alarcon Granadeno
Theodore Chambers
J. Cleland-Huang
AI4CE
44
0
0
12 May 2025
Reinforcement Learning for Game-Theoretic Resource Allocation on Graphs
Zijian An
Lifeng Zhou
51
0
0
08 May 2025
Unraveling the Rainbow: can value-based methods schedule?
Arthur Corrêa
Alexandre Jesus
Cristóvão Silva
Samuel Moniz
OffRL
72
0
0
06 May 2025
Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks: The GATTACA Framework
Andrzej Mizera
Jakub Zarzycki
GNN
AI4CE
81
0
0
05 May 2025
Integration of Multi-Mode Preference into Home Energy Management System Using Deep Reinforcement Learning
Mohammed Sumayli
Olugbenga Moses Anubi
62
0
0
02 May 2025
Uncertainty-aware Latent Safety Filters for Avoiding Out-of-Distribution Failures
Junwon Seo
Kensuke Nakamura
Andrea V. Bajcsy
125
0
0
01 May 2025
Approximation to Deep Q-Network by Stochastic Delay Differential Equations
Jianya Lu
Yingjun Mo
77
0
0
01 May 2025
Q-function Decomposition with Intervention Semantics with Factored Action Spaces
Junkyu Lee
Tian Gao
Elliot Nelson
Miao Liu
D. Bhattacharjya
Songtao Lu
OffRL
90
0
0
30 Apr 2025
Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A Survey
Mohamad Abdul Hady
Siyi Hu
Mahardhika Pratama
Jimmy Cao
Ryszard Kowalczyk
61
0
0
29 Apr 2025
Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving
Alkis Sygkounas
Ioannis Athanasiadis
Andreas Persson
Michael Felsberg
Amy Loutfi
OffRL
101
0
0
28 Apr 2025
Cracking the Code of Action: a Generative Approach to Affordances for Reinforcement Learning
Lynn Cherif
Flemming Kondrup
David Venuto
Ankit Anand
Doina Precup
Khimya Khetarpal
LM&Ro
199
0
0
24 Apr 2025
Bridging Econometrics and AI: VaR Estimation via Reinforcement Learning and GARCH Models
Fredy Pokou
Jules Sadefo Kamdem
François Benhmad
AIFin
69
0
0
23 Apr 2025
Enhancing Reinforcement learning in 3-Dimensional Hydrophobic-Polar Protein Folding Model with Attention-based layers
Peizheng Liu
Hitoshi Iba
67
0
0
22 Apr 2025
Symmetry-Preserving Architecture for Multi-NUMA Environments (SPANE): A Deep Reinforcement Learning Approach for Dynamic VM Scheduling
Tin Ping Chan
Yunlong Cheng
Yizhan Zhu
Xiaofeng Gao
Guihai Chen
61
0
0
21 Apr 2025
1
2
3
4
...
44
45
46
Next