1509.06461
Cited By
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Papers citing "Deep Reinforcement Learning with Double Q-learning"
Showing 50 of 2,291 papers
Title
Robust Q-Learning for finite ambiguity sets
Cécile Decker
Julian Sester
97
1
0
05 Jul 2024
DRLQ: A Deep Reinforcement Learning-based Task Placement for Quantum Cloud Computing
H. T. Nguyen
Muhammad Usman
Rajkumar Buyya
56
0
0
03 Jul 2024
Hybrid RAG-empowered Multi-modal LLM for Secure Healthcare Data Management: A Diffusion-based Contract Theory Approach
Cheng Su
Jinbo Wen
Jiawen Kang
Yonghua Wang
Hudan Pan
M. S. Hossain
MedIm
45
0
0
01 Jul 2024
Mental Modeling of Reinforcement Learning Agents by Language Models
Wenhao Lu
Xufeng Zhao
Josua Spisak
Jae Hee Lee
Stefan Wermter
LLMAG
LRM
LM&Ro
68
3
0
26 Jun 2024
When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning
C. Voelcker
Tyler Kastner
Igor Gilitschenski
Amir-massoud Farahmand
SSL
90
6
0
25 Jun 2024
KANQAS: Kolmogorov-Arnold Network for Quantum Architecture Search
Akash Kundu
Aritra Sarkar
Abhishek Sadhu
94
31
0
25 Jun 2024
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
J. Obando-Ceron
J. G. Araújo
Rameswar Panda
Pablo Samuel Castro
120
9
0
25 Jun 2024
OCALM: Object-Centric Assessment with Language Models
Timo Kaufmann
Johannes Czech
Antonia Wüst
Quentin Delfosse
Kristian Kersting
Eyke Hüllermeier
LM&Ro
LRM
88
1
0
24 Jun 2024
Understanding and Diagnosing Deep Reinforcement Learning
Ezgi Korkmaz
68
3
0
23 Jun 2024
Learning Abstract World Model for Value-preserving Planning with Options
Rafael Rodríguez-Sánchez
George Konidaris
86
1
0
22 Jun 2024
Learning to Select Goals in Automated Planning with Deep-Q Learning
Carlos Núñez-Molina
Juan Fernández-Olivares
Raúl Pérez
69
10
0
20 Jun 2024
Graph Neural Networks for Job Shop Scheduling Problems: A Survey
Igor G. Smit
Jianan Zhou
Robbert Reijnen
Yaoxin Wu
Jian Chen
Cong Zhang
Zaharah Bukhsh
Wim P. M. Nuijten
Yingqian Zhang
GNN
AI4CE
117
11
0
20 Jun 2024
Do Not Wait: Learning Re-Ranking Model Without User Feedback At Serving Time in E-Commerce
Yuan Wang
Zhiyu Li
Changshuo Zhang
Sirui Chen
Xiao Zhang
Jun Xu
Quan Lin
64
1
0
20 Jun 2024
Discovering Minimal Reinforcement Learning Environments
Jarek Liesen
Chris Xiaoxuan Lu
Andrei Lupu
Jakob N. Foerster
Henning Sprekeler
R. T. Lange
OffRL
92
4
0
18 Jun 2024
More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling
Haque Ishfaq
Yixin Tan
Yu Yang
Qingfeng Lan
Jianfeng Lu
A. Rupam Mahmood
Doina Precup
Pan Xu
89
5
0
18 Jun 2024
Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions
Noah Golowich
Ankur Moitra
OffRL
68
1
0
17 Jun 2024
Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction
Zepeng Ding
Ruiyang Ke
Wenhao Huang
Guochao Jiang
Yanda Li
Deqing Yang
Jiaqing Liang
89
1
0
17 Jun 2024
An Imitative Reinforcement Learning Framework for Autonomous Dogfight
Siyuan Li
Rongchang Zuo
Peng Liu
Yingnan Zhao
112
1
0
17 Jun 2024
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning
Hao Bai
Yifei Zhou
Mert Cemri
Jiayi Pan
Alane Suhr
Sergey Levine
Aviral Kumar
OffRL
111
65
0
14 Jun 2024
Finite-Time Analysis of Simultaneous Double Q-learning
Hyunjun Na
Donghwan Lee
68
0
0
14 Jun 2024
Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning
Xiaojun Bi
Mingjie He
Yiwen Sun
52
1
0
14 Jun 2024
CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms
Arda Sarp Yenicesu
Furkan B. Mutlu
Suleyman S. Kozat
Ozgur S. Oguz
29
1
0
13 Jun 2024
Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectors
Zhenglong Luo
Zhiyong Chen
James Welsh
37
1
0
12 Jun 2024
Bootstrapping Expectiles in Reinforcement Learning
Pierre Clavier
Emmanuel Rachelson
E. L. Pennec
Matthieu Geist
OffRL
81
0
0
06 Jun 2024
Exploring Pessimism and Optimism Dynamics in Deep Reinforcement Learning
Bahareh Tasdighi
Nicklas Werge
Yi-Shan Wu
M. Kandemir
30
0
0
06 Jun 2024
Quality-Diversity with Limited Resources
Ren-Jian Wang
Ke Xue
Cong Guan
Chao Qian
84
3
0
06 Jun 2024
Reflective Policy Optimization
Yaozhong Gan
Renye Yan
Zhe Wu
Junliang Xing
84
1
0
06 Jun 2024
Tabular and Deep Learning for the Whittle Index
Francisco Robledo Relaño
Vivek Borkar
U. Ayesta
Konstantin Avrachenkov
55
2
0
04 Jun 2024
Verifying the Generalization of Deep Learning to Out-of-Distribution Domains
Guy Amir
Osher Maayan
Tom Zelazny
Guy Katz
Michael Schapira
AAML
63
1
0
04 Jun 2024
Learning the Target Network in Function Space
Kavosh Asadi
Yao Liu
Shoham Sabach
Ming Yin
Rasool Fakoor
119
0
0
03 Jun 2024
Federated Learning-based Collaborative Wideband Spectrum Sensing and Scheduling for UAVs in UTM Systems
Sravan Reddy Chintareddy
Keenan Roach
Kenny Cheung
Morteza Hashemi
44
2
0
03 Jun 2024
A New View on Planning in Online Reinforcement Learning
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Martha White
OffRL
97
0
0
03 Jun 2024
Learning-based legged locomotion; state of the art and future perspectives
Sehoon Ha
Joonho Lee
M. van de Panne
Zhaoming Xie
Wenhao Yu
Majid Khadiv
144
20
0
03 Jun 2024
Deep reinforcement learning for weakly coupled MDP's with continuous actions
Francisco Robledo
U. Ayesta
Konstantin Avrachenkov
53
0
0
03 Jun 2024
Value Improved Actor Critic Algorithms
Yaniv Oren
Moritz A. Zanger
Pascal R. van der Vaart
M. Spaan
Wendelin Bohmer
OffRL
89
0
0
03 Jun 2024
REvolve: Reward Evolution with Large Language Models using Human Feedback
Rishi Hazra
Alkis Sygkounas
Andreas Persson
Amy Loutfi
Pedro Zuidberg Dos Martires
101
3
0
03 Jun 2024
Deep Reinforcement Learning for Sim-to-Real Policy Transfer of VTOL-UAVs Offshore Docking Operations
A. M. Ali
Aryaman Gupta
Hashim A. Hashim
OffRL
65
7
0
02 Jun 2024
Shared-unique Features and Task-aware Prioritized Sampling on Multi-task Reinforcement Learning
Po-Shao Lin
Jia-Fong Yeh
Yi-Ting Chen
Winston H. Hsu
85
0
0
02 Jun 2024
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
Zechu Li
Rickmer Krohn
Tao Chen
Anurag Ajay
Pulkit Agrawal
Georgia Chalvatzaki
DiffM
130
18
0
02 Jun 2024
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
Hyungho Na
IL-Chul Moon
67
1
0
30 May 2024
Safety through Permissibility: Shield Construction for Fast and Safe Reinforcement Learning
A. Politowicz
Sahisnu Mazumder
Bing-Quan Liu
58
0
0
29 May 2024
FDQN: A Flexible Deep Q-Network Framework for Game Automation
Prabhath Reddy Gujavarthy
OffRL
26
0
0
29 May 2024
DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime
Zhiyao Luo
Mingcheng Zhu
Fenglin Liu
Jiali Li
Yangchen Pan
Jiandong Zhou
Tingting Zhu
OffRL
62
3
0
28 May 2024
Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL
Yu-Juan Luo
Tianying Ji
Gang Hua
Jianwei Zhang
Huazhe Xu
Xianyuan Zhan
OffRL
OnRL
108
3
0
28 May 2024
Highway Reinforcement Learning
Yuhui Wang
M. Strupl
Francesco Faccio
Qingyuan Wu
Haozhe Liu
Michał Grudzień
Xiaoyang Tan
Jürgen Schmidhuber
OffRL
73
4
0
28 May 2024
Mollification Effects of Policy Gradient Methods
Tao Wang
Sylvia Herbert
Sicun Gao
96
1
0
28 May 2024
Interpretable DRL-based Maneuver Decision of UCAV Dogfight
H. Han
Jian Cheng
Maolong Lv
65
1
0
28 May 2024
Rethinking Transformers in Solving POMDPs
Chenhao Lu
Ruizhe Shi
Yuyao Liu
Kaizhe Hu
Simon S. Du
Huazhe Xu
AI4CE
117
3
0
27 May 2024
Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales
Ju-Seung Byun
Andrew Perrault
57
1
0
27 May 2024
Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space
Bangzheng Li
Ningshan Ma
Zifan Wang
44
0
1
26 May 2024