Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.06461
Cited By
v1
v2
v3 (latest)
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
50 / 2,291 papers shown
Title
Scheduling Drone and Mobile Charger via Hybrid-Action Deep Reinforcement Learning
Jizhe Dou
Haotian Zhang
Guodong Sun
89
0
0
16 Mar 2024
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
Nicholas Zolman
Urban Fasel
J. Nathan Kutz
Steven L. Brunton
AI4CE
84
11
0
14 Mar 2024
Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning
Motoki Omura
Takayuki Osa
Yusuke Mukuta
Tatsuya Harada
OffRL
51
0
0
12 Mar 2024
An Improved Strategy for Blood Glucose Control Using Multi-Step Deep Reinforcement Learning
Weiwei Gu
Senquan Wang
65
7
0
12 Mar 2024
Ensembling Prioritized Hybrid Policies for Multi-agent Pathfinding
Huijie Tang
Federico Berto
Jinkyoo Park
106
4
0
12 Mar 2024
RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic Manipulations With Large Language Models
Liangliang Chen
Yutian Lei
Shiyu Jin
Ying Zhang
Liangjun Zhang
LM&Ro
105
12
0
11 Mar 2024
Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach
Narim Jeong
Donghwan Lee
27
1
0
11 Mar 2024
Fish-inspired tracking of underwater turbulent plumes
Peter Gunnarson
J. Dabiri
64
4
0
10 Mar 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
100
3
0
09 Mar 2024
Conservative DDPG -- Pessimistic RL without Ensemble
Nitsan Soffair
Shie Mannor
OffRL
47
0
0
08 Mar 2024
A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation
Di Zhang
Moyang Wang
Joseph D Mango
Xiang Li
Xianrui Xu
105
1
0
06 Mar 2024
Koopman-Assisted Reinforcement Learning
Preston Rozwood
Edward Mehrez
Ludger Paehler
Wen Sun
Steven L. Brunton
109
10
0
04 Mar 2024
Deep Reinforcement Learning for Dynamic Algorithm Selection: A Proof-of-Principle Study on Differential Evolution
Hongshu Guo
Yining Ma
Zeyuan Ma
Jiacheng Chen
Xinglin Zhang
Zhiguang Cao
Jun Zhang
Yue-Jiao Gong
100
23
0
04 Mar 2024
A Case for Validation Buffer in Pessimistic Actor-Critic
Michal Nauman
M. Ostaszewski
Marek Cygan
68
0
0
01 Mar 2024
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
Yifei Zhou
Andrea Zanette
Jiayi Pan
Sergey Levine
Aviral Kumar
146
79
0
29 Feb 2024
EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation
Yuqiao Wen
Behzad Shayegh
Chenyang Huang
Yanshuai Cao
Lili Mou
141
5
0
29 Feb 2024
Multistatic-Radar RCS-Signature Recognition of Aerial Vehicles: A Bayesian Fusion Approach
Owen Howell
M. Akçakaya
Marius Necsoiu
G. Schirner
Deniz Erdogmus
Tales Imbiriba
104
2
0
28 Feb 2024
Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2)
Qifeng Li
Xiaosong Jia
Shaobo Wang
Junchi Yan
124
34
0
26 Feb 2024
Reward Design for Justifiable Sequential Decision-Making
A. Sukovic
Goran Radanović
52
0
0
24 Feb 2024
Discretionary Lane-Change Decision and Control via Parameterized Soft Actor-Critic for Hybrid Action Space
Yuan Lin
Xiao Liu
Zishun Zheng
58
5
0
24 Feb 2024
Analysis of Off-Policy Multi-Step TD-Learning with Linear Function Approximation
Donghwan Lee
81
0
0
24 Feb 2024
A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning
Shuyu Yin
Qixuan Zhou
Fei Wen
Tao Luo
74
0
0
24 Feb 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji
Yongyuan Liang
Yan Zeng
Yu-Juan Luo
Guowei Xu
Jiawei Guo
Ruijie Zheng
Furong Huang
Gang Hua
Huazhe Xu
CML
111
12
0
22 Feb 2024
Automated Design and Optimization of Distributed Filtering Circuits via Reinforcement Learning
Peng Gao
Tao Yu
Fei Wang
Ruyue Yuan
32
1
0
22 Feb 2024
Computation Offloading for Multi-server Multi-access Edge Vehicular Networks: A DDQN-based Method
Siyu Wang
Bo Yang
Zhiwen Yu
Xuelin Cao
Yan Zhang
Chau Yuen
26
0
0
21 Feb 2024
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
Anya Sims
Cong Lu
Yee Whye Teh
OffRL
96
4
0
19 Feb 2024
In value-based deep reinforcement learning, a pruned network is a good network
J. Obando-Ceron
Rameswar Panda
Pablo Samuel Castro
OffRL
119
26
0
19 Feb 2024
Reinforcement learning to maximise wind turbine energy generation
Daniel Soler
O. Marino
D. Huergo
Martín de Frutos
Esteban Ferrer
50
0
0
17 Feb 2024
SINR-Aware Deep Reinforcement Learning for Distributed Dynamic Channel Allocation in Cognitive Interference Networks
Yaniv Cohen
Tomer Gafni
Ronen Greenberg
Kobi Cohen
36
5
0
17 Feb 2024
Symmetry-Breaking Augmentations for Ad Hoc Teamwork
Ravi Hammond
Dustin Craggs
Mingyu Guo
Jakob Foerster
Ian Reid
92
2
0
15 Feb 2024
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Michael Lanier
Ying Xu
Nathan Jacobs
Chongjie Zhang
Yevgeniy Vorobeychik
64
2
0
14 Feb 2024
FGeo-DRL: Deductive Reasoning for Geometric Problems through Deep Reinforcement Learning
Jia Zou
Xiaokai Zhang
Yiming He
Na Zhu
Tuo Leng
AIMat
AI4CE
LRM
102
4
0
14 Feb 2024
Deep Reinforcement Learning for Controlled Traversing of the Attractor Landscape of Boolean Models in the Context of Cellular Reprogramming
Andrzej Mizera
Jakub Zarzycki
69
1
0
13 Feb 2024
Scaling Intelligent Agents in Combat Simulations for Wargaming
Scotty Black
Christian J. Darken
20
1
0
08 Feb 2024
Scaling Artificial Intelligence for Digital Wargaming in Support of Decision-Making
Scotty Black
Christian J. Darken
21
2
0
08 Feb 2024
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization
Talha Bozkus
Urbashi Mitra
OffRL
72
5
0
08 Feb 2024
Analyzing Adversarial Inputs in Deep Reinforcement Learning
Davide Corsi
Guy Amir
Guy Katz
Alessandro Farinelli
AAML
63
7
0
07 Feb 2024
Pedestrian crossing decisions can be explained by bounded optimal decision-making under noisy visual perception
Yueyang Wang
Aravinda Ramakrishnan Srinivasan
Jussi P. P. Jokinen
Antti Oulasvirta
Gustav Markkula
50
2
0
06 Feb 2024
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences
Nikhil Kumar Singh
Indranil Saha
OffRL
35
0
0
05 Feb 2024
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Yingru Li
Jiawei Xu
Lei Han
Zhi-Quan Luo
BDL
OffRL
101
5
0
05 Feb 2024
Evading Deep Learning-Based Malware Detectors via Obfuscation: A Deep Reinforcement Learning Approach
Brian Etter
Junjie Hu
Mohammedreza Ebrahimi
Weifeng Li
Xin Li
Hsinchun Chen
87
1
0
04 Feb 2024
Device Scheduling and Assignment in Hierarchical Federated Learning for Internet of Things
Tinghao Zhang
Kwok-Yan Lam
Jun Zhao
83
10
0
04 Feb 2024
NetLLM: Adapting Large Language Models for Networking
Duo Wu
Xianda Wang
Yaqi Qiao
Zhi Wang
Junchen Jiang
Shuguang Cui
Fangxin Wang
94
48
0
04 Feb 2024
Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error
Haoran Li
Zicheng Zhang
Wang Luo
Congying Han
Yudong Hu
Tiande Guo
Shichen Liao
AAML
135
2
0
03 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
80
10
0
02 Feb 2024
Training Adversarial yet Safe Agent to Characterize Safety Performance of Highly Automated Vehicles
Minghao Zhu
Anmol Sidhu
Keith A. Redmill
53
0
0
02 Feb 2024
Neural Style Transfer with Twin-Delayed DDPG for Shared Control of Robotic Manipulators
R. Fernandez-Fernandez
Marco Aggravi
P. Giordano
J. Victores
C. Pacchierotti
130
4
0
01 Feb 2024
Transferring human emotions to robot motions using Neural Policy Style Transfer
R. Fernandez-Fernandez
Bartek Łukawski
J. Victores
C. Pacchierotti
71
22
0
01 Feb 2024
Control in Stochastic Environment with Delays: A Model-based Reinforcement Learning Approach
Zhiyuan Yao
Ionuţ Florescu
Chihoon Lee
OffRL
43
2
0
01 Feb 2024
SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language models
Xiao Shao
Weifu Jiang
Fei Zuo
Mengqing Liu
LLMAG
95
7
0
31 Jan 2024
Previous
1
2
3
...
6
7
8
...
44
45
46
Next