Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.06461
Cited By
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
50 / 933 papers shown
Title
Fish-inspired tracking of underwater turbulent plumes
Peter Gunnarson
J. Dabiri
35
4
0
10 Mar 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
47
3
0
09 Mar 2024
Koopman-Assisted Reinforcement Learning
Preston Rozwood
Edward Mehrez
Ludger Paehler
Wen Sun
Steven L. Brunton
45
7
0
04 Mar 2024
EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation
Yuqiao Wen
Behzad Shayegh
Chenyang Huang
Yanshuai Cao
Lili Mou
63
5
0
29 Feb 2024
Multistatic-Radar RCS-Signature Recognition of Aerial Vehicles: A Bayesian Fusion Approach
Michael Potter
M. Akçakaya
Marius Necsoiu
G. Schirner
Deniz Erdogmus
Tales Imbiriba
38
2
0
28 Feb 2024
Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2)
Qifeng Li
Xiaosong Jia
Shaobo Wang
Junchi Yan
48
28
0
26 Feb 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji
Yongyuan Liang
Yan Zeng
Yu-Juan Luo
Guowei Xu
Jiawei Guo
Ruijie Zheng
Furong Huang
Gang Hua
Huazhe Xu
CML
55
11
0
22 Feb 2024
Computation Offloading for Multi-server Multi-access Edge Vehicular Networks: A DDQN-based Method
Siyu Wang
Bo Yang
Zhiwen Yu
Xuelin Cao
Yan Zhang
Chau Yuen
19
0
0
21 Feb 2024
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
Anya Sims
Cong Lu
Yee Whye Teh
OffRL
41
3
0
19 Feb 2024
Symmetry-Breaking Augmentations for Ad Hoc Teamwork
Ravi Hammond
Dustin Craggs
Mingyu Guo
Jakob Foerster
Ian Reid
34
1
0
15 Feb 2024
Deep Reinforcement Learning for Controlled Traversing of the Attractor Landscape of Boolean Models in the Context of Cellular Reprogramming
Andrzej Mizera
Jakub Zarzycki
27
0
0
13 Feb 2024
Scaling Artificial Intelligence for Digital Wargaming in Support of Decision-Making
Scotty Black
Christian J. Darken
19
2
0
08 Feb 2024
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Yingru Li
Jiawei Xu
Lei Han
Zhi-Quan Luo
BDL
OffRL
36
5
0
05 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
36
8
0
02 Feb 2024
Neural Style Transfer with Twin-Delayed DDPG for Shared Control of Robotic Manipulators
R. Fernandez-Fernandez
Marco Aggravi
P. Giordano
J. Victores
C. Pacchierotti
38
4
0
01 Feb 2024
Transferring human emotions to robot motions using Neural Policy Style Transfer
R. Fernandez-Fernandez
Bartek Łukawski
J. Victores
C. Pacchierotti
29
22
0
01 Feb 2024
A comparison of RL-based and PID controllers for 6-DOF swimming robots: hybrid underwater object tracking
F. Lotfi
K. Virji
Nicholas Dudek
Gregory Dudek
27
0
0
29 Jan 2024
Regularized Q-Learning with Linear Function Approximation
Jiachen Xi
Alfredo Garcia
P. Momcilovic
42
2
0
26 Jan 2024
Modeling and Optimization of Epidemiological Control Policies Through Reinforcement Learning
Ishir Rao
21
1
0
25 Jan 2024
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems
Changshuo Zhang
Sirui Chen
Xiao Zhang
Sunhao Dai
Weijie Yu
Jun Xu
OffRL
45
1
0
17 Jan 2024
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
31
22
0
17 Jan 2024
Decision Making in Non-Stationary Environments with Policy-Augmented Search
Ava Pettet
Yunuo Zhang
Baiting Luo
Kyle Wray
Hendrik Baier
Aron Laszka
Abhishek Dubey
Ayan Mukhopadhyay
22
4
0
06 Jan 2024
RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems
Jiahong Zhou
Shunhui Mao
Guoliang Yang
Bo Tang
Qianlong Xie
Lebin Lin
Xingxing Wang
Dong Wang
37
8
0
27 Dec 2023
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
31
0
0
24 Dec 2023
Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning
Md Saiful Islam
Srijita Das
S. Gottipati
William Duguay
Clodéric Mars
Jalal Arabneydi
Antoine Fagette
Matthew J. Guzdial
Matthew E. Taylor
41
1
0
23 Dec 2023
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects
Min Hua
Dong Chen
Xinda Qi
Kun Jiang
Z. Liu
Quan Zhou
Hongming Xu
33
10
0
18 Dec 2023
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward
Hao-Chu Lin
Hongqiu Wu
Jiaji Zhang
Yihao Sun
Junyin Ye
Yang Yu
34
2
0
17 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
40
8
0
15 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
80
5
0
13 Dec 2023
Unsupervised Social Event Detection via Hybrid Graph Contrastive Learning and Reinforced Incremental Clustering
Yuanyuan Guo
Zehua Zang
Hang Gao
Xiao Xu
Rui Wang
Lixiang Liu
Jiangmeng Li
39
6
0
08 Dec 2023
Digital Twin-Enhanced Deep Reinforcement Learning for Resource Management in Networks Slicing
Zhengming Zhang
Yongming Huang
Cheng Zhang
Qingbi Zheng
Luxi Yang
Xiaohu You
34
12
0
28 Nov 2023
Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning
Yaoquan Wei
Shunyu Liu
Mingli Song
Tongya Zheng
Kaixuan Chen
Yong Wang
Mingli Song
30
0
0
28 Nov 2023
Mission-driven Exploration for Accelerated Deep Reinforcement Learning with Temporal Logic Task Specifications
Jun Wang
Hosein Hasanbeig
Kaiyuan Tan
Zihe Sun
Y. Kantaros
40
3
0
28 Nov 2023
From Images to Connections: Can DQN with GNNs learn the Strategic Game of Hex?
Yannik Keller
Jannis Blüml
Gopika Sudhakaran
Kristian Kersting
GNN
40
0
0
22 Nov 2023
Autonomous Port Navigation With Ranging Sensors Using Model-Based Reinforcement Learning
Siemen Herremans
Ali Anwar
Arne Troch
Ian Ravijts
Maarten Vangeneugden
Siegfried Mercelis
P. Hellinckx
33
1
0
17 Nov 2023
Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning
F. D. Lellis
M. Coraggio
G. Russo
Mirco Musolesi
Mario di Bernardo
OffRL
35
4
0
16 Nov 2023
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
Xuzhe Dang
Stefan Edelkamp
42
4
0
06 Nov 2023
Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning
M. Gerstgrasser
Tom Danino
Sarah Keren
34
5
0
01 Nov 2023
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
40
1
0
30 Oct 2023
TD-MPC2: Scalable, Robust World Models for Continuous Control
Nicklas Hansen
Hao Su
Xiaolong Wang
MU
48
132
0
25 Oct 2023
Combining Policy Gradient and Safety-Based Control for Autonomous Driving
Xi Xiong
Lu Liu
29
0
0
20 Oct 2023
LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework
Woojun Kim
Jeonghye Kim
Young-Jin Sung
28
5
0
05 Oct 2023
Deep reinforcement learning for machine scheduling: Methodology, the state-of-the-art, and future directions
Maziyar Khadivi
Todd Charter
Marjan Yaghoubi
Masoud Jalayer
Maryam Ahang
Ardeshir Shojaeinasab
Homayoun Najjaran
40
11
0
04 Oct 2023
Multi-Agent Reinforcement Learning Based on Representational Communication for Large-Scale Traffic Signal Control
Rohit Bokade
Xiaoning Jin
Chris Amato
40
10
0
03 Oct 2023
A General Offline Reinforcement Learning Framework for Interactive Recommendation
Teng Xiao
Donglin Wang
OffRL
44
73
0
01 Oct 2023
Gray-box Adversarial Attack of Deep Reinforcement Learning-based Trading Agents
Foozhan Ataiefard
Hadi Hemmati
AAML
29
2
0
26 Sep 2023
Adapting Double Q-Learning for Continuous Reinforcement Learning
Arsenii Kuznetsov
OffRL
OnRL
40
0
0
25 Sep 2023
Double Deep Q-Learning-based Path Selection and Service Placement for Latency-Sensitive Beyond 5G Applications
Masoud Shokrnezhad
T. Taleb
Patrizio Dazzi
8
9
0
18 Sep 2023
Enhancing the Performance of Multi-Agent Reinforcement Learning for Controlling HVAC Systems
Daniel R. Bayer
M. Pruckner
AI4CE
27
7
0
13 Sep 2023
R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous Robotics
Zexin Li
Aritra Samanta
Yufei Li
Andrea Soltoggio
Hyoseung Kim
Cong Liu
39
6
0
29 Aug 2023
Previous
1
2
3
4
5
6
...
17
18
19
Next