Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.06461
Cited By
v1
v2
v3 (latest)
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
50 / 2,291 papers shown
Title
Solving Multi-Goal Robotic Tasks with Decision Transformer
Paul Gajewski
Dominik Zurek
Marcin Pietroñ
Kamil Faber
OffRL
61
1
0
08 Oct 2024
Learning in complex action spaces without policy gradients
Arash Tavakoli
Sina Ghiassian
Nemanja Rakićević
OffRL
74
0
0
08 Oct 2024
Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization
Tung M. Luu
Thanh Nguyen
Tee Joshua Tian Jin
Sungwoon Kim
Chang D. Yoo
AAML
83
0
0
04 Oct 2024
Edge Intelligence in Satellite-Terrestrial Networks with Hybrid Quantum Computing
Siyue Huang
Lifeng Wang
Xin Wang
Bo Tan
Wei Ni
Kai-Kit Wong
59
1
0
30 Sep 2024
Learning to Bridge the Gap: Efficient Novelty Recovery with Planning and Reinforcement Learning
Alicia Li
Nishanth Kumar
Tomás Lozano-Pérez
Leslie Kaelbling
OffRL
91
0
0
28 Sep 2024
Energy-Efficient Computation with DVFS using Deep Reinforcement Learning for Multi-Task Systems in Edge Computing
Xinyi Li
Ti Zhou
Haoyu Wang
Man Lin
65
2
0
28 Sep 2024
Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm
Yooseok Lim
Inbeom Park
Sujee Lee
OffRL
55
0
0
24 Sep 2024
Reinforcement Feature Transformation for Polymer Property Performance Prediction
Xuanming Hu
Dongjie Wang
Wangyang Ying
Yanjie Fu
82
10
0
23 Sep 2024
Subassembly to Full Assembly: Effective Assembly Sequence Planning through Graph-based Reinforcement Learning
Chang Shu
Anton Kim
Shinkyu Park
57
0
0
20 Sep 2024
Improving Soft-Capture Phase Success in Space Debris Removal Missions: Leveraging Deep Reinforcement Learning and Tactile Feedback
Bahador Beigomi
Zheng H. Zhu
65
0
0
18 Sep 2024
Robust Reinforcement Learning with Dynamic Distortion Risk Measures
Anthony Coache
S. Jaimungal
99
1
0
16 Sep 2024
Deep reinforcement learning for tracking a moving target in jellyfish-like swimming
Yihao Chen
Yue Yang
59
0
0
13 Sep 2024
Autonomous Vehicle Controllers From End-to-End Differentiable Simulation
Asen Nachkov
Danda Pani Paudel
Luc Van Gool
82
0
0
12 Sep 2024
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning
Shreyas S R
OffRL
OnRL
73
0
0
10 Sep 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Hongyao Tang
Glen Berseth
OffRL
93
2
0
07 Sep 2024
Gaussian-Mixture-Model Q-Functions for Reinforcement Learning by Riemannian Optimization
Minh Vu
Konstantinos Slavakis
51
0
0
06 Sep 2024
Surgical Task Automation Using Actor-Critic Frameworks and Self-Supervised Imitation Learning
Jingshuai Liu
Alain Andres
Yonghang Jiang
Xichun Luo
Wenmiao Shu
Sotirios A. Tsaftaris
127
0
0
04 Sep 2024
Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem
Constantin Waubert de Puiseau
Fabian Wolz
Merlin Montag
Jannik Peters
Hasan Tercan
Tobias Meisen
82
0
0
04 Sep 2024
AgGym: An agricultural biotic stress simulation environment for ultra-precision management planning
Mahsa Khosravi
Matthew Carroll
Kai Liang Tan
Liza Van der Laan
Joscif Raigne
...
Arti Singh
Aditya Balu
Baskar Ganapathysubramanian
Asheesh Kumar Singh
Soumik Sarkar
OffRL
31
0
0
01 Sep 2024
A Tighter Convergence Proof of Reverse Experience Replay
Nan Jiang
Jinzhao Li
Yexiang Xue
61
0
0
30 Aug 2024
On Stateful Value Factorization in Multi-Agent Reinforcement Learning
Enrico Marchesini
Andrea Baisero
Rupali Bhati
Christopher Amato
OffRL
82
3
0
27 Aug 2024
Localized Observation Abstraction Using Piecewise Linear Spatial Decay for Reinforcement Learning in Combat Simulations
Scotty Black
Christian J. Darken
52
0
0
23 Aug 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
193
4
0
20 Aug 2024
Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey
Ruiqi Zhang
Jing Hou
Florian Walter
Shangding Gu
Jiayi Guan
Florian Röhrbein
Yali Du
Panpan Cai
G. Chen
Alois Knoll
131
15
0
19 Aug 2024
SynTraC: A Synthetic Dataset for Traffic Signal Control from Traffic Monitoring Cameras
Tiejin Chen
Prithvi Shirke
Bharatesh Chakravarthi
Arpitsinh Vaghela
Longchao Da
Duo Lu
Yezhou Yang
Hua Wei
43
1
0
18 Aug 2024
Multilevel Graph Reinforcement Learning for Consistent Cognitive Decision-making in Heterogeneous Mixed Autonomy
Xin Gao
Zhaoyang Ma
Xueyuan Li
Xiaoqiang Meng
Zirui Li
61
0
0
16 Aug 2024
Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection
Matthias Bartolo
D. Seychell
Josef Bajada
44
1
0
13 Aug 2024
Leveraging Knowledge Graph-Based Human-Like Memory Systems to Solve Partially Observable Markov Decision Processes
Taewoon Kim
Vincent François-Lavet
Michael Cochez
RALM
120
2
0
11 Aug 2024
RCDM: Enabling Robustness for Conditional Diffusion Model
Weifeng Xu
Xiang Zhu
Xiaoyong Li
AAML
75
0
0
05 Aug 2024
Review of Cloud Service Composition for Intelligent Manufacturing
Cuixia Li
Liqiang Liu
Li Shi
33
0
0
03 Aug 2024
Multi-agent reinforcement learning for the control of three-dimensional Rayleigh-Bénard convection
Mirko Conrad
Jean Rabault
Francisco Alcántara-Ávila
Mikael Mortensen
Ricardo Vinuesa
AI4CE
84
6
0
31 Jul 2024
How to Choose a Reinforcement-Learning Algorithm
Fabian Bongratz
Vladimir Golkov
Lukas Mautner
Luca Della Libera
Frederik Heetmeyer
Felix Czaja
Julian Rodemann
Daniel Cremers
68
1
0
30 Jul 2024
Boosting Efficiency in Task-Agnostic Exploration through Causal Knowledge
Yupei Yang
Erdun Gao
Shikui Tu
Lei Xu
CML
95
1
0
30 Jul 2024
Quantum Machine Learning Architecture Search via Deep Reinforcement Learning
Xin Dai
Tzu-Chieh Wei
Shinjae Yoo
Samuel Yen-Chi Chen
98
9
0
29 Jul 2024
NAVIX: Scaling MiniGrid Environments with JAX
Eduardo Pignatelli
Jarek Liesen
R. T. Lange
Chris Xiaoxuan Lu
Pablo Samuel Castro
Laura Toni
142
4
0
28 Jul 2024
SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments
Shu Ishida
João F. Henriques
100
0
0
26 Jul 2024
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
Andrew Patterson
Samuel Neumann
Raksha Kumaraswamy
Martha White
Adam White
68
2
0
26 Jul 2024
MapTune: Advancing ASIC Technology Mapping via Reinforcement Learning Guided Library Tuning
Mingju Liu
Daniel Robinson
Yingjie Li
Cunxi Yu
47
0
0
25 Jul 2024
A Comparative Study of Deep Reinforcement Learning Models: DQN vs PPO vs A2C
Neil De La Fuente
Daniel A. Vidal Guerra
OffRL
34
7
0
19 Jul 2024
PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods
WooJae Jeon
KanJun Lee
Jeewoo Lee
OffRL
32
0
0
18 Jul 2024
DITTO: A Visual Digital Twin for Interventions and Temporal Treatment Outcomes in Head and Neck Cancer
A. Wentzel
Serageldin Attia
Xinhua Zhang
G. Canahuate
Clifton Fuller
G. Marai
84
5
0
18 Jul 2024
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
Yi Zhang
Ruihong Qiu
Jiajun Liu
Sen Wang
OffRL
105
1
0
18 Jul 2024
Optimistic Q-learning for average reward and episodic reinforcement learning
Priyank Agrawal
Shipra Agrawal
121
6
0
18 Jul 2024
Navigating the Smog: A Cooperative Multi-Agent RL for Accurate Air Pollution Mapping through Data Assimilation
Ichrak Mokhtari
Walid Bechkit
Mohamed Sami Assenine
Hervé Rivano
AI4CE
49
1
0
17 Jul 2024
PID Accelerated Temporal Difference Algorithms
Mark Bedaywi
Amin Rakhsha
Amir-massoud Farahmand
77
1
0
11 Jul 2024
Periodic agent-state based Q-learning for POMDPs
Amit Sinha
Mathieu Geist
Aditya Mahajan
86
0
0
08 Jul 2024
An open source Multi-Agent Deep Reinforcement Learning Routing Simulator for satellite networks
Federico Lozano-Cuadra
Mathias D. Thorsager
Israel Leyva Mayorga
B. Soret
89
1
0
08 Jul 2024
CANDID DAC: Leveraging Coupled Action Dimensions with Importance Differences in DAC
Philipp Bordne
M. A. Hasan
Eddie Bergman
Noor H. Awad
André Biedenkapp
116
1
0
08 Jul 2024
Aortic root landmark localization with optimal transport loss for heatmap regression
Tsuyoshi Ishizone
Masaki Miyasaka
Sae Ochi
Norio Tada
Kazuyuki Nakamura
70
0
0
06 Jul 2024
Augmented Bayesian Policy Search
Mahdi Kallel
Debabrota Basu
R. Akrour
Carlo DÉramo
85
3
0
05 Jul 2024
Previous
1
2
3
4
5
...
44
45
46
Next