ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.06461
  4. Cited By
Deep Reinforcement Learning with Double Q-learning
v1v2v3 (latest)

Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown
Title
Solving Multi-Goal Robotic Tasks with Decision Transformer
Solving Multi-Goal Robotic Tasks with Decision Transformer
Paul Gajewski
Dominik Zurek
Marcin Pietroñ
Kamil Faber
OffRL
61
1
0
08 Oct 2024
Learning in complex action spaces without policy gradients
Learning in complex action spaces without policy gradients
Arash Tavakoli
Sina Ghiassian
Nemanja Rakićević
OffRL
74
0
0
08 Oct 2024
Mitigating Adversarial Perturbations for Deep Reinforcement Learning via
  Vector Quantization
Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization
Tung M. Luu
Thanh Nguyen
Tee Joshua Tian Jin
Sungwoon Kim
Chang D. Yoo
AAML
83
0
0
04 Oct 2024
Edge Intelligence in Satellite-Terrestrial Networks with Hybrid Quantum
  Computing
Edge Intelligence in Satellite-Terrestrial Networks with Hybrid Quantum Computing
Siyue Huang
Lifeng Wang
Xin Wang
Bo Tan
Wei Ni
Kai-Kit Wong
59
1
0
30 Sep 2024
Learning to Bridge the Gap: Efficient Novelty Recovery with Planning and
  Reinforcement Learning
Learning to Bridge the Gap: Efficient Novelty Recovery with Planning and Reinforcement Learning
Alicia Li
Nishanth Kumar
Tomás Lozano-Pérez
Leslie Kaelbling
OffRL
91
0
0
28 Sep 2024
Energy-Efficient Computation with DVFS using Deep Reinforcement Learning for Multi-Task Systems in Edge Computing
Energy-Efficient Computation with DVFS using Deep Reinforcement Learning for Multi-Task Systems in Edge Computing
Xinyi Li
Ti Zhou
Haoyu Wang
Man Lin
65
2
0
28 Sep 2024
Development and Validation of Heparin Dosing Policies Using an Offline
  Reinforcement Learning Algorithm
Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm
Yooseok Lim
Inbeom Park
Sujee Lee
OffRL
55
0
0
24 Sep 2024
Reinforcement Feature Transformation for Polymer Property Performance
  Prediction
Reinforcement Feature Transformation for Polymer Property Performance Prediction
Xuanming Hu
Dongjie Wang
Wangyang Ying
Yanjie Fu
82
10
0
23 Sep 2024
Subassembly to Full Assembly: Effective Assembly Sequence Planning
  through Graph-based Reinforcement Learning
Subassembly to Full Assembly: Effective Assembly Sequence Planning through Graph-based Reinforcement Learning
Chang Shu
Anton Kim
Shinkyu Park
57
0
0
20 Sep 2024
Improving Soft-Capture Phase Success in Space Debris Removal Missions:
  Leveraging Deep Reinforcement Learning and Tactile Feedback
Improving Soft-Capture Phase Success in Space Debris Removal Missions: Leveraging Deep Reinforcement Learning and Tactile Feedback
Bahador Beigomi
Zheng H. Zhu
65
0
0
18 Sep 2024
Robust Reinforcement Learning with Dynamic Distortion Risk Measures
Robust Reinforcement Learning with Dynamic Distortion Risk Measures
Anthony Coache
S. Jaimungal
99
1
0
16 Sep 2024
Deep reinforcement learning for tracking a moving target in
  jellyfish-like swimming
Deep reinforcement learning for tracking a moving target in jellyfish-like swimming
Yihao Chen
Yue Yang
59
0
0
13 Sep 2024
Autonomous Vehicle Controllers From End-to-End Differentiable Simulation
Autonomous Vehicle Controllers From End-to-End Differentiable Simulation
Asen Nachkov
Danda Pani Paudel
Luc Van Gool
82
0
0
12 Sep 2024
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning
Shreyas S R
OffRLOnRL
73
0
0
10 Sep 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of
  Value and Policy Churn
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Hongyao Tang
Glen Berseth
OffRL
93
2
0
07 Sep 2024
Gaussian-Mixture-Model Q-Functions for Reinforcement Learning by
  Riemannian Optimization
Gaussian-Mixture-Model Q-Functions for Reinforcement Learning by Riemannian Optimization
Minh Vu
Konstantinos Slavakis
51
0
0
06 Sep 2024
Surgical Task Automation Using Actor-Critic Frameworks and
  Self-Supervised Imitation Learning
Surgical Task Automation Using Actor-Critic Frameworks and Self-Supervised Imitation Learning
Jingshuai Liu
Alain Andres
Yonghang Jiang
Xichun Luo
Wenmiao Shu
Sotirios A. Tsaftaris
127
0
0
04 Sep 2024
Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem
Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem
Constantin Waubert de Puiseau
Fabian Wolz
Merlin Montag
Jannik Peters
Hasan Tercan
Tobias Meisen
82
0
0
04 Sep 2024
AgGym: An agricultural biotic stress simulation environment for
  ultra-precision management planning
AgGym: An agricultural biotic stress simulation environment for ultra-precision management planning
Mahsa Khosravi
Matthew Carroll
Kai Liang Tan
Liza Van der Laan
Joscif Raigne
...
Arti Singh
Aditya Balu
Baskar Ganapathysubramanian
Asheesh Kumar Singh
Soumik Sarkar
OffRL
31
0
0
01 Sep 2024
A Tighter Convergence Proof of Reverse Experience Replay
A Tighter Convergence Proof of Reverse Experience Replay
Nan Jiang
Jinzhao Li
Yexiang Xue
61
0
0
30 Aug 2024
On Stateful Value Factorization in Multi-Agent Reinforcement Learning
On Stateful Value Factorization in Multi-Agent Reinforcement Learning
Enrico Marchesini
Andrea Baisero
Rupali Bhati
Christopher Amato
OffRL
82
3
0
27 Aug 2024
Localized Observation Abstraction Using Piecewise Linear Spatial Decay
  for Reinforcement Learning in Combat Simulations
Localized Observation Abstraction Using Piecewise Linear Spatial Decay for Reinforcement Learning in Combat Simulations
Scotty Black
Christian J. Darken
52
0
0
23 Aug 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
193
4
0
20 Aug 2024
Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey
Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey
Ruiqi Zhang
Jing Hou
Florian Walter
Shangding Gu
Jiayi Guan
Florian Röhrbein
Yali Du
Panpan Cai
G. Chen
Alois Knoll
131
15
0
19 Aug 2024
SynTraC: A Synthetic Dataset for Traffic Signal Control from Traffic
  Monitoring Cameras
SynTraC: A Synthetic Dataset for Traffic Signal Control from Traffic Monitoring Cameras
Tiejin Chen
Prithvi Shirke
Bharatesh Chakravarthi
Arpitsinh Vaghela
Longchao Da
Duo Lu
Yezhou Yang
Hua Wei
43
1
0
18 Aug 2024
Multilevel Graph Reinforcement Learning for Consistent Cognitive
  Decision-making in Heterogeneous Mixed Autonomy
Multilevel Graph Reinforcement Learning for Consistent Cognitive Decision-making in Heterogeneous Mixed Autonomy
Xin Gao
Zhaoyang Ma
Xueyuan Li
Xiaoqiang Meng
Zirui Li
61
0
0
16 Aug 2024
Integrating Saliency Ranking and Reinforcement Learning for Enhanced
  Object Detection
Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection
Matthias Bartolo
D. Seychell
Josef Bajada
44
1
0
13 Aug 2024
Leveraging Knowledge Graph-Based Human-Like Memory Systems to Solve
  Partially Observable Markov Decision Processes
Leveraging Knowledge Graph-Based Human-Like Memory Systems to Solve Partially Observable Markov Decision Processes
Taewoon Kim
Vincent François-Lavet
Michael Cochez
RALM
120
2
0
11 Aug 2024
RCDM: Enabling Robustness for Conditional Diffusion Model
RCDM: Enabling Robustness for Conditional Diffusion Model
Weifeng Xu
Xiang Zhu
Xiaoyong Li
AAML
75
0
0
05 Aug 2024
Review of Cloud Service Composition for Intelligent Manufacturing
Review of Cloud Service Composition for Intelligent Manufacturing
Cuixia Li
Liqiang Liu
Li Shi
33
0
0
03 Aug 2024
Multi-agent reinforcement learning for the control of three-dimensional
  Rayleigh-Bénard convection
Multi-agent reinforcement learning for the control of three-dimensional Rayleigh-Bénard convection
Mirko Conrad
Jean Rabault
Francisco Alcántara-Ávila
Mikael Mortensen
Ricardo Vinuesa
AI4CE
84
6
0
31 Jul 2024
How to Choose a Reinforcement-Learning Algorithm
How to Choose a Reinforcement-Learning Algorithm
Fabian Bongratz
Vladimir Golkov
Lukas Mautner
Luca Della Libera
Frederik Heetmeyer
Felix Czaja
Julian Rodemann
Daniel Cremers
68
1
0
30 Jul 2024
Boosting Efficiency in Task-Agnostic Exploration through Causal
  Knowledge
Boosting Efficiency in Task-Agnostic Exploration through Causal Knowledge
Yupei Yang
Erdun Gao
Shikui Tu
Lei Xu
CML
95
1
0
30 Jul 2024
Quantum Machine Learning Architecture Search via Deep Reinforcement
  Learning
Quantum Machine Learning Architecture Search via Deep Reinforcement Learning
Xin Dai
Tzu-Chieh Wei
Shinjae Yoo
Samuel Yen-Chi Chen
98
9
0
29 Jul 2024
NAVIX: Scaling MiniGrid Environments with JAX
NAVIX: Scaling MiniGrid Environments with JAX
Eduardo Pignatelli
Jarek Liesen
R. T. Lange
Chris Xiaoxuan Lu
Pablo Samuel Castro
Laura Toni
142
4
0
28 Jul 2024
SOAP-RL: Sequential Option Advantage Propagation for Reinforcement
  Learning in POMDP Environments
SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments
Shu Ishida
João F. Henriques
100
0
0
26 Jul 2024
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement
  Learning
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
Andrew Patterson
Samuel Neumann
Raksha Kumaraswamy
Martha White
Adam White
68
2
0
26 Jul 2024
MapTune: Advancing ASIC Technology Mapping via Reinforcement Learning
  Guided Library Tuning
MapTune: Advancing ASIC Technology Mapping via Reinforcement Learning Guided Library Tuning
Mingju Liu
Daniel Robinson
Yingjie Li
Cunxi Yu
47
0
0
25 Jul 2024
A Comparative Study of Deep Reinforcement Learning Models: DQN vs PPO vs
  A2C
A Comparative Study of Deep Reinforcement Learning Models: DQN vs PPO vs A2C
Neil De La Fuente
Daniel A. Vidal Guerra
OffRL
34
7
0
19 Jul 2024
PG-Rainbow: Using Distributional Reinforcement Learning in Policy
  Gradient Methods
PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods
WooJae Jeon
KanJun Lee
Jeewoo Lee
OffRL
32
0
0
18 Jul 2024
DITTO: A Visual Digital Twin for Interventions and Temporal Treatment
  Outcomes in Head and Neck Cancer
DITTO: A Visual Digital Twin for Interventions and Temporal Treatment Outcomes in Head and Neck Cancer
A. Wentzel
Serageldin Attia
Xinhua Zhang
G. Canahuate
Clifton Fuller
G. Marai
84
5
0
18 Jul 2024
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
Yi Zhang
Ruihong Qiu
Jiajun Liu
Sen Wang
OffRL
105
1
0
18 Jul 2024
Optimistic Q-learning for average reward and episodic reinforcement learning
Optimistic Q-learning for average reward and episodic reinforcement learning
Priyank Agrawal
Shipra Agrawal
121
6
0
18 Jul 2024
Navigating the Smog: A Cooperative Multi-Agent RL for Accurate Air
  Pollution Mapping through Data Assimilation
Navigating the Smog: A Cooperative Multi-Agent RL for Accurate Air Pollution Mapping through Data Assimilation
Ichrak Mokhtari
Walid Bechkit
Mohamed Sami Assenine
Hervé Rivano
AI4CE
49
1
0
17 Jul 2024
PID Accelerated Temporal Difference Algorithms
PID Accelerated Temporal Difference Algorithms
Mark Bedaywi
Amin Rakhsha
Amir-massoud Farahmand
77
1
0
11 Jul 2024
Periodic agent-state based Q-learning for POMDPs
Periodic agent-state based Q-learning for POMDPs
Amit Sinha
Mathieu Geist
Aditya Mahajan
86
0
0
08 Jul 2024
An open source Multi-Agent Deep Reinforcement Learning Routing Simulator
  for satellite networks
An open source Multi-Agent Deep Reinforcement Learning Routing Simulator for satellite networks
Federico Lozano-Cuadra
Mathias D. Thorsager
Israel Leyva Mayorga
B. Soret
89
1
0
08 Jul 2024
CANDID DAC: Leveraging Coupled Action Dimensions with Importance
  Differences in DAC
CANDID DAC: Leveraging Coupled Action Dimensions with Importance Differences in DAC
Philipp Bordne
M. A. Hasan
Eddie Bergman
Noor H. Awad
André Biedenkapp
116
1
0
08 Jul 2024
Aortic root landmark localization with optimal transport loss for
  heatmap regression
Aortic root landmark localization with optimal transport loss for heatmap regression
Tsuyoshi Ishizone
Masaki Miyasaka
Sae Ochi
Norio Tada
Kazuyuki Nakamura
70
0
0
06 Jul 2024
Augmented Bayesian Policy Search
Augmented Bayesian Policy Search
Mahdi Kallel
Debabrota Basu
R. Akrour
Carlo DÉramo
85
3
0
05 Jul 2024
Previous
12345...444546
Next