ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.06461
  4. Cited By
Deep Reinforcement Learning with Double Q-learning
v1v2v3 (latest)

Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown
Title
A comparison of RL-based and PID controllers for 6-DOF swimming robots:
  hybrid underwater object tracking
A comparison of RL-based and PID controllers for 6-DOF swimming robots: hybrid underwater object tracking
F. Lotfi
K. Virji
Nicholas Dudek
Gregory Dudek
64
0
0
29 Jan 2024
Regularized Q-Learning with Linear Function Approximation
Regularized Q-Learning with Linear Function Approximation
Jiachen Xi
Alfredo Garcia
P. Momcilovic
120
2
0
26 Jan 2024
Modeling and Optimization of Epidemiological Control Policies Through
  Reinforcement Learning
Modeling and Optimization of Epidemiological Control Policies Through Reinforcement Learning
Ishir Rao
38
1
0
25 Jan 2024
Symbolic Equation Solving via Reinforcement Learning
Symbolic Equation Solving via Reinforcement Learning
Lennart Dabelow
Masahito Ueda
84
2
0
24 Jan 2024
Multi-agent deep reinforcement learning with centralized training and
  decentralized execution for transportation infrastructure management
Multi-agent deep reinforcement learning with centralized training and decentralized execution for transportation infrastructure management
M. Saifullah
K. G. Papakonstantinou
C. Andriotis
S. M. Stoffels
AI4CE
101
2
0
23 Jan 2024
Constrained Reinforcement Learning for Adaptive Controller
  Synchronization in Distributed SDN
Constrained Reinforcement Learning for Adaptive Controller Synchronization in Distributed SDN
Ioannis Panitsas
Akrit Mudvari
Leandros Tassiulas
17
0
0
21 Jan 2024
Visual Imitation Learning with Calibrated Contrastive Representation
Visual Imitation Learning with Calibrated Contrastive Representation
Yunke Wang
Linwei Tao
Bo Du
Yutian Lin
Chang Xu
68
0
0
21 Jan 2024
Robotic Test Tube Rearrangement Using Combined Reinforcement Learning
  and Motion Planning
Robotic Test Tube Rearrangement Using Combined Reinforcement Learning and Motion Planning
Hao Chen
Weiwei Wan
Masaki Matsushita
Takeyuki Kotaka
Kensuke Harada
79
2
0
18 Jan 2024
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User
  Experiences in Recommender Systems
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems
Changshuo Zhang
Sirui Chen
Xiao Zhang
Sunhao Dai
Weijie Yu
Jun Xu
OffRL
103
1
0
17 Jan 2024
Bridging State and History Representations: Understanding
  Self-Predictive RL
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TSAI4CE
92
29
0
17 Jan 2024
REValueD: Regularised Ensemble Value-Decomposition for Factorisable
  Markov Decision Processes
REValueD: Regularised Ensemble Value-Decomposition for Factorisable Markov Decision Processes
David Ireland
Giovanni Montana
93
4
0
16 Jan 2024
Learned Best-Effort LLM Serving
Learned Best-Effort LLM Serving
Siddharth Jha
Coleman Hooper
Xiaoxuan Liu
Sehoon Kim
Kurt Keutzer
43
2
0
15 Jan 2024
Spatial-Aware Deep Reinforcement Learning for the Traveling Officer
  Problem
Spatial-Aware Deep Reinforcement Learning for the Traveling Officer Problem
Niklas Strauß
Matthias Schubert
42
0
0
11 Jan 2024
Towards Goal-Oriented Agents for Evolving Problems Observed via
  Conversation
Towards Goal-Oriented Agents for Evolving Problems Observed via Conversation
Michael Free
Andrew Langworthy
Mary Dimitropoulaki
Simon Thompson
LLMAG
86
2
0
11 Jan 2024
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents
Quentin Delfosse
Sebastian Sztwiertnia
M. Rothermel
Wolfgang Stammer
Kristian Kersting
134
20
0
11 Jan 2024
Unsupervised Salient Patch Selection for Data-Efficient Reinforcement
  Learning
Unsupervised Salient Patch Selection for Data-Efficient Reinforcement Learning
Zhaohui Jiang
Paul Weng
OffRL
67
0
0
10 Jan 2024
Decision Making in Non-Stationary Environments with Policy-Augmented
  Search
Decision Making in Non-Stationary Environments with Policy-Augmented Search
Ava Pettet
Yunuo Zhang
Baiting Luo
Kyle Wray
Hendrik Baier
Aron Laszka
Abhishek Dubey
Ayan Mukhopadhyay
51
4
0
06 Jan 2024
A Survey Analyzing Generalization in Deep Reinforcement Learning
A Survey Analyzing Generalization in Deep Reinforcement Learning
Ezgi Korkmaz
OffRL
66
3
0
04 Jan 2024
GLIDE-RL: Grounded Language Instruction through DEmonstration in RL
GLIDE-RL: Grounded Language Instruction through DEmonstration in RL
Chaitanya Kharyal
S. Gottipati
Tanmay Kumar Sinha
Srijita Das
Matthew E. Taylor
LLMAG
56
1
0
03 Jan 2024
Evaluation of automated driving system safety metrics with logged
  vehicle trajectory data
Evaluation of automated driving system safety metrics with logged vehicle trajectory data
Xintao Yan
Shuo Feng
David J. LeBlanc
Carol Flannagan
Henry X. Liu
34
3
0
03 Jan 2024
Energy-Efficient Power Control for Multiple-Task Split Inference in
  UAVs: A Tiny Learning-Based Approach
Energy-Efficient Power Control for Multiple-Task Split Inference in UAVs: A Tiny Learning-Based Approach
Chenxi Zhao
Min Sheng
Junyu Liu
Tianshu Chu
Jiandong Li
45
3
0
31 Dec 2023
Causal State Distillation for Explainable Reinforcement Learning
Causal State Distillation for Explainable Reinforcement Learning
Wenhao Lu
Xufeng Zhao
Thilo Fryen
Jae Hee Lee
Mengdi Li
S. Magg
Stefan Wermter
CML
83
2
0
30 Dec 2023
Dynamic Decision Making in Engineering System Design: A Deep Q-Learning
  Approach
Dynamic Decision Making in Engineering System Design: A Deep Q-Learning Approach
Ramin Giahi
Cameron A. MacKenzie
Reyhaneh Bijari
17
0
0
28 Dec 2023
RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation
  Allocation Approach for Recommender Systems
RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems
Jiahong Zhou
Shunhui Mao
Guoliang Yang
Bo Tang
Qianlong Xie
Lebin Lin
Xingxing Wang
Dong Wang
62
8
0
27 Dec 2023
Visual Spatial Attention and Proprioceptive Data-Driven Reinforcement
  Learning for Robust Peg-in-Hole Task Under Variable Conditions
Visual Spatial Attention and Proprioceptive Data-Driven Reinforcement Learning for Robust Peg-in-Hole Task Under Variable Conditions
André Yuji Yasutomi
Hideyuki Ichiwara
Hiroshi Ito
Hiroki Mori
Tetsuya Ogata
42
20
0
27 Dec 2023
A Bayesian Framework of Deep Reinforcement Learning for Joint O-RAN/MEC
  Orchestration
A Bayesian Framework of Deep Reinforcement Learning for Joint O-RAN/MEC Orchestration
Fahri Wisnu Murti
Samad Ali
Matti Latva-aho
65
0
0
26 Dec 2023
Conservative Exploration for Policy Optimization via Off-Policy Policy
  Evaluation
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
55
0
0
24 Dec 2023
Human-Centric Resource Allocation for the Metaverse With Multiaccess
  Edge Computing
Human-Centric Resource Allocation for the Metaverse With Multiaccess Edge Computing
Zijian Long
Haiwei Dong
Abdulmotaleb El Saddik
106
19
0
23 Dec 2023
Human-AI Collaboration in Real-World Complex Environment with
  Reinforcement Learning
Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning
Md Saiful Islam
Srijita Das
S. Gottipati
William Duguay
Clodéric Mars
Jalal Arabneydi
Antoine Fagette
Matthew J. Guzdial
Matthew E. Taylor
72
1
0
23 Dec 2023
Dynamic Routing for Integrated Satellite-Terrestrial Networks: A
  Constrained Multi-Agent Reinforcement Learning Approach
Dynamic Routing for Integrated Satellite-Terrestrial Networks: A Constrained Multi-Agent Reinforcement Learning Approach
Yifeng Lyu
Han Hu
Rongfei Fan
Zhi Liu
J. An
Shiwen Mao
82
17
0
23 Dec 2023
Machine learning for structure-guided materials and process design
Machine learning for structure-guided materials and process design
L. Morand
Tarek Iraki
Johannes Dornheim
Stefan Sandfeld
Norbert Link
Dirk Helm
AI4CE
19
5
0
22 Dec 2023
Multiagent Copilot Approach for Shared Autonomy between Human EEG and
  TD3 Deep Reinforcement Learning
Multiagent Copilot Approach for Shared Autonomy between Human EEG and TD3 Deep Reinforcement Learning
Chun-Ren Phang
Akimasa Hirata
24
0
0
22 Dec 2023
Rapid Open-World Adaptation by Adaptation Principles Learning
Rapid Open-World Adaptation by Adaptation Principles Learning
Cheng Xue
Ekaterina Nikonova
Peng Zhang
Jochen Renz
43
1
0
18 Dec 2023
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles
  Control: Recent Advancements and Future Prospects
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects
Min Hua
Dong Chen
Xinda Qi
Kun Jiang
Z. Liu
Quan Zhou
Hongming Xu
77
10
0
18 Dec 2023
Episodic Return Decomposition by Difference of Implicitly Assigned
  Sub-Trajectory Reward
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward
Hao-Chu Lin
Hongqiu Wu
Jiaji Zhang
Yihao Sun
Junyin Ye
Yang Yu
73
2
0
17 Dec 2023
Active Reinforcement Learning for Robust Building Control
Active Reinforcement Learning for Robust Building Control
Doseok Jang
Larry Yan
Lucas Spangher
C. Spanos
76
3
0
16 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
80
10
0
15 Dec 2023
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRLOOD
160
5
0
13 Dec 2023
FOSS: A Self-Learned Doctor for Query Optimizer
FOSS: A Self-Learned Doctor for Query Optimizer
Kai Zhong
Luming Sun
Tao Ji
Cuiping Li
Hong Chen
45
0
0
11 Dec 2023
Vision-based Learning for Drones: A Survey
Vision-based Learning for Drones: A Survey
Jiaping Xiao
Rangya Zhang
Yuhang Zhang
Mir Feroskhan
66
5
0
08 Dec 2023
Unsupervised Social Event Detection via Hybrid Graph Contrastive
  Learning and Reinforced Incremental Clustering
Unsupervised Social Event Detection via Hybrid Graph Contrastive Learning and Reinforced Incremental Clustering
Yuanyuan Guo
Zehua Zang
Hang Gao
Xiao Xu
Rui Wang
Lixiang Liu
Jiangmeng Li
85
8
0
08 Dec 2023
Efficient Parallel Reinforcement Learning Framework using the Reactor
  Model
Efficient Parallel Reinforcement Learning Framework using the Reactor Model
Jacky Kwok
Marten Lohstroh
Edward A. Lee
64
0
0
07 Dec 2023
MICRO: Model-Based Offline Reinforcement Learning with a Conservative
  Bellman Operator
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator
Xiao-Yin Liu
Xiao-Hu Zhou
Guo-Tao Li
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRL
90
6
0
07 Dec 2023
Pearl: A Production-ready Reinforcement Learning Agent
Pearl: A Production-ready Reinforcement Learning Agent
Zheqing Zhu
Rodrigo de Salvo Braz
Jalaj Bhandari
Daniel Jiang
Yi Wan
...
D. Korenkevych
Ürün Dogan
Frank Cheng
Zheng Wu
Wanqiao Xu
VLMOffRLOnRL
120
7
0
06 Dec 2023
Diffused Task-Agnostic Milestone Planner
Diffused Task-Agnostic Milestone Planner
Mineui Hong
Minjae Kang
Songhwai Oh
113
6
0
06 Dec 2023
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory
  Control
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control
Bernd Frauenknecht
Tobias Ehlgen
Sebastian Trimpe
85
4
0
30 Nov 2023
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy
  Evaluation
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation
Haruka Kiyohara
Ren Kishimoto
K. Kawakami
Ken Kobayashi
Kazuhide Nakata
Yuta Saito
OffRL
95
9
0
30 Nov 2023
Bias Resilient Multi-Step Off-Policy Goal-Conditioned Reinforcement
  Learning
Bias Resilient Multi-Step Off-Policy Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
64
0
0
29 Nov 2023
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
Daniel Bairamian
Philippe Marcotte
Joshua Romoff
Gabriel Robert
Derek Nowrouzezahrai
79
0
0
28 Nov 2023
Digital Twin-Enhanced Deep Reinforcement Learning for Resource
  Management in Networks Slicing
Digital Twin-Enhanced Deep Reinforcement Learning for Resource Management in Networks Slicing
Zhengming Zhang
Yongming Huang
Cheng Zhang
Qingbi Zheng
Luxi Yang
Xiaohu You
63
14
0
28 Nov 2023
Previous
123...789...444546
Next