ResearchTrend.AI
Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown
Average-Reward Reinforcement Learning with Entropy Regularization
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
OOD
80
2
0
17 Jan 2025
Online inductive learning from answer sets for efficient reinforcement learning exploration
Celeste Veronese
Daniele Meli
Alessandro Farinelli
OnRL
92
1
0
13 Jan 2025
Pareto Set Learning for Multi-Objective Reinforcement Learning
Erlong Liu
Yu-Chang Wu
Xiaobin Huang
Chengrui Gao
Ren-Jian Wang
Ke Xue
Chao Qian
OffRL
235
2
0
12 Jan 2025
Risk-averse policies for natural gas futures trading using distributional reinforcement learning
Félicien Hêche
Biagio Nigro
Oussama Barakat
Stephan Robert-Nicoud
OffRL
88
0
0
08 Jan 2025
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Han Zhang
Xiaoman Pan
Hongwei Wang
Kaixin Ma
Wenhao Yu
Dong Yu
LLMAG
144
4
0
03 Jan 2025
Multi-Agent Q-Learning for Real-Time Load Balancing User Association and Handover in Mobile Networks
Alireza Alizadeh
Byungju Lim
Mai Vu
96
5
0
31 Dec 2024
An Advantage-based Optimization Method for Reinforcement Learning in Large Action Space
Hai Lin
Cheng Huang
Zhihong Chen
OffRL
106
0
0
17 Dec 2024
Adaptive Reward Design for Reinforcement Learning
Minjae Kwon
Ingy Elsayed-Aly
Lu Feng
155
2
0
14 Dec 2024
TransferLight: Zero-Shot Traffic Signal Control on any Road-Network
J. Schmidt
Frank Dreyer
Sayed Abid Hashimi
Sebastian Stober
110
0
0
12 Dec 2024
Optimizing Plastic Waste Collection in Water Bodies Using Heterogeneous Autonomous Surface Vehicles with Deep Reinforcement Learning
Alejandro Mendoza Barrionuevo
S. Luis
Daniel Gutiérrez-Reina
S. T. Marín
92
1
0
03 Dec 2024
Conformal Symplectic Optimization for Stable Reinforcement Learning
Yao Lyu
Xiangteng Zhang
Shengbo Eben Li
Jingliang Duan
Letian Tao
Qing Xu
Lei He
Keqiang Li
174
0
0
03 Dec 2024
Integrating Transit Signal Priority into Multi-Agent Reinforcement Learning based Traffic Signal Control
Dickness Kwesiga
Suyash Chandra Vishnoi
Angshuman Guin
Michael Hunter
100
1
0
28 Nov 2024
Monocular Obstacle Avoidance Based on Inverse PPO for Fixed-wing UAVs
Haochen Chai
Meimei Su
Yang Lyu
Zhunga Liu
Chunhui Zhao
Quan Pan
117
0
0
27 Nov 2024
JPPO: Joint Power and Prompt Optimization for Accelerated Large Language Model Services
Feiran You
Hongyang Du
Kaibin Huang
Abbas Jamalipour
147
2
0
27 Nov 2024
LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble
Yujeong Lee
Sangwoo Shin
Wei-Jin Park
Honguk Woo
OffRL 3DV
134
1
0
26 Nov 2024
CRASH: Challenging Reinforcement-Learning Based Adversarial Scenarios For Safety Hardening
A. Kulkarni
Shangtong Zhang
Madhur Behl
AAML
106
1
0
26 Nov 2024
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
Shiron Thalagala
Pak Kin Wong
Xiaozheng Wang
Tianang Sun
OffRL
179
0
0
24 Nov 2024
To Train or Not to Train: Balancing Efficiency and Training Cost in Deep Reinforcement Learning for Mobile Edge Computing
Maddalena Boscaro
Federico Mason
Federico Chiariotti
Andrea Zanella
45
0
0
11 Nov 2024
Optimal Execution with Reinforcement Learning
Yadh Hafsi
Edoardo Vittori
64
1
0
10 Nov 2024
Human-in-the-Loop Feature Selection Using Interpretable Kolmogorov-Arnold Network-based Double Deep Q-Network
Md Abrar Jahin
M. F. Mridha
Nilanjan Dey
21
0
0
06 Nov 2024
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
156
1
0
06 Nov 2024
Autonomous Decision Making for UAV Cooperative Pursuit-Evasion Game with Reinforcement Learning
Yang Zhao
Zidong Nie
Kangsheng Dong
Qinghua Huang
Xiaochen Li
35
0
0
05 Nov 2024
Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental Adaptation
Xinhao Zhang
Jinghan Zhang
Wujun Si
Kunpeng Liu
79
1
0
04 Nov 2024
Two-Timescale Model Caching and Resource Allocation for Edge-Enabled AI-Generated Content Services
Zhang Liu
Hongyang Du
Xiangwang Hou
Lianfen Huang
Seyyedali Hosseinalipour
Dusit Niyato
K. B. Letaief
DiffM
68
2
0
03 Nov 2024
Guiding Multi-agent Multi-task Reinforcement Learning by a Hierarchical Framework with Logical Reward Shaping
Chanjuan Liu
Jinmiao Cong
Bingcai Chen
Yaochu Jin
Enqiang Zhu
92
1
0
02 Nov 2024
Toward Automated Algorithm Design: A Survey and Practical Guide to Meta-Black-Box-Optimization
Zeyuan Ma
Hongshu Guo
Yue-Jiao Gong
Jun Zhang
Kay Chen Tan
298
5
0
01 Nov 2024
Deterministic Exploration via Stationary Bellman Error Maximization
Sebastian Griesbach
Carlo D'Eramo
50
0
0
31 Oct 2024
CALE: Continuous Arcade Learning Environment
Jesse Farebrother
Pablo Samuel Castro
ELM
68
0
0
31 Oct 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
168
0
0
27 Oct 2024
Velocity-History-Based Soft Actor-Critic Tackling IROS'24 Competition "AI Olympics with RealAIGym"
Tim Lukas Faust
Habib Maraqten
Erfan Aghadavoodi
Boris Belousov
Jan Peters
33
1
0
26 Oct 2024
Guiding Reinforcement Learning with Incomplete System Dynamics
Shuyuan Wang
Jingliang Duan
Nathan P. Lawrence
Philip D. Loewen
M. Forbes
R. Bhushan Gopaluni
Lixian Zhang
107
1
0
22 Oct 2024
Long-distance Geomagnetic Navigation in GNSS-denied Environments with Deep Reinforcement Learning
Wenqi Bai
Xiaohui Zhang
Shiliang Zhang
Songnan Yang
Yushuai Li
Tingwen Huang
AI4CE
49
2
0
21 Oct 2024
GNNRL-Smoothing: A Prior-Free Reinforcement Learning Model for Mesh Smoothing
Zhichao Wang
Xinhai Chen
Chunye Gong
Bo Yang
Liang Deng
Yufei Sun
Yufei Pang
Jie Liu
AI4CE
63
0
0
19 Oct 2024
Online Reinforcement Learning with Passive Memory
Anay Pattanaik
Lav R. Varshney
CLL OffRL
53
0
0
18 Oct 2024
TF-DDRL: A Transformer-enhanced Distributed DRL Technique for Scheduling IoT Applications in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
OffRL
115
4
0
18 Oct 2024
Deep Reinforcement Learning for Online Optimal Execution Strategies
Alessandro Micheli
Mélodie Monod
OffRL
23
0
0
17 Oct 2024
Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving
Sihao Wu
Jiaxu Liu
Xiangyu Yin
Guangliang Cheng
Xingyu Zhao
Meng Fang
Xinping Yi
Xiaowei Huang
87
1
0
16 Oct 2024
SEMSO: A Secure and Efficient Multi-Data Source Blockchain Oracle
Youquan Xian
Xueying Zeng
Chunpei Li
Peng Wang
Dongcheng Li
Peng Liu
Xianxian Li
48
0
0
16 Oct 2024
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making
Stelios Triantafyllou
A. Sukovic
Yasaman Zolfimoselo
Goran Radanović
CML
102
0
0
16 Oct 2024
Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach
Henrique Donâncio
Antoine Barrier
Leah F. South
Florence Forbes
71
0
0
16 Oct 2024
Counterfactual Generative Modeling with Variational Causal Inference
Yulun Wu
Louie McConnell
Claudia Iriondo
CML BDL
147
3
0
16 Oct 2024
Improve Value Estimation of Q Function and Reshape Reward with Monte Carlo Tree Search
Jiamian Li
78
0
0
15 Oct 2024
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
Jaehyun Park
Yunho Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
83
1
0
15 Oct 2024
PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion
Runsong Zhu
Shi Qiu
Qianyi Wu
Ka-Hei Hui
Pheng-Ann Heng
Chi-Wing Fu
55
0
0
14 Oct 2024
Dynamic Estimation of Learning Rates Using a Non-Linear Autoregressive Model
Ramin Okhrati
32
0
0
13 Oct 2024
Gradient-Free Neural Network Training on the Edge
Dotan Di Castro
O. Joglekar
Shir Kozlovsky
Vladimir Tchuiev
Michal Moshkovitz
MQ
33
0
0
13 Oct 2024
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Hyunseung Kim
Jun Jet Tai
K. Subramanian
Peter R. Wurman
Jaegul Choo
Peter Stone
Takuma Seno
OffRL
183
17
0
13 Oct 2024
SAPIENT: Mastering Multi-turn Conversational Recommendation with Strategic Planning and Monte Carlo Tree Search
Hanwen Du
Bo Peng
Xia Ning
117
0
0
12 Oct 2024
Meta-Learning from Learning Curves for Budget-Limited Algorithm Selection
Manh Hung Nguyen
Lisheng Sun-Hosoya
Isabelle M Guyon
68
1
0
10 Oct 2024
A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering
Qihan Qi
Xinsong Yang
Gang Xia
Daniel W. C. Ho
Pengyang Tang
94
0
0
09 Oct 2024