ResearchTrend.AI
Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
George Andriopoulos
OffRL
65
9
0
17 Nov 2021
Compressive Features in Offline Reinforcement Learning for Recommender Systems
Hung Nguyen
Minh Nguyen
Long Pham
Jennifer Adorno Nieves
OffRL
48
2
0
16 Nov 2021
Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning
Jueming Hu
Xuxi Yang
Weichang Wang
Peng Wei
Lei Ying
Yongming Liu
58
24
0
13 Nov 2021
Improving Experience Replay through Modeling of Similar Transitions' Sets
Daniel Eugênio Neves
João Pedro Oliveira Batisteli
Eduardo Felipe Lopes
Lucila Ishitani
Zenilton K. G. Patrocínio
OffRL
23
1
0
12 Nov 2021
AWD3: Dynamic Reduction of the Estimation Bias
Dogan C. Cicek
Enes Duran
Baturay Saglam
Kagan Kaya
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
26
7
0
12 Nov 2021
CubeTR: Learning to Solve The Rubiks Cube Using Transformers
Mustafa Chasmai
ViT
65
1
0
11 Nov 2021
Spatially and Seamlessly Hierarchical Reinforcement Learning for State Space and Policy space in Autonomous Driving
Jaehyung Kim
Jaeseung Jeong
18
0
0
10 Nov 2021
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
94
22
0
09 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL, GP
111
106
0
06 Nov 2021
Cross Modality 3D Navigation Using Reinforcement Learning and Neural Style Transfer
Cesare Magnetti
Hadrien Reynaud
Bernhard Kainz
MedIm
22
0
0
05 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
81
43
0
04 Nov 2021
Balanced Q-learning: Combining the Influence of Optimistic and Pessimistic Targets
Thommen George Karimpanal
Hung Le
Majid Abdolshah
Santu Rana
Sunil R. Gupta
T. Tran
Svetha Venkatesh
64
5
0
03 Nov 2021
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay
Dogan C. Cicek
Enes Duran
Baturay Saglam
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
35
11
0
02 Nov 2021
Investigation of Independent Reinforcement Learning Algorithms in Multi-Agent Environments
Ken Ming Lee
Sriram Ganapathi Subramanian
Mark Crowley
60
11
0
01 Nov 2021
Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method
Kuo Li
Qing-Shan Jia
OffRL
18
2
0
31 Oct 2021
Mastering Atari Games with Limited Data
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
VLM
149
242
0
30 Oct 2021
Bayesian Sequential Optimal Experimental Design for Nonlinear Models Using Policy Gradient Reinforcement Learning
Wanggang Shen
Xun Huan
62
40
0
28 Oct 2021
Cooperative Deep $Q$-learning Framework for Environments Providing Image Feedback
Krishnan Raghavan
Vignesh Narayanan
S. Jagannathan
VLM, OffRL
55
1
0
28 Oct 2021
Learning to Control using Image Feedback
Krishnan Raghavan
Vignesh Narayanan
Jagannathan Saraangapani
35
0
0
28 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Tung M. Luu
Chang D. Yoo
80
8
0
28 Oct 2021
Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
23
4
0
28 Oct 2021
Comparing Heuristics, Constraint Optimization, and Reinforcement Learning for an Industrial 2D Packing Problem
S. Böhm
Martin Neumayer
Oliver Kramer
Alexander Schiendorfer
Alois Knoll
OffRL
22
2
0
27 Oct 2021
The Difficulty of Passive Learning in Deep Reinforcement Learning
Georg Ostrovski
Pablo Samuel Castro
Will Dabney
OffRL
74
59
0
26 Oct 2021
Automating Control of Overestimation Bias for Reinforcement Learning
Arsenii Kuznetsov
Alexander Grishin
Artem Tsypin
Arsenii Ashukha
Artur Kadurin
Dmitry Vetrov
OffRL
47
2
0
26 Oct 2021
Persona Authentication through Generative Dialogue
Fengyi Tang
Lifan Zeng
Fei Wang
Jiayu Zhou
97
8
0
25 Oct 2021
Deep Reinforcement Learning for Simultaneous Sensing and Channel Access in Cognitive Networks
Yoel Bokobza
R. Dabora
Kobi Cohen
60
14
0
24 Oct 2021
A Reinforcement Learning Approach to Parameter Selection for Distributed Optimal Power Flow
Tai-Yin Chiu
Alyssa Kody
Youngdae Kim
Kibaek Kim
Daniel K. Molzahn
41
21
0
22 Oct 2021
Deep Generative Models in Engineering Design: A Review
Lyle Regenwetter
Amin Heyrani Nobari
Faez Ahmed
3DV, AI4CE
136
192
0
21 Oct 2021
CIM-PPO:Proximal Policy Optimization with Liu-Correntropy Induced Metric
Yunxiao Guo
Han Long
Xiaojun Duan
Kaiyuan Feng
Maochu Li
Xiaying Ma
22
4
0
20 Oct 2021
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
Matthieu Geist
Olivier Pietquin
OffRL
103
23
0
19 Oct 2021
Reinforcement Learning-Based Coverage Path Planning with Implicit Cellular Decomposition
Javad Heydari
Olimpiya Saha
Viswanath Ganapathy
OffRL
35
16
0
18 Oct 2021
Damped Anderson Mixing for Deep Reinforcement Learning: Acceleration, Convergence, and Stabilization
Ke Sun
Yafei Wang
Yi Liu
Yingnan Zhao
Bo Pan
Shangling Jui
Bei Jiang
Linglong Kong
46
11
0
17 Oct 2021
Centroid Approximation for Bootstrap: Improving Particle Quality at Inference
Mao Ye
Qiang Liu
41
1
0
17 Oct 2021
SaLinA: Sequential Learning of Agents
Ludovic Denoyer
Alfredo De la Fuente
S. Duong
Jean-Baptiste Gaya
Pierre-Alexandre Kamienny
Daniel H. Thompson
94
11
0
15 Oct 2021
Urban traffic dynamic rerouting framework: A DRL-based model with fog-cloud architecture
Runjia Du
Sikai Chen
Jiqian Dong
Tiantian Chen
Xiaowen Fu
Samuel Labi
71
0
0
11 Oct 2021
Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation
Junhong Shen
Lin F. Yang
OffRL
51
17
0
09 Oct 2021
Training Transition Policies via Distribution Matching for Complex Tasks
Ju-Seung Byun
Andrew Perrault
55
6
0
08 Oct 2021
Medical Dead-ends and Learning to Identify High-risk States and Treatments
Mehdi Fatemi
Taylor W. Killian
J. Subramanian
Marzyeh Ghassemi
OffRL
94
40
0
08 Oct 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
83
17
0
07 Oct 2021
Designing Composites with Target Effective Young's Modulus using Reinforcement Learning
Aldair E. Gongora
Siddharth Mysore
Beichen Li
Wan Shou
Wojciech Matusik
E. Morgan
Keith A. Brown
Emily Whiting
AI4CE
62
9
0
07 Oct 2021
Explaining Deep Reinforcement Learning Agents In The Atari Domain through a Surrogate Model
Alexander Sieusahai
Matthew J. Guzdial
71
13
0
07 Oct 2021
Optimized Recommender Systems with Deep Reinforcement Learning
Lucas Farris
OffRL
25
0
0
06 Oct 2021
On The Transferability of Deep-Q Networks
M. Sabatelli
Pierre Geurts
83
2
0
06 Oct 2021
DeepEdge: A Deep Reinforcement Learning based Task Orchestrator for Edge Computing
Baris Yamansavascilar
A. C. Baktir
Cagatay Sonmez
Atay Ozgovde
Cem Ersoy
49
25
0
05 Oct 2021
Deep reinforcement learning for guidewire navigation in coronary artery phantom
Jihoon Kweon
Kyunghwan Kim
Chaehyuk Lee
Hwi Kwon
Jinwoo Park
...
Inwook Back
J. Roh
Y. Moon
Jaesoon Choi
Young-Hak Kim
OnRL
65
34
0
05 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
185
283
0
04 Oct 2021
Multi-Agent Path Planning Using Deep Reinforcement Learning
M. Çetinkaya
58
2
0
04 Oct 2021
A Cramér Distance perspective on Quantile Regression based Distributional Reinforcement Learning
Alix Lhéritier
Nicolas Bondoux
38
5
0
01 Oct 2021
Scalable Online Planning via Reinforcement Learning Fine-Tuning
Arnaud Fickinger
Hengyuan Hu
Brandon Amos
Stuart J. Russell
Noam Brown
97
21
0
30 Sep 2021
Modeling Interactions of Autonomous Vehicles and Pedestrians with Deep Multi-Agent Reinforcement Learning for Collision Avoidance
Raphael Trumpp
Harald Bayerlein
David Gesbert
38
18
0
30 Sep 2021