ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.06461
  4. Cited By
Deep Reinforcement Learning with Double Q-learning

Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL
ArXivPDFHTML

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 981 papers shown
Title
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal
  Point Processes
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes
Chao Qu
Jue Chen
Siqiao Xue
Xiaoming Shi
James Y. Zhang
Hongyuan Mei
OffRL
37
17
0
29 Jan 2022
Mask-based Latent Reconstruction for Reinforcement Learning
Mask-based Latent Reconstruction for Reinforcement Learning
Tao Yu
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
29
44
0
28 Jan 2022
Generative Adversarial Exploration for Reinforcement Learning
Generative Adversarial Exploration for Reinforcement Learning
Weijun Hong
Menghui Zhu
Minghuan Liu
Weinan Zhang
Ming Zhou
Yong Yu
Peng Sun
OnRL
39
7
0
27 Jan 2022
Multi-Agent Adversarial Attacks for Multi-Channel Communications
Multi-Agent Adversarial Attacks for Multi-Channel Communications
Juncheng Dong
Suya Wu
Mohammadreza Soltani
Vahid Tarokh
AAML
51
3
0
22 Jan 2022
A Prescriptive Dirichlet Power Allocation Policy with Deep Reinforcement
  Learning
A Prescriptive Dirichlet Power Allocation Policy with Deep Reinforcement Learning
Yuan Tian
Minghao Han
Chetan S. Kulkarni
Olga Fink
25
13
0
20 Jan 2022
Anytime PSRO for Two-Player Zero-Sum Games
Anytime PSRO for Two-Player Zero-Sum Games
Stephen Marcus McAleer
Kevin A. Wang
John Lanier
Marc Lanctot
Pierre Baldi
Tuomas Sandholm
Roy Fox
24
12
0
19 Jan 2022
Demystifying Reinforcement Learning in Time-Varying Systems
Demystifying Reinforcement Learning in Time-Varying Systems
Pouya Hamadanian
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
77
1
0
14 Jan 2022
Criticality-Based Varying Step-Number Algorithm for Reinforcement
  Learning
Criticality-Based Varying Step-Number Algorithm for Reinforcement Learning
Yitzhak Spielberg
A. Azaria
24
0
0
13 Jan 2022
SmartDet: Context-Aware Dynamic Control of Edge Task Offloading for
  Mobile Object Detection
SmartDet: Context-Aware Dynamic Control of Edge Task Offloading for Mobile Object Detection
Davide Callegaro
Francesco Restuccia
Marco Levorato
29
3
0
11 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
56
102
0
11 Jan 2022
Mirror Learning: A Unifying Framework of Policy Optimisation
Mirror Learning: A Unifying Framework of Policy Optimisation
J. Kuba
Christian Schroeder de Witt
Jakob N. Foerster
34
24
0
07 Jan 2022
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement
  Learning
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning
Yuxing Wang
Tiantian Zhang
Yongzhe Chang
Bin Liang
Xueqian Wang
Bo Yuan
29
15
0
01 Jan 2022
Constraint Sampling Reinforcement Learning: Incorporating Expertise For
  Faster Learning
Constraint Sampling Reinforcement Learning: Incorporating Expertise For Faster Learning
Tong Mu
Georgios Theocharous
David Arbour
Emma Brunskill
33
6
0
30 Dec 2021
Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal
  Difference and Successor Representation
Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation
Mohammad Salimibeni
Arash Mohammadi
Parvin Malekzadeh
Konstantinos N. Plataniotis
23
5
0
30 Dec 2021
A Graph Attention Learning Approach to Antenna Tilt Optimization
A Graph Attention Learning Approach to Antenna Tilt Optimization
Yifei Jin
Filippo Vannella
Maxime Bouton
Jaeseong Jeong
Ezeddin Al Hakim
29
10
0
27 Dec 2021
Value Activation for Bias Alleviation: Generalized-activated Deep Double
  Deterministic Policy Gradients
Value Activation for Bias Alleviation: Generalized-activated Deep Double Deterministic Policy Gradients
Jiafei Lyu
Yu Yang
Jiangpeng Yan
Xiu Li
OffRL
AI4CE
44
5
0
21 Dec 2021
DB-BERT: a Database Tuning Tool that "Reads the Manual"
DB-BERT: a Database Tuning Tool that "Reads the Manual"
Immanuel Trummer
35
61
0
21 Dec 2021
RoboAssembly: Learning Generalizable Furniture Assembly Policy in a
  Novel Multi-robot Contact-rich Simulation Environment
RoboAssembly: Learning Generalizable Furniture Assembly Policy in a Novel Multi-robot Contact-rich Simulation Environment
Mingxin Yu
Lin Shao
Zhehuan Chen
Tianhao Wu
Qingnan Fan
Kaichun Mo
Hao Dong
39
17
0
19 Dec 2021
On Optimizing Interventions in Shared Autonomy
On Optimizing Interventions in Shared Autonomy
Weihao Tan
David Koleczek
Siddhant Pradhan
Nicholas Perello
Vivek Chettiar
Vishal Rohra
Aaslesha Rajaram
Soundararajan Srinivasan
H. M. S. Hossain
Yash Chandak
36
5
0
16 Dec 2021
Deep Reinforcement Learning Policies Learn Shared Adversarial Features
  Across MDPs
Deep Reinforcement Learning Policies Learn Shared Adversarial Features Across MDPs
Ezgi Korkmaz
27
25
0
16 Dec 2021
Learning from Guided Play: A Scheduled Hierarchical Approach for
  Improving Exploration in Adversarial Imitation Learning
Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning
Trevor Ablett
Bryan Chan
Jonathan Kelly
42
4
0
16 Dec 2021
Tree-based Focused Web Crawling with Reinforcement Learning
Tree-based Focused Web Crawling with Reinforcement Learning
Andreas Kontogiannis
Dimitrios Kelesis
Vasilis Pollatos
George Giannakopoulos
Georgios Paliouras
29
2
0
12 Dec 2021
Recent Advances in Reinforcement Learning in Finance
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
40
168
0
08 Dec 2021
JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical
  Reinforcement Learning
JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical Reinforcement Learning
Zichuan Lin
Junyou Li
Jianing Shi
Deheng Ye
Qiang Fu
Wei Yang
BDL
45
34
0
07 Dec 2021
Explainable Deep Learning in Healthcare: A Methodological Survey from an
  Attribution View
Explainable Deep Learning in Healthcare: A Methodological Survey from an Attribution View
Di Jin
Elena Sergeeva
W. Weng
Geeticka Chauhan
Peter Szolovits
OOD
64
56
0
05 Dec 2021
A Generic Graph Sparsification Framework using Deep Reinforcement
  Learning
A Generic Graph Sparsification Framework using Deep Reinforcement Learning
Ryan Wickman
Xiaofei Zhang
Weizi Li
OffRL
25
13
0
02 Dec 2021
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Chao-Han Huck Yang
Zhengling Qi
Yifan Cui
Pin-Yu Chen
OffRL
50
4
0
29 Nov 2021
Reinforcement Learning-based Switching Controller for a Milliscale Robot
  in a Constrained Environment
Reinforcement Learning-based Switching Controller for a Milliscale Robot in a Constrained Environment
Abbas Tariverdi
Ulysse Côté-Allard
Kim Mathiassen
O. Elle
H. Kalvøy
Ø. Martinsen
J. Tørresen
16
4
0
27 Nov 2021
Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning
Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning
Nicolai Dorka
Tim Welschehold
Joschka Boedecker
Wolfram Burgard
OffRL
35
9
0
24 Nov 2021
Component Transfer Learning for Deep RL Based on Abstract
  Representations
Component Transfer Learning for Deep RL Based on Abstract Representations
Geoffrey van Driessel
Vincent François-Lavet
DRL
OffRL
30
6
0
22 Nov 2021
Renewable energy integration and microgrid energy trading using
  multi-agent deep reinforcement learning
Renewable energy integration and microgrid energy trading using multi-agent deep reinforcement learning
Daniel J. B. Harrold
Jun Cao
Zhongbo Fan
39
62
0
21 Nov 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample
  Efficiency and High Asymptotic Performance
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
Keith Ross
OffRL
19
9
0
17 Nov 2021
Compressive Features in Offline Reinforcement Learning for Recommender
  Systems
Compressive Features in Offline Reinforcement Learning for Recommender Systems
Hung Nguyen
Minh Nguyen
Long Pham
Jennifer Adorno Nieves
OffRL
24
2
0
16 Nov 2021
Obstacle Avoidance for UAS in Continuous Action Space Using Deep
  Reinforcement Learning
Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning
Jueming Hu
Xuxi Yang
Weichang Wang
Peng Wei
Lei Ying
Yongming Liu
46
24
0
13 Nov 2021
CubeTR: Learning to Solve The Rubiks Cube Using Transformers
Mustafa Chasmai
ViT
37
1
0
11 Nov 2021
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
36
21
0
09 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL
GP
65
101
0
06 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon
  Reasoning
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
42
41
0
04 Nov 2021
Balanced Q-learning: Combining the Influence of Optimistic and
  Pessimistic Targets
Balanced Q-learning: Combining the Influence of Optimistic and Pessimistic Targets
Thommen George Karimpanal
Hung Le
Majid Abdolshah
Santu Rana
Sunil R. Gupta
T. Tran
Svetha Venkatesh
25
5
0
03 Nov 2021
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms
  via Batch Prioritized Experience Replay
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay
Dogan C. Cicek
Enes Duran
Baturay Saglam
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
33
11
0
02 Nov 2021
Mastering Atari Games with Limited Data
Mastering Atari Games with Limited Data
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
VLM
70
227
0
30 Oct 2021
Bayesian Sequential Optimal Experimental Design for Nonlinear Models
  Using Policy Gradient Reinforcement Learning
Bayesian Sequential Optimal Experimental Design for Nonlinear Models Using Policy Gradient Reinforcement Learning
Wanggang Shen
Xun Huan
18
40
0
28 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Tung M. Luu
Chang D. Yoo
28
8
0
28 Oct 2021
The Difficulty of Passive Learning in Deep Reinforcement Learning
The Difficulty of Passive Learning in Deep Reinforcement Learning
Georg Ostrovski
Pablo Samuel Castro
Will Dabney
OffRL
33
57
0
26 Oct 2021
Automating Control of Overestimation Bias for Reinforcement Learning
Automating Control of Overestimation Bias for Reinforcement Learning
Arsenii Kuznetsov
Alexander Grishin
Artem Tsypin
Arsenii Ashukha
Artur Kadurin
Dmitry Vetrov
OffRL
19
2
0
26 Oct 2021
Deep Reinforcement Learning for Simultaneous Sensing and Channel Access
  in Cognitive Networks
Deep Reinforcement Learning for Simultaneous Sensing and Channel Access in Cognitive Networks
Yoel Bokobza
R. Dabora
Kobi Cohen
52
13
0
24 Oct 2021
Deep Generative Models in Engineering Design: A Review
Deep Generative Models in Engineering Design: A Review
Lyle Regenwetter
Amin Heyrani Nobari
Faez Ahmed
3DV
AI4CE
41
177
0
21 Oct 2021
Continuous Control with Action Quantization from Demonstrations
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
Matthieu Geist
Olivier Pietquin
OffRL
33
23
0
19 Oct 2021
Urban traffic dynamic rerouting framework: A DRL-based model with
  fog-cloud architecture
Urban traffic dynamic rerouting framework: A DRL-based model with fog-cloud architecture
Runjia Du
Sikai Chen
Jiqian Dong
Tiantian Chen
Xiaowen Fu
Samuel Labi
20
0
0
11 Oct 2021
Training Transition Policies via Distribution Matching for Complex Tasks
Training Transition Policies via Distribution Matching for Complex Tasks
Ju-Seung Byun
Andrew Perrault
13
6
0
08 Oct 2021
Previous
123...8910...181920
Next