Deep Reinforcement Learning with Double Q-learning


22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 985 papers shown
Title
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
58
824
0
05 Oct 2020
Facilitating Connected Autonomous Vehicle Operations Using Space-weighted Information Fusion and Deep Reinforcement Learning Based Control
Jiqian Dong
Sikai Chen
Yujie Li
Runjia Du
Aaron Steinfeld
Samuel Labi
31
11
0
30 Sep 2020
Toolpath design for additive manufacturing using deep reinforcement learning
M. Mozaffar
Ablodghani Ebrahimi
Jian Cao
AI4CE
28
7
0
30 Sep 2020
Finite-Time Analysis for Double Q-learning
Huaqing Xiong
Linna Zhao
Yingbin Liang
Wei Zhang
30
31
0
29 Sep 2020
Learning to Play against Any Mixture of Opponents
Max O. Smith
Thomas W. Anthony
Yongzhao Wang
Michael P. Wellman
OffRL
33
9
0
29 Sep 2020
Novelty Search in Representational Space for Sample Efficient Exploration
Ruo Yu Tao
Vincent François-Lavet
Joelle Pineau
42
43
0
28 Sep 2020
A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward
U. Yavas
T. Kumbasar
N. K. Üre
21
19
0
24 Sep 2020
Learning a Contact-Adaptive Controller for Robust, Efficient Legged Locomotion
Xingye Da
Zhaoming Xie
David Hoeller
Byron Boots
Anima Anandkumar
Yuke Zhu
Buck Babich
Animesh Garg
27
57
0
21 Sep 2020
Lyapunov-Based Reinforcement Learning for Decentralized Multi-Agent Control
Qingrui Zhang
Hao Dong
Wei Pan
29
6
0
20 Sep 2020
The Importance of Pessimism in Fixed-Dataset Policy Optimization
Jacob Buckman
Carles Gelada
Marc G. Bellemare
OffRL
47
137
0
15 Sep 2020
VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning
R. Awasthi
K. K. Guliani
Saif Ahmad Khan
Aniket Vashishtha
M. S. Gill
Arshita Bhatt
A. Nagori
Aniket Gupta
Ponnurangam Kumaraguru
Tavpritesh Sethi
59
24
0
14 Sep 2020
COVID-19 Pandemic Cyclic Lockdown Optimization Using Reinforcement Learning
M. Arango
Lyudmil Pelov
22
16
0
10 Sep 2020
Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments using A3C learning and Residual Recurrent Neural Networks
Shreshth Tuli
Shashikant Ilager
K. Ramamohanarao
Rajkumar Buyya
27
176
0
01 Sep 2020
CLAN: Continuous Learning using Asynchronous Neuroevolution on Commodity Edge Devices
Parth Mannan
A. Samajdar
T. Krishna
36
2
0
27 Aug 2020
Robust Image Matching By Dynamic Feature Selection
Hao Huang
Jianchun Chen
Xiang Li
Lingjing Wang
Yi Fang
23
3
0
13 Aug 2020
Robust Deep Reinforcement Learning through Adversarial Loss
Tuomas P. Oikarinen
Wang Zhang
Alexandre Megretski
Luca Daniel
Tsui-Wei Weng
AAML
49
94
0
05 Aug 2020
EasyRL: A Simple and Extensible Reinforcement Learning Framework
Neil Hulbert
S. Spillers
Brandon Francis
James Haines-Temons
Ken Gil Romero
Benjamin De Jager
Sam Wong
Kevin Flora
Bowei Huang
Athirai Aravazhi Irissappane
OffRL
OnRL
SyDa
16
1
0
04 Aug 2020
Low Dimensional State Representation Learning with Reward-shaped Priors
N. Botteghi
Ruben Obbink
D. Geijs
M. Poel
B. Sirmaçek
C. Brune
A. Mersha
Stefano Stramigioli
SSL
OffRL
25
4
0
29 Jul 2020
Variance Reduction for Deep Q-Learning using Stochastic Recursive Gradient
Hao Jia
Xiao Zhang
Jun Xu
Wei Zeng
Hao Jiang
Xiao Yan
Ji-Rong Wen
37
3
0
25 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control
Karush Suri
Xiaolong Shi
Konstantinos N. Plataniotis
Y. Lawryshyn
30
4
0
24 Jul 2020
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
217
119
0
21 Jul 2020
UAV Target Tracking in Urban Environments Using Deep Reinforcement Learning
Sarthak Bhagat
Sujit PB
42
47
0
21 Jul 2020
Active MR k-space Sampling with Reinforcement Learning
Luis Villaseñor-Pineda
Sumana Basu
Adriana Romero
Roberto Calandra
M. Drozdzal
19
70
0
20 Jul 2020
Multi-robot Cooperative Object Transportation using Decentralized Deep Reinforcement Learning
Lin Zhang
Hao Xiong
Ou Ma
Zhaokui Wang
17
6
0
17 Jul 2020
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
32
77
0
16 Jul 2020
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
Liang Liu
Hao Lu
Hongwei Zou
Haipeng Xiong
Zhiguo Cao
Chunhua Shen
OffRL
32
71
0
16 Jul 2020
Odyssey: Creation, Analysis and Detection of Trojan Models
Marzieh Edraki
Nazmul Karim
Nazanin Rahnavard
Ajmal Mian
M. Shah
AAML
42
13
0
16 Jul 2020
Information Freshness-Aware Task Offloading in Air-Ground Integrated Edge Computing Systems
Xianfu Chen
Celimuge Wu
Tao Chen
Zhi Liu
Honggang Zhang
M. Bennis
Hang Liu
Yusheng Ji
39
71
0
15 Jul 2020
Robustifying Reinforcement Learning Agents via Action Space Adversarial Training
Kai Liang Tan
Yasaman Esfandiari
Xian Yeow Lee
Aakanksha
Soumik Sarkar
AAML
26
55
0
14 Jul 2020
Revisiting Fundamentals of Experience Replay
W. Fedus
Prajit Ramachandran
Rishabh Agarwal
Yoshua Bengio
Hugo Larochelle
Mark Rowland
Will Dabney
KELM
OffRL
41
235
0
13 Jul 2020
Reinforcement Learning of Musculoskeletal Control from Functional Simulations
Emanuel Joos
Fabien Péan
Orçun Göksel
AI4CE
37
12
0
13 Jul 2020
Data-Efficient Reinforcement Learning with Self-Predictive Representations
Max Schwarzer
Ankesh Anand
Rishab Goel
R. Devon Hjelm
Aaron Courville
Philip Bachman
41
312
0
12 Jul 2020
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer
Emmy Liu
Ramtin Keramati
Sudarshan Seshadri
Kelvin Guu
Panupong Pasupat
Emma Brunskill
Percy Liang
OffRL
29
5
0
12 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
30
200
0
09 Jul 2020
UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach
Harald Bayerlein
Mirco Theile
Marco Caccamo
David Gesbert
32
55
0
01 Jul 2020
Convex Regularization in Monte-Carlo Tree Search
Tuan Dam
Carlo D'Eramo
Jan Peters
Joni Pajarinen
OffRL
22
11
0
01 Jul 2020
Group Equivariant Deep Reinforcement Learning
Arnab Kumar Mondal
Pratheeksha Nair
Kaleem Siddiqi
19
32
0
01 Jul 2020
Deep reinforcement learning approach to MIMO precoding problem: Optimality and Robustness
Heunchul Lee
Maksym A. Girnyk
Jaeseong Jeong
24
15
0
30 Jun 2020
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
38
55
0
23 Jun 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulic
OffRL
30
27
0
23 Jun 2020
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Eric Steinberger
Adam Lerer
Noam Brown
53
53
0
18 Jun 2020
WD3: Taming the Estimation Bias in Deep Reinforcement Learning
Qiang He
Xinwen Hou
OffRL
10
28
0
18 Jun 2020
Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations
Alexey Skrynnik
A. Staroverov
Ermek Aitygulov
Kirill Aksenov
Vasilii Davydov
Aleksandr I. Panov
OffRL
28
4
0
17 Jun 2020
Reinforcement Learning with Uncertainty Estimation for Tactical Decision-Making in Intersections
C. Hoel
Tommy Tram
J. Sjöberg
32
30
0
17 Jun 2020
Learning Heuristic Selection with Dynamic Algorithm Configuration
David Speck
André Biedenkapp
Frank Hutter
Robert Mattmüller
Marius Lindauer
32
29
0
15 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
33
24
0
12 Jun 2020
AdaDeep: A Usage-Driven, Automated Deep Model Compression Framework for Enabling Ubiquitous Intelligent Mobiles
Sicong Liu
Junzhao Du
Kaiming Nan
Zimu Zhou
Zhangyang Wang
Yingyan Lin
34
30
0
08 Jun 2020
Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning
Shariq Iqbal
Christian Schroeder de Witt
Bei Peng
Wendelin Bohmer
Shimon Whiteson
Fei Sha
36
64
0
07 Jun 2020
Re-understanding Finite-State Representations of Recurrent Policy Networks
Mohamad H. Danesh
Anurag Koul
Alan Fern
Saeed Khorram
38
21
0
06 Jun 2020
A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines
Callum Wilson
A. Riccardi
E. Minisci
19
4
0
04 Jun 2020