1509.06461
Cited By
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
50 / 985 papers shown
Title
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
58
824
0
05 Oct 2020
Facilitating Connected Autonomous Vehicle Operations Using Space-weighted Information Fusion and Deep Reinforcement Learning Based Control
Jiqian Dong
Sikai Chen
Yujie Li
Runjia Du
Aaron Steinfeld
Samuel Labi
31
11
0
30 Sep 2020
Toolpath design for additive manufacturing using deep reinforcement learning
M. Mozaffar
Ablodghani Ebrahimi
Jian Cao
AI4CE
28
7
0
30 Sep 2020
Finite-Time Analysis for Double Q-learning
Huaqing Xiong
Linna Zhao
Yingbin Liang
Wei Zhang
30
31
0
29 Sep 2020
Learning to Play against Any Mixture of Opponents
Max O. Smith
Thomas W. Anthony
Yongzhao Wang
Michael P. Wellman
OffRL
33
9
0
29 Sep 2020
Novelty Search in Representational Space for Sample Efficient Exploration
Ruo Yu Tao
Vincent François-Lavet
Joelle Pineau
42
43
0
28 Sep 2020
A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward
U. Yavas
T. Kumbasar
N. K. Üre
21
19
0
24 Sep 2020
Learning a Contact-Adaptive Controller for Robust, Efficient Legged Locomotion
Xingye Da
Zhaoming Xie
David Hoeller
Byron Boots
Anima Anandkumar
Yuke Zhu
Buck Babich
Animesh Garg
27
57
0
21 Sep 2020
Lyapunov-Based Reinforcement Learning for Decentralized Multi-Agent Control
Qingrui Zhang
Hao Dong
Wei Pan
29
6
0
20 Sep 2020
The Importance of Pessimism in Fixed-Dataset Policy Optimization
Jacob Buckman
Carles Gelada
Marc G. Bellemare
OffRL
47
137
0
15 Sep 2020
VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning
R. Awasthi
K. K. Guliani
Saif Ahmad Khan
Aniket Vashishtha
M. S. Gill
Arshita Bhatt
A. Nagori
Aniket Gupta
Ponnurangam Kumaraguru
Tavpritesh Sethi
59
24
0
14 Sep 2020
COVID-19 Pandemic Cyclic Lockdown Optimization Using Reinforcement Learning
M. Arango
Lyudmil Pelov
22
16
0
10 Sep 2020
Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments using A3C learning and Residual Recurrent Neural Networks
Shreshth Tuli
Shashikant Ilager
K. Ramamohanarao
Rajkumar Buyya
27
176
0
01 Sep 2020
CLAN: Continuous Learning using Asynchronous Neuroevolution on Commodity Edge Devices
Parth Mannan
A. Samajdar
T. Krishna
36
2
0
27 Aug 2020
Robust Image Matching By Dynamic Feature Selection
Hao Huang
Jianchun Chen
Xiang Li
Lingjing Wang
Yi Fang
23
3
0
13 Aug 2020
Robust Deep Reinforcement Learning through Adversarial Loss
Tuomas P. Oikarinen
Wang Zhang
Alexandre Megretski
Luca Daniel
Tsui-Wei Weng
AAML
49
94
0
05 Aug 2020
EasyRL: A Simple and Extensible Reinforcement Learning Framework
Neil Hulbert
S. Spillers
Brandon Francis
James Haines-Temons
Ken Gil Romero
Benjamin De Jager
Sam Wong
Kevin Flora
Bowei Huang
Athirai Aravazhi Irissappane
OffRL
OnRL
SyDa
16
1
0
04 Aug 2020
Low Dimensional State Representation Learning with Reward-shaped Priors
N. Botteghi
Ruben Obbink
D. Geijs
M. Poel
B. Sirmaçek
C. Brune
A. Mersha
Stefano Stramigioli
SSL
OffRL
25
4
0
29 Jul 2020
Variance Reduction for Deep Q-Learning using Stochastic Recursive Gradient
Hao Jia
Xiao Zhang
Jun Xu
Wei Zeng
Hao Jiang
Xiao Yan
Ji-Rong Wen
37
3
0
25 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control
Karush Suri
Xiaolong Shi
Konstantinos N. Plataniotis
Y. Lawryshyn
30
4
0
24 Jul 2020
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
217
119
0
21 Jul 2020
UAV Target Tracking in Urban Environments Using Deep Reinforcement Learning
Sarthak Bhagat
Sujit PB
42
47
0
21 Jul 2020
Active MR k-space Sampling with Reinforcement Learning
Luis Villaseñor-Pineda
Sumana Basu
Adriana Romero
Roberto Calandra
M. Drozdzal
19
70
0
20 Jul 2020
Multi-robot Cooperative Object Transportation using Decentralized Deep Reinforcement Learning
Lin Zhang
Hao Xiong
Ou Ma
Zhaokui Wang
17
6
0
17 Jul 2020
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
32
77
0
16 Jul 2020
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
Liang Liu
Hao Lu
Hongwei Zou
Haipeng Xiong
Zhiguo Cao
Chunhua Shen
OffRL
32
71
0
16 Jul 2020
Odyssey: Creation, Analysis and Detection of Trojan Models
Marzieh Edraki
Nazmul Karim
Nazanin Rahnavard
Ajmal Mian
M. Shah
AAML
42
13
0
16 Jul 2020
Information Freshness-Aware Task Offloading in Air-Ground Integrated Edge Computing Systems
Xianfu Chen
Celimuge Wu
Tao Chen
Zhi Liu
Honggang Zhang
M. Bennis
Hang Liu
Yusheng Ji
39
71
0
15 Jul 2020
Robustifying Reinforcement Learning Agents via Action Space Adversarial Training
Kai Liang Tan
Yasaman Esfandiari
Xian Yeow Lee
Aakanksha
Soumik Sarkar
AAML
26
55
0
14 Jul 2020
Revisiting Fundamentals of Experience Replay
W. Fedus
Prajit Ramachandran
Rishabh Agarwal
Yoshua Bengio
Hugo Larochelle
Mark Rowland
Will Dabney
KELM
OffRL
41
235
0
13 Jul 2020
Reinforcement Learning of Musculoskeletal Control from Functional Simulations
Emanuel Joos
Fabien Péan
Orçun Göksel
AI4CE
37
12
0
13 Jul 2020
Data-Efficient Reinforcement Learning with Self-Predictive Representations
Max Schwarzer
Ankesh Anand
Rishab Goel
R. Devon Hjelm
Aaron Courville
Philip Bachman
41
312
0
12 Jul 2020
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer
Emmy Liu
Ramtin Keramati
Sudarshan Seshadri
Kelvin Guu
Panupong Pasupat
Emma Brunskill
Percy Liang
OffRL
29
5
0
12 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
30
200
0
09 Jul 2020
UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach
Harald Bayerlein
Mirco Theile
Marco Caccamo
David Gesbert
32
55
0
01 Jul 2020
Convex Regularization in Monte-Carlo Tree Search
Tuan Dam
Carlo D'Eramo
Jan Peters
Joni Pajarinen
OffRL
22
11
0
01 Jul 2020
Group Equivariant Deep Reinforcement Learning
Arnab Kumar Mondal
Pratheeksha Nair
Kaleem Siddiqi
19
32
0
01 Jul 2020
Deep reinforcement learning approach to MIMO precoding problem: Optimality and Robustness
Heunchul Lee
Maksym A. Girnyk
Jaeseong Jeong
24
15
0
30 Jun 2020
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
38
55
0
23 Jun 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulic
OffRL
30
27
0
23 Jun 2020
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Eric Steinberger
Adam Lerer
Noam Brown
53
53
0
18 Jun 2020
WD3: Taming the Estimation Bias in Deep Reinforcement Learning
Qiang He
Xinwen Hou
OffRL
10
28
0
18 Jun 2020
Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations
Alexey Skrynnik
A. Staroverov
Ermek Aitygulov
Kirill Aksenov
Vasilii Davydov
Aleksandr I. Panov
OffRL
28
4
0
17 Jun 2020
Reinforcement Learning with Uncertainty Estimation for Tactical Decision-Making in Intersections
C. Hoel
Tommy Tram
J. Sjöberg
32
30
0
17 Jun 2020
Learning Heuristic Selection with Dynamic Algorithm Configuration
David Speck
André Biedenkapp
Frank Hutter
Robert Mattmüller
Marius Lindauer
32
29
0
15 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
33
24
0
12 Jun 2020
AdaDeep: A Usage-Driven, Automated Deep Model Compression Framework for Enabling Ubiquitous Intelligent Mobiles
Sicong Liu
Junzhao Du
Kaiming Nan
Zimu Zhou
Zhangyang Wang
Yingyan Lin
34
30
0
08 Jun 2020
Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning
Shariq Iqbal
Christian Schroeder de Witt
Bei Peng
Wendelin Böhmer
Shimon Whiteson
Fei Sha
36
64
0
07 Jun 2020
Re-understanding Finite-State Representations of Recurrent Policy Networks
Mohamad H. Danesh
Anurag Koul
Alan Fern
Saeed Khorram
38
21
0
06 Jun 2020
A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines
Callum Wilson
A. Riccardi
E. Minisci
19
4
0
04 Jun 2020