Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.06461
Cited By
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
50 / 947 papers shown
Title
M
2
^2
2
DQN: A Robust Method for Accelerating Deep Q-learning Network
Zhe Zhang
Yukun Zou
Junjie Lai
Qinglong Xu
18
4
0
16 Sep 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
32
17
0
16 Sep 2022
On the Reuse Bias in Off-Policy Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Dong Yan
Jun Zhu
OffRL
45
3
0
15 Sep 2022
MIXRTs: Toward Interpretable Multi-Agent Reinforcement Learning via Mixing Recurrent Soft Decision Trees
Zichuan Liu
Zichuan Liu
Zhi Wang
Yuanyang Zhu
Chunlin Chen
66
5
0
15 Sep 2022
Ask Before You Act: Generalising to Novel Environments by Asking Questions
Ross Murphy
S. Mosesov
Javier Leguina Peral
Thymo ter Doest
LRM
32
0
0
10 Sep 2022
Task-Agnostic Learning to Accomplish New Tasks
Xianqi Zhang
Xingtao Wang
Xu Liu
Wenrui Wang
Xiaopeng Fan
Debin Zhao
OffRL
91
0
0
09 Sep 2022
Reward Delay Attacks on Deep Reinforcement Learning
Anindya Sarkar
Jiarui Feng
Yevgeniy Vorobeychik
Christopher Gill
Ning Zhang
AAML
13
6
0
08 Sep 2022
Prediction Based Decision Making for Autonomous Highway Driving
Mustafa Yildirim
Sajjad Mozaffari
Lucy McCutcheon
M. Dianati
Alireza Tamaddoni-Nezhad Saber Fallah
18
7
0
05 Sep 2022
Model-Free Deep Reinforcement Learning in Software-Defined Networks
Luke Borchjes
Clement N. Nyirenda
L. Leenen
29
1
0
03 Sep 2022
Reinforcement Learning with Prior Policy Guidance for Motion Planning of Dual-Arm Free-Floating Space Robot
Yu-wen Cao
Shengjie Wang
Xiang Zheng
Wen-Xuan Ma
Xinru Xie
Lei Liu
13
25
0
03 Sep 2022
Distributed Ensembles of Reinforcement Learning Agents for Electricity Control
Pierrick Pochelu
S. Petiton
B. Conche
AI4CE
33
2
0
30 Aug 2022
An intelligent algorithmic trading based on a risk-return reinforcement learning algorithm
Boyin Jin
24
1
0
23 Aug 2022
Quantum Multi-Agent Meta Reinforcement Learning
Won Joon Yun
Jihong Park
Joongheon Kim
32
37
0
22 Aug 2022
A Review of the Convergence of 5G/6G Architecture and Deep Learning
O. Odeyomi
Olubiyi O. Akintade
T. Olowu
G. Záruba
AILaw
3DV
AI4TS
28
1
0
16 Aug 2022
Implicit Two-Tower Policies
Yunfan Zhao
Qingkai Pan
K. Choromanski
Deepali Jain
Vikas Sindhwani
OffRL
36
3
0
02 Aug 2022
DRL-M4MR: An Intelligent Multicast Routing Approach Based on DQN Deep Reinforcement Learning in SDN
Chenwei Zhao
Miao Ye
Xingsi Xue
Jianhui Lv
Qiuxiang Jiang
Yong Wang
27
17
0
31 Jul 2022
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Masatoshi Uehara
Haruka Kiyohara
Andrew Bennett
Victor Chernozhukov
Nan Jiang
Nathan Kallus
C. Shi
Wen Sun
OffRL
34
16
0
26 Jul 2022
Adaptive Decision Making at the Intersection for Autonomous Vehicles Based on Skill Discovery
Xianqi He
Ling Yang
Chao Lu
Zirui Li
Jian-wei Gong
29
1
0
24 Jul 2022
Graph-Structured Policy Learning for Multi-Goal Manipulation Tasks
David Klee
Ondrej Biza
Robert Platt
OffRL
32
1
0
22 Jul 2022
Romanus: Robust Task Offloading in Modular Multi-Sensor Autonomous Driving Systems
Luke Chen
Mohanad Odema
Mohammad Abdullah Al Faruque
32
4
0
18 Jul 2022
Robust AI Driving Strategy for Autonomous Vehicles
S. Nageshrao
Yousaf Rahman
V. Ivanovic
M. Janković
E. Tseng
M. Hafner
Dimitar Filev
44
4
0
16 Jul 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Stephen Marcus McAleer
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
Tuomas Sandholm
40
18
0
13 Jul 2022
Hindsight Learning for MDPs with Exogenous Inputs
Sean R. Sinclair
Felipe Vieira Frujeri
Ching-An Cheng
Luke Marshall
Hugo Barbalho
Jingling Li
Jennifer Neville
Ishai Menache
Adith Swaminathan
18
23
0
13 Jul 2022
Multi-objective Optimization of Notifications Using Offline Reinforcement Learning
Prakruthi Prabhakar
Yiping Yuan
Guangyu Yang
Wensheng Sun
A. Muralidharan
OffRL
28
6
0
07 Jul 2022
Offline RL Policies Should be Trained to be Adaptive
Dibya Ghosh
Anurag Ajay
Pulkit Agrawal
Sergey Levine
OffRL
35
45
0
05 Jul 2022
Robust Reinforcement Learning in Continuous Control Tasks with Uncertainty Set Regularization
Yuan Zhang
Jianhong Wang
Joschka Boedecker
43
3
0
05 Jul 2022
Asynchronous Curriculum Experience Replay: A Deep Reinforcement Learning Approach for UAV Autonomous Motion Control in Unknown Dynamic Environments
Zijian Hu
Xiao-guang Gao
Kaifang Wan
Qianglong Wang
Yiwei Zhai
45
10
0
04 Jul 2022
A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers
Julio A. Placed
Jared Strader
Henry Carrillo
Nikolay Atanasov
Vadim Indelman
Luca Carlone
J. A. Castellanos
35
178
0
01 Jul 2022
Conditionally Elicitable Dynamic Risk Measures for Deep Reinforcement Learning
Anthony Coache
S. Jaimungal
Á. Cartea
30
13
0
29 Jun 2022
Imitate then Transcend: Multi-Agent Optimal Execution with Dual-Window Denoise PPO
Jin Fang
Jiacheng Weng
Yi Xiang
Xinwen Zhang
OffRL
29
2
0
21 Jun 2022
Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration
Wenhui Huang
Cong Zhang
Jingda Wu
Xiangkun He
Jie Zhang
Chengqi Lv
32
8
0
20 Jun 2022
Cooperative Edge Caching via Multi Agent Reinforcement Learning in Fog Radio Access Networks
Qi Chang
Yanxiang Jiang
F. Zheng
M. Bennis
X. You
11
7
0
20 Jun 2022
A Universal Adversarial Policy for Text Classifiers
Gallil Maimon
Lior Rokach
AAML
19
10
0
19 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
27
69
0
16 Jun 2022
A Search-Based Testing Approach for Deep Reinforcement Learning Agents
Amirhossein Zolfagharian
Manel Abdellatif
Lionel C. Briand
M. Bagherzadeh
Ramesh S
55
27
0
15 Jun 2022
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Xiang Li
Jinghuan Shang
Srijan Das
Michael S. Ryoo
SSL
40
32
0
10 Jun 2022
AAM-Gym: Artificial Intelligence Testbed for Advanced Air Mobility
Marc Brittain
Luis E. Alvarez
Kara Breeden
Ian Jessen
28
8
0
09 Jun 2022
Goal-Space Planning with Subgoal Models
Chun-Ping Lo
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Gábor Mihucz
Farzane Aminmansour
Martha White
29
5
0
06 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
37
63
0
03 Jun 2022
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Kevin Esslinger
Robert Platt
Chris Amato
OffRL
35
36
0
02 Jun 2022
Watch Out for the Safety-Threatening Actors: Proactively Mitigating Safety Hazards
Saurabh Jha
Shengkun Cui
Zbigniew T. Kalbarczyk
Ravishankar Iyer
LLMSV
24
1
0
02 Jun 2022
Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning
Yinglun Xu
Qi Zeng
Gagandeep Singh
AAML
42
6
0
30 May 2022
Group-wise Reinforcement Feature Generation for Optimal and Explainable Representation Space Reconstruction
Dongjie Wang
Yanjie Fu
Kunpeng Liu
Xiaolin Li
Yan Solihin
37
30
0
28 May 2022
Reinforcement Learning for Branch-and-Bound Optimisation using Retrospective Trajectories
Christopher W. F. Parsonson
Alexandre Laterre
Thomas D. Barrett
34
19
0
28 May 2022
Automated Dynamic Algorithm Configuration
Steven Adriaensen
André Biedenkapp
Gresa Shala
Noor H. Awad
Theresa Eimer
Marius Lindauer
Frank Hutter
44
37
0
27 May 2022
Verifying Learning-Based Robotic Navigation Systems
Guy Amir
Davide Corsi
Raz Yerushalmi
Luca Marzari
D. Harel
Alessandro Farinelli
Guy Katz
94
37
0
26 May 2022
An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation
Shuyu Yin
Yaoyu Zhang
Peilin Liu
Z. Xu
31
2
0
25 May 2022
Deep Reinforcement Learning for Multi-class Imbalanced Training
Jenny Yang
Rasheed el-Bouri
Odhran O'Donoghue
Alexander S. Lachapelle
A. Soltan
David Clifton
OffRL
AI4CE
24
10
0
24 May 2022
MetaSlicing: A Novel Resource Allocation Framework for Metaverse
N. Chu
D. Hoang
Diep N. Nguyen
Khoa T. Phan
E. Dutkiewicz
Dusist Niyato
Tao Shu
55
46
0
23 May 2022
Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation
Qingfeng Lan
Yangchen Pan
Jun Luo
A. R. Mahmood
OffRL
39
8
0
22 May 2022
Previous
1
2
3
...
6
7
8
...
17
18
19
Next