Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.06461
Cited By
v1
v2
v3 (latest)
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Double Q-learning"
50 / 2,291 papers shown
Title
Off-Belief Learning
Hengyuan Hu
Adam Lerer
Brandon Cui
David J. Wu
Luis Pineda
Noam Brown
Jakob N. Foerster
OffRL
63
73
0
06 Mar 2021
Deep reinforcement learning in medical imaging: A literature review
S. Kevin Zhou
Hoang Ngan Le
Khoa Luu
Hien V Nguyen
N. Ayache
LM&MA
OffRL
MedIm
77
149
0
05 Mar 2021
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings
Lili Chen
Kimin Lee
A. Srinivas
Pieter Abbeel
OffRL
70
11
0
04 Mar 2021
Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph
Xin Ye
Yezhou Yang
102
25
0
01 Mar 2021
Decision Making in Monopoly using a Hybrid Deep Reinforcement Learning Approach
Trevor Bonjour
Marina Haliem
A. Alsalem
Shilpa Thomas
Hongyu Li
Vaneet Aggarwal
Mayank Kejriwal
Bharat K. Bhargava
97
15
0
01 Mar 2021
Ensemble Bootstrapping for Q-Learning
Oren Peer
Chen Tessler
Nadav Merlis
Ron Meir
87
42
0
28 Feb 2021
Sequential Learning-based IaaS Composition
Sajib Mistry
Sheik Mohammad Mostakim Fattah
A. Bouguettaya
54
0
0
24 Feb 2021
Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments
Dmitry Ivanov
Vladimir Egorov
A. Shpilman
55
5
0
24 Feb 2021
Greedy-Step Off-Policy Reinforcement Learning
Yuhui Wang
Qingyuan Wu
Pengcheng He
Xiaoyang Tan
OffRL
57
1
0
23 Feb 2021
An Interaction-aware Evaluation Method for Highly Automated Vehicles
Xinpeng Wang
Songan Zhang
Kuan-Hui Lee
H. Peng
33
10
0
23 Feb 2021
Reinforcement Learning with Prototypical Representations
Denis Yarats
Rob Fergus
A. Lazaric
Lerrel Pinto
SSL
83
226
0
22 Feb 2021
Training a Resilient Q-Network against Observational Interference
Chao-Han Huck Yang
I-Te Danny Hung
Ouyang Yi
Pin-Yu Chen
OOD
61
15
0
18 Feb 2021
Adaptive Rational Activations to Boost Deep Reinforcement Learning
Quentin Delfosse
P. Schramowski
Martin Mundt
Alejandro Molina
Kristian Kersting
139
15
0
18 Feb 2021
Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving
Haochen Liu
Zhiyu Huang
Jingda Wu
Chen Lv
92
74
0
18 Feb 2021
Continuous Doubly Constrained Batch Reinforcement Learning
Rasool Fakoor
Jonas W. Mueller
Kavosh Asadi
Pratik Chaudhari
Alex Smola
OffRL
286
27
0
18 Feb 2021
SCAPE: Learning Stiffness Control from Augmented Position Control Experiences
Mincheol Kim
S. Niekum
A. Deshpande
76
4
0
16 Feb 2021
TradeR: Practical Deep Hierarchical Reinforcement Learning for Trade Execution
Karush Suri
Xiaolong Shi
Konstantinos Plataniotis
Y. Lawryshyn
OffRL
42
4
0
16 Feb 2021
Steadily Learn to Drive with Virtual Memory
Yuhang Zhang
Yao Mu
Yujie Yang
Yang Guan
Shengbo Eben Li
Qi Sun
Jianyu Chen
40
1
0
16 Feb 2021
Zero-Shot Adaptation for mmWave Beam-Tracking on Overhead Messenger Wires through Robust Adversarial Reinforcement Learning
Masao Shinzaki
Yusuke Koda
Koji Yamamoto
Takayuki Nishio
M. Morikura
Y. Shirato
D. Uchida
N. Kita
30
7
0
16 Feb 2021
Transferring Domain Knowledge with an Adviser in Continuous Tasks
Rukshan Wijesinghe
Kasun Vithanage
Dumindu Tissera
A. Xavier
Subha Fernando
Jayathu Samarawickrama
CLL
43
0
0
16 Feb 2021
Training Larger Networks for Deep Reinforcement Learning
Keita Ota
Devesh K. Jha
Asako Kanezaki
OffRL
97
40
0
16 Feb 2021
Reinforcement Learning for IoT Security: A Comprehensive Survey
Aashma Uprety
D. Rawat
AAML
93
124
0
14 Feb 2021
Q-Value Weighted Regression: Reinforcement Learning with Limited Data
Piotr Kozakowski
Lukasz Kaiser
Henryk Michalewski
Afroz Mohiuddin
Katarzyna Kañska
OffRL
73
5
0
12 Feb 2021
Generalizing Decision Making for Automated Driving with an Invariant Environment Representation using Deep Reinforcement Learning
Karl Kurzer
Philip Schorner
Alexander Albers
Hauke Thomsen
Karam Daaboul
Johann Marius Zöllner
51
11
0
12 Feb 2021
Hedging of Financial Derivative Contracts via Monte Carlo Tree Search
O. Szehr
AIFin
30
3
0
11 Feb 2021
Personalization for Web-based Services using Offline Reinforcement Learning
P. Apostolopoulos
Zehui Wang
Hanson Wang
Chad Zhou
Kittipat Virochsiri
Norm Zhou
Igor L. Markov
OffRL
OnRL
68
7
0
10 Feb 2021
Adaptive Processor Frequency Adjustment for Mobile Edge Computing with Intermittent Energy Supply
Tiansheng Huang
Weiwei Lin
Ying Li
Xiumin Wang
Qingbo Wu
Rui Li
Ching-Hsien Hsu
Albert Y. Zomaya
52
6
0
10 Feb 2021
Reinforcement Learning For Constraint Satisfaction Game Agents (15-Puzzle, Minesweeper, 2048, and Sudoku)
Anav Mehta
AI4CE
122
4
0
09 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
55
13
0
09 Feb 2021
Deep Reinforcement Learning for the Control of Robotic Manipulation: A Focussed Mini-Review
Rongrong Liu
F. Nageotte
P. Zanne
M. de Mathelin
Birgitta Dresp
104
150
0
08 Feb 2021
MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management
Zhenhan Huang
F. Tanaka
AIFin
91
25
0
06 Feb 2021
Corner Case Generation and Analysis for Safety Assessment of Autonomous Vehicles
Haowei Sun
Shuo Feng
Xintao Yan
Henry X. Liu
AAML
79
54
0
06 Feb 2021
An advantage actor-critic algorithm for robotic motion planning in dense and dynamic scenarios
Chengmin Zhou
Bingding Huang
Pasi Fränti
32
1
0
05 Feb 2021
Experience-Based Heuristic Search: Robust Motion Planning with Deep Q-Learning
Julian Bernhard
Robert Gieselmann
Klemens Esterle
Alois Knoll
42
17
0
05 Feb 2021
Addressing Inherent Uncertainty: Risk-Sensitive Behavior Generation for Automated Driving using Distributional Reinforcement Learning
Julian Bernhard
Stefan Pollok
Alois Knoll
47
33
0
05 Feb 2021
A review of motion planning algorithms for intelligent robotics
Chengmin Zhou
Bingding Huang
Pasi Fränti
63
4
0
04 Feb 2021
DRLDO: A novel DRL based De-ObfuscationSystem for Defense against Metamorphic Malware
Mohit Sewak
S. K. Sahay
Hemant Rathore
23
13
0
01 Feb 2021
Reinforcement Learning for Freight Booking Control Problems
Justin Dumouchelle
Emma Frejinger
Andrea Lodi
59
1
0
29 Jan 2021
Acting in Delayed Environments with Non-Stationary Markov Policies
E. Derman
Gal Dalal
Shie Mannor
84
34
0
28 Jan 2021
Learning Abstract Representations through Lossy Compression of Multi-Modal Signals
Charles Wilmot
Gianluca Baldassarre
Jochen Triesch
33
5
0
27 Jan 2021
Reinforcement Learning for Selective Key Applications in Power Systems: Recent Advances and Future Challenges
Xin Chen
Guannan Qu
Yujie Tang
S. Low
Na Li
83
239
0
27 Jan 2021
The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors
William H. Guss
Cayden R. Codel
Katja Hofmann
Brandon Houghton
Noburu Kuno
...
John Schulman
Manuela Veloso
Nicholay Topin
Avinash Ummadisingu
Phillip Wang
OffRL
83
65
0
26 Jan 2021
Learning Synthetic Environments for Reinforcement Learning with Evolution Strategies
Fabio Ferreira
Thomas Nierhoff
Frank Hutter
69
8
0
24 Jan 2021
Solving optimal stopping problems with Deep Q-Learning
John Ery
Loris Michel
14
6
0
24 Jan 2021
Multi-intersection Traffic Optimisation: A Benchmark Dataset and a Strong Baseline
Hu Wang
Hao Chen
Qi Wu
Congbo Ma
Yidong Li
Chunhua Shen
60
13
0
24 Jan 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Kyunghyun Cho
Martin Riedmiller
OffRL
77
16
0
23 Jan 2021
E-commerce warehousing: learning a storage policy
Adrien Rimélé
P. Grangier
M. Gamache
M. Gendreau
Louis-Martin Rousseau
60
8
0
21 Jan 2021
Stable deep reinforcement learning method by predicting uncertainty in rewards as a subtask
Kanata Suzuki
T. Ogata
78
2
0
18 Jan 2021
A Safe Hierarchical Planning Framework for Complex Driving Scenarios based on Reinforcement Learning
Jinning Li
Liting Sun
Jianyu Chen
Masayoshi Tomizuka
Wei Zhan
68
47
0
17 Jan 2021
Adaptive Remote Sensing Image Attribute Learning for Active Object Detection
Nuo Xu
Chunlei Huo
Jiacheng Guo
Yiwei Liu
Jian Wang
Chunhong Pan
ObjD
52
4
0
16 Jan 2021
Previous
1
2
3
...
26
27
28
...
44
45
46
Next