1509.06461
Deep Reinforcement Learning with Double Q-learning
22 September 2015
H. V. Hasselt
A. Guez
David Silver
OffRL
Papers citing "Deep Reinforcement Learning with Double Q-learning"
50 / 2,291 papers shown
Title
DQ-GAT: Towards Safe and Efficient Autonomous Driving with Deep Q-Learning and Graph Attention Networks
Peide Cai
Hengli Wang
Yuxiang Sun
Ming-Yuan Liu
GNN
95
39
0
11 Aug 2021
Maximizing Influence with Graph Neural Networks
G. Panagopoulos
Nikolaos Tziortziotis
Michalis Vazirgiannis
Fragkiskos D. Malliaros
29
8
0
10 Aug 2021
A Survey on Deep Reinforcement Learning for Data Processing and Analytics
Qingpeng Cai
Can Cui
Yiyuan Xiong
Wei Wang
Zhongle Xie
Meihui Zhang
OffRL
58
31
0
10 Aug 2021
Deep Reinforcement Learning for Demand Driven Services in Logistics and Transportation Systems: A Survey
Zefang Zong
Tao Feng
Tong Xia
Depeng Jin
Yong Li
52
3
0
10 Aug 2021
Modified Double DQN: addressing stability
Shervin Halat
M. Ebadzadeh
27
2
0
09 Aug 2021
Distilling Neuron Spike with High Temperature in Reinforcement Learning Agents
Ling Zhang
Jian Cao
Yuan Zhang
Bohan Zhou
Shuo Feng
46
9
0
05 Aug 2021
High Performance Across Two Atari Paddle Games Using the Same Perceptual Control Architecture Without Training
T. Gulrez
W. Mansell
26
0
0
04 Aug 2021
RAIN: Reinforced Hybrid Attention Inference Network for Motion Forecasting
Jiachen Li
Fan Yang
Hengbo Ma
Srikanth Malla
Masayoshi Tomizuka
Chiho Choi
91
42
0
03 Aug 2021
Flip Learning: Erase to Segment
Yuhao Huang
Xin Yang
Yuxin Zou
Chaoyu Chen
Jian Wang
Haoran Dou
Nishant Ravikumar
Alejandro F Frangi
Jianqiao Zhou
Dong Ni
40
9
0
02 Aug 2021
Learning to Control DC Motor for Micromobility in Real Time with Reinforcement Learning
Bibek Poudel
Thomas Watson
Weizi Li
60
14
0
31 Jul 2021
Value-Based Reinforcement Learning for Continuous Control Robotic Manipulation in Multi-Task Sparse Reward Settings
Sreehari Rammohan
Shangqun Yu
Bowen He
Eric Hsiung
Eric Rosen
Stefanie Tellex
George Konidaris
OffRL
18
4
0
28 Jul 2021
Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning
Pedro Tsividis
J. Loula
Jake Burga
Nathan Foss
Andres Campero
Thomas Pouncy
S. Gershman
J. Tenenbaum
LM&Ro
59
48
0
27 Jul 2021
An Improved Algorithm of Robot Path Planning in Complex Environment Based on Double DQN
Fei Zhang
Chaochen Gu
Fengming Yang
16
11
0
23 Jul 2021
Trajectory Design for UAV-Based Internet-of-Things Data Collection: A Deep Reinforcement Learning Approach
Yang Wang
Zhen Gao
Jun Zhang
Xianbin Cao
Dezhi Zheng
Yue Gao
Derrick Wing Kwan Ng
M. Di Renzo
70
99
0
23 Jul 2021
A reinforcement learning approach to resource allocation in genomic selection
Saba Moeinizade
Guiping Hu
Lizhi Wang
60
15
0
22 Jul 2021
A Deep Reinforcement Learning Approach for Fair Traffic Signal Control
Majid Raeis
A. Leon-Garcia
48
13
0
21 Jul 2021
Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics
Krishan Rana
Vibhavari Dasagi
Jesse Haviland
Ben Talbot
Michael Milford
Niko Sünderhauf
BDL
OffRL
76
34
0
21 Jul 2021
Learning Altruistic Behaviours in Reinforcement Learning without External Rewards
Tim Franzmeyer
Mateusz Malinowski
João F. Henriques
58
8
0
20 Jul 2021
Active 3D Shape Reconstruction from Vision and Touch
Edward James Smith
David Meger
Luis Villaseñor-Pineda
Roberto Calandra
Jitendra Malik
Adriana Romero
M. Drozdzal
93
47
0
20 Jul 2021
Constrained Policy Gradient Method for Safe and Fast Reinforcement Learning: a Neural Tangent Kernel Based Approach
B. Varga
Balázs Kulcsár
M. Chehreghani
70
1
0
19 Jul 2021
Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration
Lukas Schafer
Filippos Christianos
Josiah P. Hanna
Stefano V. Albrecht
92
23
0
19 Jul 2021
High-level Decisions from a Safe Maneuver Catalog with Reinforcement Learning for Safe and Cooperative Automated Merging
Danial Kamran
Yu Ren
Martin Lauer
52
10
0
15 Jul 2021
A Reinforcement Learning Environment for Mathematical Reasoning via Program Synthesis
Joseph Palermo
Johnny Ye
Alok Singh
AIMat
97
2
0
15 Jul 2021
The Benchmark Lottery
Mostafa Dehghani
Yi Tay
A. Gritsenko
Zhe Zhao
N. Houlsby
Fernando Diaz
Donald Metzler
Oriol Vinyals
117
92
0
14 Jul 2021
QoS-Aware Scheduling in New Radio Using Deep Reinforcement Learning
Jakob Stigenberg
Vidit Saxena
Soma Tayamon
E. Ghadimi
28
3
0
14 Jul 2021
Transfer Learning in Multi-Agent Reinforcement Learning with Double Q-Networks for Distributed Resource Sharing in V2X Communication
Hammad Zafar
Zoran Utkovski
Martin Kasparick
S. Stańczak
OffRL
29
3
0
13 Jul 2021
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
106
83
0
12 Jul 2021
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning
Erdun Gao
Fan Feng
Chaochao Lu
Sara Magliacane
Kun Zhang
106
68
0
06 Jul 2021
Fast-Learning Grasping and Pre-Grasping via Clutter Quantization and Q-map Masking
Dafa Ren
Xiaoqiang Ren
Xiaofan Wang
Sundara Tejaswi Digumarti
Guodong Shi
21
9
0
06 Jul 2021
Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning
Muhammad Rizki Maulana
W. Lee
46
1
0
05 Jul 2021
Low-Dimensional State and Action Representation Learning with MDP Homomorphism Metrics
N. Botteghi
M. Poel
B. Sirmaçek
C. Brune
58
3
0
04 Jul 2021
Hierarchical Policies for Cluttered-Scene Grasping with Latent Plans
Lirui Wang
Xiangyun Meng
Yu Xiang
Dieter Fox
3DPC
DRL
73
27
0
04 Jul 2021
A Novel Deep Reinforcement Learning Based Stock Direction Prediction using Knowledge Graph and Community Aware Sentiments
Anil Berk Altuner
Zeynep Hilal Kilimci
AIFin
29
15
0
02 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
110
137
0
01 Jul 2021
Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble
Seunghyun Lee
Younggyo Seo
Kimin Lee
Pieter Abbeel
Jinwoo Shin
OffRL
OnRL
76
192
0
01 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
169
101
0
01 Jul 2021
Drone swarm patrolling with uneven coverage requirements
C. Piciarelli
G. Foresti
AI4TS
32
9
0
01 Jul 2021
Deep Multiagent Reinforcement Learning: Challenges and Directions
Annie Wong
Thomas Bäck
Anna V. Kononova
Aske Plaat
AI4CE
116
97
0
29 Jun 2021
Convergent and Efficient Deep Q Network Algorithm
Zhikang T. Wang
Masahito Ueda
80
12
0
29 Jun 2021
Expert Q-learning: Deep Reinforcement Learning with Coarse State Values from Offline Expert Examples
Li Meng
Anis Yazidi
Morten Goodwin
P. Engelstad
OffRL
26
2
0
28 Jun 2021
Reinforcement Learning for Physical Layer Communications
P. Mary
V. Koivunen
C. Moy
AI4CE
38
3
0
22 Jun 2021
Analytically Tractable Bayesian Deep Q-Learning
Luong Ha
L. Nguyen
J. Goulet
BDL
OffRL
35
2
0
21 Jun 2021
Hi-Phy: A Benchmark for Hierarchical Physical Reasoning
Cheng Xue
Vimukthini Pinto
C. Gamage
Peng Zhang
Jochen Renz
59
0
0
17 Jun 2021
Modelling resource allocation in uncertain system environment through deep reinforcement learning
Neel Gandhi
Shakti Mishra
32
1
0
17 Jun 2021
CROP: Certifying Robust Policies for Reinforcement Learning through Functional Smoothing
Fan Wu
Linyi Li
Zijian Huang
Yevgeniy Vorobeychik
Ding Zhao
Yue Liu
AAML
OffRL
85
61
0
17 Jun 2021
Learning Robot Exploration Strategy with 4D Point-Clouds-like Information as Observations
Zhaoting Li
Tingguang Li
Jiankun Wang
Max Meng
3DPC
48
2
0
17 Jun 2021
Offline RL Without Off-Policy Evaluation
David Brandfonbrener
William F. Whitney
Rajesh Ranganath
Joan Bruna
OffRL
110
170
0
16 Jun 2021
Analysis and Optimisation of Bellman Residual Errors with Neural Function Approximation
Martin Gottwald
Sven Gronauer
Hao Shen
Klaus Diepold
36
3
0
16 Jun 2021
User-Guided Personalized Image Aesthetic Assessment based on Deep Reinforcement Learning
Pei Lv
Jianqing Fan
Xixi Nie
Weiming Dong
Xiaoheng Jiang
Bing Zhou
Mingliang Xu
Changsheng Xu
57
29
0
14 Jun 2021
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
Scott Fujimoto
David Meger
Doina Precup
76
17
0
12 Jun 2021