ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.06461
  4. Cited By
Deep Reinforcement Learning with Double Q-learning
v1v2v3 (latest)

Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown
Title
Experience Sharing Between Cooperative Reinforcement Learning Agents
Experience Sharing Between Cooperative Reinforcement Learning Agents
Lucas O. Souza
G. Ramos
C. Ralha
49
9
0
06 Nov 2019
Distributional Reward Decomposition for Reinforcement Learning
Distributional Reward Decomposition for Reinforcement Learning
Zichuan Lin
Li Zhao
Derek Yang
Tao Qin
Guangwen Yang
Tie-Yan Liu
OffRL
57
16
0
06 Nov 2019
Fully Parameterized Quantile Function for Distributional Reinforcement
  Learning
Fully Parameterized Quantile Function for Distributional Reinforcement Learning
Derek Yang
Li Zhao
Zichuan Lin
Tao Qin
Jiang Bian
Tie-Yan Liu
OODOffRL
117
136
0
05 Nov 2019
An End-to-End Deep RL Framework for Task Arrangement in Crowdsourcing
  Platforms
An End-to-End Deep RL Framework for Task Arrangement in Crowdsourcing Platforms
Caihua Shan
N. Mamoulis
Reynold Cheng
Guoliang Li
Xiang Li
Yuqiu Qian
OffRL
32
21
0
04 Nov 2019
Online Robustness Training for Deep Reinforcement Learning
Online Robustness Training for Deep Reinforcement Learning
Marc Fischer
M. Mirman
Steven Stalder
Martin Vechev
OnRL
102
41
0
03 Nov 2019
Challenging On Car Racing Problem from OpenAI gym
Challenging On Car Racing Problem from OpenAI gym
Changmao Li
21
1
0
02 Nov 2019
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Mikael Henaff
OffRL
115
32
0
01 Nov 2019
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement
  Learning and Hierarchical Actions Filtering
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement Learning and Hierarchical Actions Filtering
Yuval Heffetz
Roman Vainshtein
Gilad Katz
Lior Rokach
60
40
0
31 Oct 2019
Cascaded LSTMs based Deep Reinforcement Learning for Goal-driven
  Dialogue
Cascaded LSTMs based Deep Reinforcement Learning for Goal-driven Dialogue
Yue Ma
Xiaojie Wang
Zhenjiang Dong
Hong Chen
BDL
36
2
0
31 Oct 2019
Deep Reinforcement Learning with Enhanced Safety for Autonomous Highway
  Driving
Deep Reinforcement Learning with Enhanced Safety for Autonomous Highway Driving
Ali Baheri
S. Nageshrao
H. E. Tseng
Ilya Kolmanovsky
Anouck Girard
Dimitar Filev
54
56
0
28 Oct 2019
Better Exploration with Optimistic Actor-Critic
Better Exploration with Optimistic Actor-Critic
K. Ciosek
Q. Vuong
R. Loftin
Katja Hofmann
77
156
0
28 Oct 2019
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement
  Learning
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen
Zijian Zhou
Ziyi Wang
Che Wang
Yanqiu Wu
George Andriopoulos
OffRL
111
125
0
27 Oct 2019
ZPD Teaching Strategies for Deep Reinforcement Learning from
  Demonstrations
ZPD Teaching Strategies for Deep Reinforcement Learning from Demonstrations
Daniel Seita
David M. Chan
Roshan Rao
Chen Tang
Mandi Zhao
John F. Canny
42
12
0
26 Oct 2019
Robust Model Predictive Shielding for Safe Reinforcement Learning with
  Stochastic Dynamics
Robust Model Predictive Shielding for Safe Reinforcement Learning with Stochastic Dynamics
Shuo Li
Osbert Bastani
77
86
0
24 Oct 2019
Learning Q-network for Active Information Acquisition
Learning Q-network for Active Information Acquisition
Heejin Jeong
Brent Schlotfeldt
Hamed Hassani
M. Morari
Daniel D. Lee
George J. Pappas
50
15
0
23 Oct 2019
Faster and Safer Training by Embedding High-Level Knowledge into Deep
  Reinforcement Learning
Faster and Safer Training by Embedding High-Level Knowledge into Deep Reinforcement Learning
Haodi Zhang
Zihang Gao
Yi Zhou
Haotong Zhang
Kaishun Wu
Fangzhen Lin
AI4CE
59
17
0
22 Oct 2019
HIGhER : Improving instruction following with Hindsight Generation for
  Experience Replay
HIGhER : Improving instruction following with Hindsight Generation for Experience Replay
Geoffrey Cideron
Mathieu Seurin
Florian Strub
Olivier Pietquin
73
37
0
21 Oct 2019
Dealing with Sparse Rewards in Reinforcement Learning
Dealing with Sparse Rewards in Reinforcement Learning
J. Hare
64
80
0
21 Oct 2019
Deep Reinforcement Learning Control of Quantum Cartpoles
Deep Reinforcement Learning Control of Quantum Cartpoles
Zhikang T. Wang
Yuto Ashida
Masahito Ueda
76
40
0
21 Oct 2019
Resource Allocation in Mobility-Aware Federated Learning Networks: A
  Deep Reinforcement Learning Approach
Resource Allocation in Mobility-Aware Federated Learning Networks: A Deep Reinforcement Learning Approach
H. T. Nguyen
Nguyen Cong Luong
Jun Zhao
Chau Yuen
Dusit Niyato
67
58
0
21 Oct 2019
Reverse Experience Replay
Reverse Experience Replay
Egor Rotinov
VLMOffRL
60
11
0
19 Oct 2019
Explainable AI: Deep Reinforcement Learning Agents for Residential
  Demand Side Cost Savings in Smart Grids
Explainable AI: Deep Reinforcement Learning Agents for Residential Demand Side Cost Savings in Smart Grids
Hareesh Kumar
P. Mammen
K. Ramamritham
58
10
0
19 Oct 2019
OffWorld Gym: open-access physical robotics environment for real-world
  reinforcement learning benchmark and research
OffWorld Gym: open-access physical robotics environment for real-world reinforcement learning benchmark and research
Ashish Kumar
Toby Buckley
John B. Lanier
Qiaozhi Wang
A. Kavelaars
Ilya Kuzovkin
OffRL
90
14
0
18 Oct 2019
Graph Convolutional Policy for Solving Tree Decomposition via
  Reinforcement Learning Heuristics
Graph Convolutional Policy for Solving Tree Decomposition via Reinforcement Learning Heuristics
Taras Khakhulin
R. Schutski
Ivan Oseledets
48
0
0
18 Oct 2019
Single Episode Policy Transfer in Reinforcement Learning
Single Episode Policy Transfer in Reinforcement Learning
Jiachen Yang
Brenden K. Petersen
H. Zha
Daniel Faissol
OODOffRL
125
35
0
17 Oct 2019
Adaptive Trade-Offs in Off-Policy Learning
Adaptive Trade-Offs in Off-Policy Learning
Mark Rowland
Will Dabney
Rémi Munos
OffRL
131
22
0
16 Oct 2019
Parallel Exploration via Negatively Correlated Search
Parallel Exploration via Negatively Correlated Search
Peng Yang
Qi Yang
K. Tang
Xin Yao
125
14
0
16 Oct 2019
On the Reduction of Variance and Overestimation of Deep Q-Learning
On the Reduction of Variance and Overestimation of Deep Q-Learning
Mohammed Sabry
A. A. A. Khalifa
OODOffRL
67
12
0
14 Oct 2019
Extracting Incentives from Black-Box Decisions
Extracting Incentives from Black-Box Decisions
Yonadav Shavit
William S. Moses
49
9
0
13 Oct 2019
Autonomous Navigation via Deep Reinforcement Learning for Resource
  Constraint Edge Nodes using Transfer Learning
Autonomous Navigation via Deep Reinforcement Learning for Resource Constraint Edge Nodes using Transfer Learning
Aqeel Anwar
A. Raychowdhury
83
74
0
12 Oct 2019
Orchestrating the Development Lifecycle of Machine Learning-Based IoT
  Applications: A Taxonomy and Survey
Orchestrating the Development Lifecycle of Machine Learning-Based IoT Applications: A Taxonomy and Survey
Bin Qian
Jie Su
Z. Wen
D. N. Jha
Yinhao Li
...
Albert Y. Zomaya
Omer F. Rana
Lizhe Wang
Maciej Koutny
R. Ranjan
71
4
0
11 Oct 2019
Autonomous Driving using Safe Reinforcement Learning by Incorporating a
  Regret-based Human Lane-Changing Decision Model
Autonomous Driving using Safe Reinforcement Learning by Incorporating a Regret-based Human Lane-Changing Decision Model
Dong Chen
Longsheng Jiang
Yue Wang
Zhaojian Li
55
62
0
10 Oct 2019
Agent with Warm Start and Active Termination for Plane Localization in
  3D Ultrasound
Agent with Warm Start and Active Termination for Plane Localization in 3D Ultrasound
Haoran Dou
Xin Yang
Jikuan Qian
Wufeng Xue
Hao Qin
...
Lequan Yu
Shujun Wang
Yi Xiong
Pheng-Ann Heng
Dong Ni
56
30
0
10 Oct 2019
Hierarchical Deep Double Q-Routing
Hierarchical Deep Double Q-Routing
Ramy E. Ali
B. Erman
Ejder Bastug
Bruce Cilli
46
17
0
09 Oct 2019
Learning Visual Affordances with Target-Orientated Deep Q-Network to
  Grasp Objects by Harnessing Environmental Fixtures
Learning Visual Affordances with Target-Orientated Deep Q-Network to Grasp Objects by Harnessing Environmental Fixtures
Hengyue Liang
Xibai Lou
Yang Yang
Changhyun Choi
OOD
70
16
0
09 Oct 2019
Model-based Reinforcement Learning for Predictions and Control for Limit
  Order Books
Model-based Reinforcement Learning for Predictions and Control for Limit Order Books
Haoran Wei
Yuanbo Wang
L. Mangu
Keith S. Decker
65
25
0
09 Oct 2019
Ctrl-Z: Recovering from Instability in Reinforcement Learning
Ctrl-Z: Recovering from Instability in Reinforcement Learning
Vibhavari Dasagi
Jake Bruce
T. Peynot
Jurgen Leitner
53
10
0
09 Oct 2019
Multi-step Greedy Reinforcement Learning Algorithms
Multi-step Greedy Reinforcement Learning Algorithms
Manan Tomar
Yonathan Efroni
Mohammad Ghavamzadeh
91
1
0
07 Oct 2019
Reinforcement Learning with Structured Hierarchical Grammar
  Representations of Actions
Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions
Petros Christodoulou
R. T. Lange
A. Shafti
A. Faisal
43
1
0
07 Oct 2019
Multi-Agent Reinforcement Learning for Order-dispatching via
  Order-Vehicle Distribution Matching
Multi-Agent Reinforcement Learning for Order-dispatching via Order-Vehicle Distribution Matching
Ming Zhou
Jiarui Jin
Weinan Zhang
Zhiwei Qin
Yan Jiao
Chenxi Wang
Guobin Wu
Yong Yu
Jieping Ye
48
89
0
07 Oct 2019
Striving for Simplicity and Performance in Off-Policy DRL: Output
  Normalization and Non-Uniform Sampling
Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling
Che Wang
Yanqiu Wu
Q. Vuong
George Andriopoulos
36
6
0
05 Oct 2019
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from
  forbidden action
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action
Mathieu Seurin
Philippe Preux
Olivier Pietquin
47
12
0
04 Oct 2019
Deep Q-Network for Angry Birds
Deep Q-Network for Angry Birds
L. Sy
S. Redmond
43
5
0
04 Oct 2019
Benchmarking Batch Deep Reinforcement Learning Algorithms
Benchmarking Batch Deep Reinforcement Learning Algorithms
Shih-Han Chou
Wen-Yen Chang
W. Hsu
Jianlong Fu
OffRL
74
185
0
03 Oct 2019
Reducing Overestimation Bias in Multi-Agent Domains Using Double
  Centralized Critics
Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics
J. Ackermann
Pau Cebrian
Antonio Espinosa
Masashi Sugiyama
OffRL
64
122
0
03 Oct 2019
Relationship Explainable Multi-objective Optimization Via Vector Value
  Function Based Reinforcement Learning
Relationship Explainable Multi-objective Optimization Via Vector Value Function Based Reinforcement Learning
Huixin Zhan
Yongcan Cao
66
7
0
02 Oct 2019
Never Worse, Mostly Better: Stable Policy Improvement in Deep
  Reinforcement Learning
Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning
P. Khanna
Guy Tennenholtz
Nadav Merlis
Shie Mannor
Chen Tessler
OffRL
26
1
0
02 Oct 2019
Improving Sample Efficiency in Model-Free Reinforcement Learning from
  Images
Improving Sample Efficiency in Model-Free Reinforcement Learning from Images
Denis Yarats
Amy Zhang
Ilya Kostrikov
Brandon Amos
Joelle Pineau
Rob Fergus
DRL
139
449
0
02 Oct 2019
Reinforcement Learning for Multi-Objective Optimization of Online
  Decisions in High-Dimensional Systems
Reinforcement Learning for Multi-Objective Optimization of Online Decisions in High-Dimensional Systems
Hardik Meisheri
Vinita Baniwal
Nazneen N. Sultana
Balaraman Ravindran
H. Khadilkar
OffRL
28
2
0
01 Oct 2019
Advantage-Weighted Regression: Simple and Scalable Off-Policy
  Reinforcement Learning
Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Xue Bin Peng
Aviral Kumar
Grace Zhang
Sergey Levine
OffRL
174
570
0
01 Oct 2019
Previous
123...353637...444546
Next