ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.06461
  4. Cited By
Deep Reinforcement Learning with Double Q-learning
v1v2v3 (latest)

Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown
Title
Off-Belief Learning
Off-Belief Learning
Hengyuan Hu
Adam Lerer
Brandon Cui
David J. Wu
Luis Pineda
Noam Brown
Jakob N. Foerster
OffRL
63
73
0
06 Mar 2021
Deep reinforcement learning in medical imaging: A literature review
Deep reinforcement learning in medical imaging: A literature review
S. Kevin Zhou
Hoang Ngan Le
Khoa Luu
Hien V Nguyen
N. Ayache
LM&MAOffRLMedIm
77
149
0
05 Mar 2021
Improving Computational Efficiency in Visual Reinforcement Learning via
  Stored Embeddings
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings
Lili Chen
Kimin Lee
A. Srinivas
Pieter Abbeel
OffRL
70
11
0
04 Mar 2021
Hierarchical and Partially Observable Goal-driven Policy Learning with
  Goals Relational Graph
Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph
Xin Ye
Yezhou Yang
102
25
0
01 Mar 2021
Decision Making in Monopoly using a Hybrid Deep Reinforcement Learning
  Approach
Decision Making in Monopoly using a Hybrid Deep Reinforcement Learning Approach
Trevor Bonjour
Marina Haliem
A. Alsalem
Shilpa Thomas
Hongyu Li
Vaneet Aggarwal
Mayank Kejriwal
Bharat K. Bhargava
97
15
0
01 Mar 2021
Ensemble Bootstrapping for Q-Learning
Ensemble Bootstrapping for Q-Learning
Oren Peer
Chen Tessler
Nadav Merlis
Ron Meir
87
42
0
28 Feb 2021
Sequential Learning-based IaaS Composition
Sequential Learning-based IaaS Composition
Sajib Mistry
Sheik Mohammad Mostakim Fattah
A. Bouguettaya
54
0
0
24 Feb 2021
Balancing Rational and Other-Regarding Preferences in
  Cooperative-Competitive Environments
Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments
Dmitry Ivanov
Vladimir Egorov
A. Shpilman
55
5
0
24 Feb 2021
Greedy-Step Off-Policy Reinforcement Learning
Greedy-Step Off-Policy Reinforcement Learning
Yuhui Wang
Qingyuan Wu
Pengcheng He
Xiaoyang Tan
OffRL
57
1
0
23 Feb 2021
An Interaction-aware Evaluation Method for Highly Automated Vehicles
An Interaction-aware Evaluation Method for Highly Automated Vehicles
Xinpeng Wang
Songan Zhang
Kuan-Hui Lee
H. Peng
33
10
0
23 Feb 2021
Reinforcement Learning with Prototypical Representations
Reinforcement Learning with Prototypical Representations
Denis Yarats
Rob Fergus
A. Lazaric
Lerrel Pinto
SSL
83
226
0
22 Feb 2021
Training a Resilient Q-Network against Observational Interference
Training a Resilient Q-Network against Observational Interference
Chao-Han Huck Yang
I-Te Danny Hung
Ouyang Yi
Pin-Yu Chen
OOD
61
15
0
18 Feb 2021
Adaptive Rational Activations to Boost Deep Reinforcement Learning
Adaptive Rational Activations to Boost Deep Reinforcement Learning
Quentin Delfosse
P. Schramowski
Martin Mundt
Alejandro Molina
Kristian Kersting
139
15
0
18 Feb 2021
Improved Deep Reinforcement Learning with Expert Demonstrations for
  Urban Autonomous Driving
Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving
Haochen Liu
Zhiyu Huang
Jingda Wu
Chen Lv
94
74
0
18 Feb 2021
Continuous Doubly Constrained Batch Reinforcement Learning
Continuous Doubly Constrained Batch Reinforcement Learning
Rasool Fakoor
Jonas W. Mueller
Kavosh Asadi
Pratik Chaudhari
Alex Smola
OffRL
286
27
0
18 Feb 2021
SCAPE: Learning Stiffness Control from Augmented Position Control
  Experiences
SCAPE: Learning Stiffness Control from Augmented Position Control Experiences
Mincheol Kim
S. Niekum
A. Deshpande
76
4
0
16 Feb 2021
TradeR: Practical Deep Hierarchical Reinforcement Learning for Trade
  Execution
TradeR: Practical Deep Hierarchical Reinforcement Learning for Trade Execution
Karush Suri
Xiaolong Shi
Konstantinos Plataniotis
Y. Lawryshyn
OffRL
42
4
0
16 Feb 2021
Steadily Learn to Drive with Virtual Memory
Steadily Learn to Drive with Virtual Memory
Yuhang Zhang
Yao Mu
Yujie Yang
Yang Guan
Shengbo Eben Li
Qi Sun
Jianyu Chen
40
1
0
16 Feb 2021
Zero-Shot Adaptation for mmWave Beam-Tracking on Overhead Messenger
  Wires through Robust Adversarial Reinforcement Learning
Zero-Shot Adaptation for mmWave Beam-Tracking on Overhead Messenger Wires through Robust Adversarial Reinforcement Learning
Masao Shinzaki
Yusuke Koda
Koji Yamamoto
Takayuki Nishio
M. Morikura
Y. Shirato
D. Uchida
N. Kita
30
7
0
16 Feb 2021
Transferring Domain Knowledge with an Adviser in Continuous Tasks
Transferring Domain Knowledge with an Adviser in Continuous Tasks
Rukshan Wijesinghe
Kasun Vithanage
Dumindu Tissera
A. Xavier
Subha Fernando
Jayathu Samarawickrama
CLL
43
0
0
16 Feb 2021
Training Larger Networks for Deep Reinforcement Learning
Training Larger Networks for Deep Reinforcement Learning
Keita Ota
Devesh K. Jha
Asako Kanezaki
OffRL
97
40
0
16 Feb 2021
Reinforcement Learning for IoT Security: A Comprehensive Survey
Reinforcement Learning for IoT Security: A Comprehensive Survey
Aashma Uprety
D. Rawat
AAML
93
124
0
14 Feb 2021
Q-Value Weighted Regression: Reinforcement Learning with Limited Data
Q-Value Weighted Regression: Reinforcement Learning with Limited Data
Piotr Kozakowski
Lukasz Kaiser
Henryk Michalewski
Afroz Mohiuddin
Katarzyna Kañska
OffRL
73
5
0
12 Feb 2021
Generalizing Decision Making for Automated Driving with an Invariant
  Environment Representation using Deep Reinforcement Learning
Generalizing Decision Making for Automated Driving with an Invariant Environment Representation using Deep Reinforcement Learning
Karl Kurzer
Philip Schorner
Alexander Albers
Hauke Thomsen
Karam Daaboul
Johann Marius Zöllner
51
11
0
12 Feb 2021
Hedging of Financial Derivative Contracts via Monte Carlo Tree Search
Hedging of Financial Derivative Contracts via Monte Carlo Tree Search
O. Szehr
AIFin
30
3
0
11 Feb 2021
Personalization for Web-based Services using Offline Reinforcement
  Learning
Personalization for Web-based Services using Offline Reinforcement Learning
P. Apostolopoulos
Zehui Wang
Hanson Wang
Chad Zhou
Kittipat Virochsiri
Norm Zhou
Igor L. Markov
OffRLOnRL
68
7
0
10 Feb 2021
Adaptive Processor Frequency Adjustment for Mobile Edge Computing with
  Intermittent Energy Supply
Adaptive Processor Frequency Adjustment for Mobile Edge Computing with Intermittent Energy Supply
Tiansheng Huang
Weiwei Lin
Ying Li
Xiumin Wang
Qingbo Wu
Rui Li
Ching-Hsien Hsu
Albert Y. Zomaya
52
6
0
10 Feb 2021
Reinforcement Learning For Constraint Satisfaction Game Agents
  (15-Puzzle, Minesweeper, 2048, and Sudoku)
Reinforcement Learning For Constraint Satisfaction Game Agents (15-Puzzle, Minesweeper, 2048, and Sudoku)
Anav Mehta
AI4CE
122
4
0
09 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
55
13
0
09 Feb 2021
Deep Reinforcement Learning for the Control of Robotic Manipulation: A
  Focussed Mini-Review
Deep Reinforcement Learning for the Control of Robotic Manipulation: A Focussed Mini-Review
Rongrong Liu
F. Nageotte
P. Zanne
M. de Mathelin
Birgitta Dresp
104
150
0
08 Feb 2021
MSPM: A Modularized and Scalable Multi-Agent Reinforcement
  Learning-based System for Financial Portfolio Management
MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management
Zhenhan Huang
F. Tanaka
AIFin
91
25
0
06 Feb 2021
Corner Case Generation and Analysis for Safety Assessment of Autonomous
  Vehicles
Corner Case Generation and Analysis for Safety Assessment of Autonomous Vehicles
Haowei Sun
Shuo Feng
Xintao Yan
Henry X. Liu
AAML
79
54
0
06 Feb 2021
An advantage actor-critic algorithm for robotic motion planning in dense
  and dynamic scenarios
An advantage actor-critic algorithm for robotic motion planning in dense and dynamic scenarios
Chengmin Zhou
Bingding Huang
Pasi Fränti
32
1
0
05 Feb 2021
Experience-Based Heuristic Search: Robust Motion Planning with Deep
  Q-Learning
Experience-Based Heuristic Search: Robust Motion Planning with Deep Q-Learning
Julian Bernhard
Robert Gieselmann
Klemens Esterle
Alois Knoll
42
17
0
05 Feb 2021
Addressing Inherent Uncertainty: Risk-Sensitive Behavior Generation for
  Automated Driving using Distributional Reinforcement Learning
Addressing Inherent Uncertainty: Risk-Sensitive Behavior Generation for Automated Driving using Distributional Reinforcement Learning
Julian Bernhard
Stefan Pollok
Alois Knoll
47
33
0
05 Feb 2021
A review of motion planning algorithms for intelligent robotics
A review of motion planning algorithms for intelligent robotics
Chengmin Zhou
Bingding Huang
Pasi Fränti
63
4
0
04 Feb 2021
DRLDO: A novel DRL based De-ObfuscationSystem for Defense against
  Metamorphic Malware
DRLDO: A novel DRL based De-ObfuscationSystem for Defense against Metamorphic Malware
Mohit Sewak
S. K. Sahay
Hemant Rathore
23
13
0
01 Feb 2021
Reinforcement Learning for Freight Booking Control Problems
Reinforcement Learning for Freight Booking Control Problems
Justin Dumouchelle
Emma Frejinger
Andrea Lodi
61
1
0
29 Jan 2021
Acting in Delayed Environments with Non-Stationary Markov Policies
Acting in Delayed Environments with Non-Stationary Markov Policies
E. Derman
Gal Dalal
Shie Mannor
84
34
0
28 Jan 2021
Learning Abstract Representations through Lossy Compression of
  Multi-Modal Signals
Learning Abstract Representations through Lossy Compression of Multi-Modal Signals
Charles Wilmot
Gianluca Baldassarre
Jochen Triesch
33
5
0
27 Jan 2021
Reinforcement Learning for Selective Key Applications in Power Systems:
  Recent Advances and Future Challenges
Reinforcement Learning for Selective Key Applications in Power Systems: Recent Advances and Future Challenges
Xin Chen
Guannan Qu
Yujie Tang
S. Low
Na Li
83
239
0
27 Jan 2021
The MineRL 2020 Competition on Sample Efficient Reinforcement Learning
  using Human Priors
The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors
William H. Guss
Cayden R. Codel
Katja Hofmann
Brandon Houghton
Noburu Kuno
...
John Schulman
Manuela Veloso
Nicholay Topin
Avinash Ummadisingu
Phillip Wang
OffRL
83
65
0
26 Jan 2021
Learning Synthetic Environments for Reinforcement Learning with
  Evolution Strategies
Learning Synthetic Environments for Reinforcement Learning with Evolution Strategies
Fabio Ferreira
Thomas Nierhoff
Frank Hutter
69
8
0
24 Jan 2021
Solving optimal stopping problems with Deep Q-Learning
Solving optimal stopping problems with Deep Q-Learning
John Ery
Loris Michel
14
6
0
24 Jan 2021
Multi-intersection Traffic Optimisation: A Benchmark Dataset and a
  Strong Baseline
Multi-intersection Traffic Optimisation: A Benchmark Dataset and a Strong Baseline
Hu Wang
Hao Chen
Qi Wu
Congbo Ma
Yidong Li
Chunhua Shen
60
13
0
24 Jan 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient
  Reinforcement Learning
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Kyunghyun Cho
Martin Riedmiller
OffRL
77
16
0
23 Jan 2021
E-commerce warehousing: learning a storage policy
E-commerce warehousing: learning a storage policy
Adrien Rimélé
P. Grangier
M. Gamache
M. Gendreau
Louis-Martin Rousseau
60
8
0
21 Jan 2021
Stable deep reinforcement learning method by predicting uncertainty in
  rewards as a subtask
Stable deep reinforcement learning method by predicting uncertainty in rewards as a subtask
Kanata Suzuki
T. Ogata
78
2
0
18 Jan 2021
A Safe Hierarchical Planning Framework for Complex Driving Scenarios
  based on Reinforcement Learning
A Safe Hierarchical Planning Framework for Complex Driving Scenarios based on Reinforcement Learning
Jinning Li
Liting Sun
Jianyu Chen
Masayoshi Tomizuka
Wei Zhan
68
47
0
17 Jan 2021
Adaptive Remote Sensing Image Attribute Learning for Active Object
  Detection
Adaptive Remote Sensing Image Attribute Learning for Active Object Detection
Nuo Xu
Chunlei Huo
Jiacheng Guo
Yiwei Liu
Jian Wang
Chunhong Pan
ObjD
52
4
0
16 Jan 2021
Previous
123...262728...444546
Next