ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.06461
  4. Cited By
Deep Reinforcement Learning with Double Q-learning
v1v2v3 (latest)

Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown
Title
Low Dimensional State Representation Learning with Reward-shaped Priors
Low Dimensional State Representation Learning with Reward-shaped Priors
N. Botteghi
Ruben Obbink
D. Geijs
M. Poel
B. Sirmaçek
C. Brune
A. Mersha
Stefano Stramigioli
SSLOffRL
44
4
0
29 Jul 2020
FlexPool: A Distributed Model-Free Deep Reinforcement Learning Algorithm
  for Joint Passengers & Goods Transportation
FlexPool: A Distributed Model-Free Deep Reinforcement Learning Algorithm for Joint Passengers & Goods Transportation
Kaushik Manchella
A. Umrawal
Vaneet Aggarwal
77
62
0
27 Jul 2020
Variance Reduction for Deep Q-Learning using Stochastic Recursive
  Gradient
Variance Reduction for Deep Q-Learning using Stochastic Recursive Gradient
Hao Jia
Xiao Zhang
Jun Xu
Wei Zeng
Hao Jiang
Xiao Yan
Ji-Rong Wen
79
3
0
25 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control
Maximum Mutation Reinforcement Learning for Scalable Control
Karush Suri
Xiaolong Shi
Konstantinos N. Plataniotis
Y. Lawryshyn
74
4
0
24 Jul 2020
Distributional Reinforcement Learning via Moment Matching
Distributional Reinforcement Learning via Moment Matching
Thanh Tang Nguyen
Sunil R. Gupta
Svetha Venkatesh
OOD
20
22
0
24 Jul 2020
Bridging the Imitation Gap by Adaptive Insubordination
Bridging the Imitation Gap by Adaptive Insubordination
Luca Weihs
Unnat Jain
Iou-Jen Liu
Jordi Salvador
Svetlana Lazebnik
Aniruddha Kembhavi
Alex Schwing
89
36
0
23 Jul 2020
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline
  and Online RL
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
295
122
0
21 Jul 2020
Adaptive Traffic Control with Deep Reinforcement Learning: Towards
  State-of-the-art and Beyond
Adaptive Traffic Control with Deep Reinforcement Learning: Towards State-of-the-art and Beyond
Siavash Alemzadeh
Ramin Moslemi
Ratnesh K. Sharma
M. Mesbahi
OffRL
28
5
0
21 Jul 2020
UAV Target Tracking in Urban Environments Using Deep Reinforcement
  Learning
UAV Target Tracking in Urban Environments Using Deep Reinforcement Learning
Sarthak Bhagat
Sujit PB
76
50
0
21 Jul 2020
Integrating Deep Reinforcement Learning Networks with Health System
  Simulations
Integrating Deep Reinforcement Learning Networks with Health System Simulations
Michael Allen
T. Monks
AI4CE
41
4
0
21 Jul 2020
Active MR k-space Sampling with Reinforcement Learning
Active MR k-space Sampling with Reinforcement Learning
Luis Villaseñor-Pineda
Sumana Basu
Adriana Romero
Roberto Calandra
M. Drozdzal
83
71
0
20 Jul 2020
Multi-robot Cooperative Object Transportation using Decentralized Deep
  Reinforcement Learning
Multi-robot Cooperative Object Transportation using Decentralized Deep Reinforcement Learning
Lin Zhang
Hao Xiong
Ou Ma
Zhaokui Wang
109
6
0
17 Jul 2020
Collision Avoidance Robotics Via Meta-Learning (CARML)
Collision Avoidance Robotics Via Meta-Learning (CARML)
A. Iyer
Aravind Mahadevan
37
3
0
16 Jul 2020
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
82
78
0
16 Jul 2020
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
Liang Liu
Hao Lu
Hongwei Zou
Haipeng Xiong
Zhiguo Cao
Chunhua Shen
OffRL
78
72
0
16 Jul 2020
Mixture of Step Returns in Bootstrapped DQN
Mixture of Step Returns in Bootstrapped DQN
Po-Han Chiang
Hsuan-Kung Yang
Zhang-Wei Hong
Chun-Yi Lee
40
4
0
16 Jul 2020
Odyssey: Creation, Analysis and Detection of Trojan Models
Odyssey: Creation, Analysis and Detection of Trojan Models
Marzieh Edraki
Nazmul Karim
Nazanin Rahnavard
Ajmal Mian
M. Shah
AAML
97
14
0
16 Jul 2020
Information Freshness-Aware Task Offloading in Air-Ground Integrated
  Edge Computing Systems
Information Freshness-Aware Task Offloading in Air-Ground Integrated Edge Computing Systems
Xianfu Chen
Celimuge Wu
Tao Chen
Zhi Liu
Honggang Zhang
M. Bennis
Hang Liu
Yusheng Ji
85
71
0
15 Jul 2020
Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep
  Reinforcement Learning
Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning
Sabrina Hoppe
Marc Toussaint
OffRL
44
7
0
15 Jul 2020
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient
  Descent
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
51
25
0
15 Jul 2020
Robustifying Reinforcement Learning Agents via Action Space Adversarial
  Training
Robustifying Reinforcement Learning Agents via Action Space Adversarial Training
Kai Liang Tan
Yasaman Esfandiari
Xian Yeow Lee
Aakanksha
Soumik Sarkar
AAML
135
57
0
14 Jul 2020
Revisiting Fundamentals of Experience Replay
Revisiting Fundamentals of Experience Replay
W. Fedus
Prajit Ramachandran
Rishabh Agarwal
Yoshua Bengio
Hugo Larochelle
Mark Rowland
Will Dabney
KELMOffRL
97
242
0
13 Jul 2020
Reinforcement Learning of Musculoskeletal Control from Functional
  Simulations
Reinforcement Learning of Musculoskeletal Control from Functional Simulations
Emanuel Joos
Fabien Péan
Orçun Göksel
AI4CE
75
12
0
13 Jul 2020
Designing Personalized Interaction of a Socially Assistive Robot for
  Stroke Rehabilitation Therapy
Designing Personalized Interaction of a Socially Assistive Robot for Stroke Rehabilitation Therapy
Min Hun Lee
D. Siewiorek
A. Smailagic
Alexandre Bernardino
S. Bermúdez i Badia
10
4
0
13 Jul 2020
Implicit Distributional Reinforcement Learning
Implicit Distributional Reinforcement Learning
Yuguang Yue
Zhendong Wang
Mingyuan Zhou
OffRL
86
16
0
13 Jul 2020
An Equivalence between Loss Functions and Non-Uniform Sampling in
  Experience Replay
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay
Scott Fujimoto
David Meger
Doina Precup
96
58
0
12 Jul 2020
Data-Efficient Reinforcement Learning with Self-Predictive
  Representations
Data-Efficient Reinforcement Learning with Self-Predictive Representations
Max Schwarzer
Ankesh Anand
Rishab Goel
R. Devon Hjelm
Aaron Courville
Philip Bachman
118
321
0
12 Jul 2020
Learning Abstract Models for Strategic Exploration and Fast Reward
  Transfer
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer
Emmy Liu
Ramtin Keramati
Sudarshan Seshadri
Kelvin Guu
Panupong Pasupat
Emma Brunskill
Percy Liang
OffRL
69
6
0
12 Jul 2020
Simulating multi-exit evacuation using deep reinforcement learning
Simulating multi-exit evacuation using deep reinforcement learning
Dong Xu
Xiao Shi Huang
Joseph D Mango
Xiang Li
Zhenlong Li
25
23
0
11 Jul 2020
The Mean-Squared Error of Double Q-Learning
The Mean-Squared Error of Double Q-Learning
Wentao Weng
Harsh Gupta
Niao He
Lei Ying
R. Srikant
57
17
0
09 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep
  Reinforcement Learning
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
113
205
0
09 Jul 2020
On the Reliability and Generalizability of Brain-inspired Reinforcement
  Learning Algorithms
On the Reliability and Generalizability of Brain-inspired Reinforcement Learning Algorithms
Dongjae Kim
J. Lee
J. Shin
M. Yang
Sang Wan Lee
OffRL
25
2
0
09 Jul 2020
Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for
  DNN Workloads
Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads
Siyu Wang
Yi Rong
Shiqing Fan
Zhen Zheng
Lansong Diao
Guoping Long
Jun Yang
Xiaoyong Liu
Wei Lin
68
9
0
08 Jul 2020
Double Prioritized State Recycled Experience Replay
Double Prioritized State Recycled Experience Replay
Fanchen Bu
D. Chang
OffRL
40
11
0
08 Jul 2020
Meta-active Learning in Probabilistically-Safe Optimization
Meta-active Learning in Probabilistically-Safe Optimization
Mariah L. Schrum
M. Connolly
Eric R. Cole
Mihir Ghetiya
R. Gross
Matthew C. Gombolay
66
12
0
07 Jul 2020
Off-Policy Evaluation via the Regularized Lagrangian
Off-Policy Evaluation via the Regularized Lagrangian
Mengjiao Yang
Ofir Nachum
Bo Dai
Lihong Li
Dale Schuurmans
OffRL
56
118
0
07 Jul 2020
Deep Reinforcement Learning with Interactive Feedback in a Human-Robot
  Environment
Deep Reinforcement Learning with Interactive Feedback in a Human-Robot Environment
Ithan Moreira
Javier Rivas
Francisco Cruz
Richard Dazeley
Angel Ayala
Bruno José Torres Fernandes
42
35
0
07 Jul 2020
Predictive Maintenance for Edge-Based Sensor Networks: A Deep
  Reinforcement Learning Approach
Predictive Maintenance for Edge-Based Sensor Networks: A Deep Reinforcement Learning Approach
Kevin Shen-Hoong Ong
Dusit Niyato
Chau Yuen
16
24
0
07 Jul 2020
Cognitive Radio Network Throughput Maximization with Deep Reinforcement
  Learning
Cognitive Radio Network Throughput Maximization with Deep Reinforcement Learning
Kevin Shen-Hoong Ong
Yang Zhang
Dusit Niyato
18
2
0
07 Jul 2020
Reward Machines for Cooperative Multi-Agent Reinforcement Learning
Reward Machines for Cooperative Multi-Agent Reinforcement Learning
Cyrus Neary
Zhe Xu
Bo Wu
Ufuk Topcu
83
47
0
03 Jul 2020
Expected Eligibility Traces
Expected Eligibility Traces
H. V. Hasselt
Sephora Madjiheurem
Matteo Hessel
David Silver
André Barreto
Diana Borsa
64
38
0
03 Jul 2020
Hedging using reinforcement learning: Contextual $k$-Armed Bandit versus
  $Q$-learning
Hedging using reinforcement learning: Contextual kkk-Armed Bandit versus QQQ-learning
Loris Cannelli
Giuseppe Nuti
M. Sala
O. Szehr
OffRL
79
12
0
03 Jul 2020
Dueling Deep Q-Network for Unsupervised Inter-frame Eye Movement
  Correction in Optical Coherence Tomography Volumes
Dueling Deep Q-Network for Unsupervised Inter-frame Eye Movement Correction in Optical Coherence Tomography Volumes
Y. George
S. Sedai
B. Antony
H. Ishikawa
Gadi Wollstein
J. Schuman
R. Garnavi
MedIm
11
2
0
03 Jul 2020
Decentralized Deep Reinforcement Learning for Network Level Traffic
  Signal Control
Decentralized Deep Reinforcement Learning for Network Level Traffic Signal Control
Jinqiu Guo
23
1
0
02 Jul 2020
ε-BMC: A Bayesian Ensemble Approach to Epsilon-Greedy
  Exploration in Model-Free Reinforcement Learning
ε-BMC: A Bayesian Ensemble Approach to Epsilon-Greedy Exploration in Model-Free Reinforcement Learning
Michael Gimelfarb
Scott Sanner
Chi-Guhn Lee
29
15
0
02 Jul 2020
UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement
  Learning Approach
UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach
Harald Bayerlein
Mirco Theile
Marco Caccamo
David Gesbert
75
56
0
01 Jul 2020
Convex Regularization in Monte-Carlo Tree Search
Convex Regularization in Monte-Carlo Tree Search
Tuan Dam
Carlo DÉramo
Jan Peters
Joni Pajarinen
OffRL
75
11
0
01 Jul 2020
Group Equivariant Deep Reinforcement Learning
Group Equivariant Deep Reinforcement Learning
Arnab Kumar Mondal
Pratheeksha Nair
Kaleem Siddiqi
61
33
0
01 Jul 2020
Regularly Updated Deterministic Policy Gradient Algorithm
Regularly Updated Deterministic Policy Gradient Algorithm
Shuai Han
Wenbo Zhou
Shuai Lu
Jiayu Yu
23
22
0
01 Jul 2020
Deep reinforcement learning approach to MIMO precoding problem:
  Optimality and Robustness
Deep reinforcement learning approach to MIMO precoding problem: Optimality and Robustness
Heunchul Lee
Maksym A. Girnyk
Jaeseong Jeong
41
15
0
30 Jun 2020
Previous
123...303132...444546
Next