Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 1,039 papers shown
Temporal Shift Reinforcement Learning
Deep Thomas
Tichakorn Wongpiromsarn
Ali Jannesari
OffRL
31
0
0
05 Sep 2021
An Exploration of Deep Learning Methods in Hungry Geese
Nikzad Khani
Matthew Kluska
17
0
0
05 Sep 2021
Event-Based Communication in Distributed Q-Learning
Daniel Jarne Ornia
M. Mazo
33
1
0
03 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
43
81
0
01 Sep 2021
Reinforcement Learning based Condition-oriented Maintenance Scheduling for Flow Line Systems
Raphael Lamprecht
Ferdinand Wurst
Marco F. Huber
22
3
0
27 Aug 2021
WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving
Arjit Sharma
Sahil Sharma
11
3
0
27 Aug 2021
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges
Jiaju Qi
Qihao Zhou
Lei Lei
Kan Zheng
FedML
39
147
0
26 Aug 2021
No DBA? No regret! Multi-armed bandits for index tuning of analytical and HTAP workloads with provable guarantees
R. Perera
Bastian Oetomo
Benjamin I. P. Rubinstein
Renata Borovica-Gajic
24
8
0
23 Aug 2021
Personalized next-best action recommendation with multi-party interaction learning for automated decision-making
LongBing Cao
Chengzhang Zhu
22
8
0
19 Aug 2021
Optimal Actor-Critic Policy with Optimized Training Datasets
C. Banerjee
Zhiyong Chen
N. Noman
M. Zamani
OffRL
40
7
0
16 Aug 2021
DQN Control Solution for KDD Cup 2021 City Brain Challenge
Yitian Chen
Kunlong Chen
Kunjin Chen
Lin Wang
27
0
0
14 Aug 2021
Reinforcement Learning Approach to Active Learning for Image Classification
Thorben Werner
16
1
0
12 Aug 2021
Graph Attention Network-based Multi-agent Reinforcement Learning for Slicing Resource Management in Dense Cellular Network
Yan Shao
Rongpeng Li
Bing Hu
Yingxiao Wu
Zhifeng Zhao
Honggang Zhang
36
46
0
11 Aug 2021
DQ-GAT: Towards Safe and Efficient Autonomous Driving with Deep Q-Learning and Graph Attention Networks
Peide Cai
Hengli Wang
Yuxiang Sun
Ming-Yuan Liu
GNN
56
39
0
11 Aug 2021
A Survey on Deep Reinforcement Learning for Data Processing and Analytics
Qingpeng Cai
Can Cui
Yiyuan Xiong
Wei Wang
Zhongle Xie
Meihui Zhang
OffRL
26
29
0
10 Aug 2021
Deep Reinforcement Learning for Demand Driven Services in Logistics and Transportation Systems: A Survey
Zefang Zong
Tao Feng
Tong Xia
Depeng Jin
Yong Li
27
3
0
10 Aug 2021
Modified Double DQN: addressing stability
Shervin Halat
M. Ebadzadeh
20
2
0
09 Aug 2021
Distilling Neuron Spike with High Temperature in Reinforcement Learning Agents
Ling Zhang
Jian Cao
Yuan Zhang
Bohan Zhou
Shuo Feng
27
9
0
05 Aug 2021
High Performance Across Two Atari Paddle Games Using the Same Perceptual Control Architecture Without Training
T. Gulrez
W. Mansell
24
0
0
04 Aug 2021
RAIN: Reinforced Hybrid Attention Inference Network for Motion Forecasting
Jiachen Li
Fan Yang
Hengbo Ma
Srikanth Malla
Masayoshi Tomizuka
Chiho Choi
29
42
0
03 Aug 2021
Flip Learning: Erase to Segment
Yuhao Huang
Xin Yang
Yuxin Zou
Chaoyu Chen
Jian Wang
Haoran Dou
Nishant Ravikumar
Alejandro F Frangi
Jianqiao Zhou
Dong Ni
21
9
0
02 Aug 2021
Learning to Control DC Motor for Micromobility in Real Time with Reinforcement Learning
Bibek Poudel
Thomas Watson
Weizi Li
29
13
0
31 Jul 2021
Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning
Pedro Tsividis
J. Loula
Jake Burga
Nathan Foss
Andres Campero
Thomas Pouncy
S. Gershman
J. Tenenbaum
LM&Ro
26
46
0
27 Jul 2021
Trajectory Design for UAV-Based Internet-of-Things Data Collection: A Deep Reinforcement Learning Approach
Yang Wang
Zhen Gao
Jun Zhang
Xianbin Cao
Dezhi Zheng
Yue Gao
Derrick Wing Kwan Ng
M. Di Renzo
49
94
0
23 Jul 2021
Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics
Krishan Rana
Vibhavari Dasagi
Jesse Haviland
Ben Talbot
Michael Milford
Niko Sünderhauf
BDL
OffRL
32
31
0
21 Jul 2021
Active 3D Shape Reconstruction from Vision and Touch
Edward James Smith
David Meger
Luis Villaseñor-Pineda
Roberto Calandra
Jitendra Malik
Adriana Romero
M. Drozdzal
38
45
0
20 Jul 2021
Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration
Lukas Schafer
Filippos Christianos
Josiah P. Hanna
Stefano V. Albrecht
55
22
0
19 Jul 2021
High-level Decisions from a Safe Maneuver Catalog with Reinforcement Learning for Safe and Cooperative Automated Merging
Danial Kamran
Yu Ren
Martin Lauer
40
10
0
15 Jul 2021
The Benchmark Lottery
Mostafa Dehghani
Yi Tay
A. Gritsenko
Zhe Zhao
N. Houlsby
Fernando Diaz
Donald Metzler
Oriol Vinyals
61
90
0
14 Jul 2021
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
73
80
0
12 Jul 2021
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning
Erdun Gao
Fan Feng
Chaochao Lu
Sara Magliacane
Kun Zhang
55
66
0
06 Jul 2021
Low-Dimensional State and Action Representation Learning with MDP Homomorphism Metrics
N. Botteghi
M. Poel
B. Sirmaçek
C. Brune
29
3
0
04 Jul 2021
Hierarchical Policies for Cluttered-Scene Grasping with Latent Plans
Lirui Wang
Xiangyun Meng
Yu Xiang
Dieter Fox
3DPC
DRL
26
27
0
04 Jul 2021
A Novel Deep Reinforcement Learning Based Stock Direction Prediction using Knowledge Graph and Community Aware Sentiments
Anil Berk Altuner
Zeynep Hilal Kilimci
AIFin
25
15
0
02 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
54
135
0
01 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
64
95
0
01 Jul 2021
Drone swarm patrolling with uneven coverage requirements
C. Piciarelli
G. Foresti
AI4TS
12
9
0
01 Jul 2021
Convergent and Efficient Deep Q Network Algorithm
Zhikang T. Wang
Masahito Ueda
38
12
0
29 Jun 2021
Hi-Phy: A Benchmark for Hierarchical Physical Reasoning
Cheng Xue
Vimukthini Pinto
C. Gamage
Peng Zhang
Jochen Renz
31
0
0
17 Jun 2021
Offline RL Without Off-Policy Evaluation
David Brandfonbrener
William F. Whitney
Rajesh Ranganath
Joan Bruna
OffRL
47
163
0
16 Jun 2021
Taylor Expansion of Discount Factors
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
44
5
0
11 Jun 2021
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Xia Hu
Ji Liu
30
117
0
11 Jun 2021
InFlow: Robust outlier detection utilizing Normalizing Flows
Nishant Kumar
Pia Hanfeld
Michael Hecht
Michael Bussmann
Stefan Gumhold
Nico Hoffmann
OODD
OOD
TPM
29
4
0
10 Jun 2021
Simplifying Deep Reinforcement Learning via Self-Supervision
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
SSL
54
15
0
10 Jun 2021
TempoRL: Learning When to Act
André Biedenkapp
Raghunandan Rajan
Frank Hutter
Marius Lindauer
OffRL
21
28
0
09 Jun 2021
XIRL: Cross-embodiment Inverse Reinforcement Learning
Kevin Zakka
Andy Zeng
Peter R. Florence
Jonathan Tompson
Jeannette Bohg
Debidatta Dwibedi
SSL
50
121
0
07 Jun 2021
Hierarchical Robot Navigation in Novel Environments using Rough 2-D Maps
Chengguang Xu
Chris Amato
Lawson L. S. Wong
28
6
0
07 Jun 2021
Efficient Continuous Control with Double Actors and Regularized Critics
Jiafei Lyu
Xiaoteng Ma
Jiangpeng Yan
Xiu Li
OffRL
19
48
0
06 Jun 2021
Same State, Different Task: Continual Reinforcement Learning without Interference
Samuel Kessler
Jack Parker-Holder
Philip J. Ball
S. Zohren
Stephen J. Roberts
CLL
OffRL
29
46
0
05 Jun 2021
Iterative Empirical Game Solving via Single Policy Best Response
Max O. Smith
Thomas W. Anthony
Michael P. Wellman
22
18
0
03 Jun 2021