ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.06461
  4. Cited By
Deep Reinforcement Learning with Double Q-learning
v1v2v3 (latest)

Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Deep Reinforcement Learning with Double Q-learning"

50 / 2,291 papers shown
Title
Deep Reinforcement Learning for Inquiry Dialog Policies with Logical
  Formula Embeddings
Deep Reinforcement Learning for Inquiry Dialog Policies with Logical Formula Embeddings
Takuya Hiraoka
Masaaki Tsuchida
Yotaro Watanabe
28
0
0
02 Aug 2017
A Distributional Perspective on Reinforcement Learning
A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
108
1,510
0
21 Jul 2017
Trial without Error: Towards Safe Reinforcement Learning via Human
  Intervention
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
79
231
0
17 Jul 2017
Lenient Multi-Agent Deep Reinforcement Learning
Lenient Multi-Agent Deep Reinforcement Learning
Gregory Palmer
K. Tuyls
D. Bloembergen
Rahul Savani
82
160
0
14 Jul 2017
Distral: Robust Multitask Reinforcement Learning
Distral: Robust Multitask Reinforcement Learning
Yee Whye Teh
V. Bapst
Wojciech M. Czarnecki
John Quan
J. Kirkpatrick
R. Hadsell
N. Heess
Razvan Pascanu
216
553
0
13 Jul 2017
Learning Heuristic Search via Imitation
Learning Heuristic Search via Imitation
M. Bhardwaj
Sanjiban Choudhury
Sebastian Scherer
71
83
0
10 Jul 2017
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
89
107
0
06 Jul 2017
Hashing over Predicted Future Frames for Informed Exploration of Deep
  Reinforcement Learning
Hashing over Predicted Future Frames for Informed Exploration of Deep Reinforcement Learning
Haiyan Yin
Jianda Chen
Sinno Jialin Pan
51
5
0
03 Jul 2017
Sample-efficient Actor-Critic Reinforcement Learning with Supervised
  Data for Dialogue Management
Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management
Pei-hao Su
Paweł Budzianowski
Stefan Ultes
Milica Gasic
S. Young
OffRL
136
130
0
01 Jul 2017
Noisy Networks for Exploration
Noisy Networks for Exploration
Meire Fortunato
M. G. Azar
Bilal Piot
Jacob Menick
Ian Osband
...
Rémi Munos
Demis Hassabis
Olivier Pietquin
Charles Blundell
Shane Legg
116
897
0
30 Jun 2017
Towards Monocular Vision based Obstacle Avoidance through Deep
  Reinforcement Learning
Towards Monocular Vision based Obstacle Avoidance through Deep Reinforcement Learning
Linhai Xie
Sen Wang
Andrew Markham
A. Trigoni
54
173
0
29 Jun 2017
Count-Based Exploration in Feature Space for Reinforcement Learning
Count-Based Exploration in Feature Space for Reinforcement Learning
Jarryd Martin
S. N. Sasikumar
Tom Everitt
Marcus Hutter
76
124
0
25 Jun 2017
Robust and Efficient Transfer Learning with Hidden-Parameter Markov
  Decision Processes
Robust and Efficient Transfer Learning with Hidden-Parameter Markov Decision Processes
Taylor W. Killian
Samuel Daulton
George Konidaris
Finale Doshi-Velez
117
106
0
20 Jun 2017
Dex: Incremental Learning for Complex Environments in Deep Reinforcement
  Learning
Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning
Nick Erickson
Qi Zhao
CLLOffRL
422
2
0
19 Jun 2017
Schema Networks: Zero-shot Transfer with a Generative Causal Model of
  Intuitive Physics
Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics
Ken Kansky
Tom Silver
David A. Mély
Mohamed Eldawy
Miguel Lazaro-Gredilla
Xinghua Lou
N. Dorfman
Szymon Sidor
Scott Phoenix
Dileep George
AI4CE
122
236
0
14 Jun 2017
Hybrid Reward Architecture for Reinforcement Learning
Hybrid Reward Architecture for Reinforcement Learning
H. V. Seijen
Mehdi Fatemi
Joshua Romoff
Romain Laroche
Tavian Barnes
Jeffrey Tsang
100
253
0
13 Jun 2017
UCB Exploration via Q-Ensembles
UCB Exploration via Q-Ensembles
Richard Y. Chen
Szymon Sidor
Pieter Abbeel
John Schulman
OffRL
87
6
0
05 Jun 2017
The Atari Grand Challenge Dataset
The Atari Grand Challenge Dataset
Vitaly Kurin
Sebastian Nowozin
Katja Hofmann
Lucas Beyer
Bastian Leibe
OffRL
86
45
0
31 May 2017
Sequential Dynamic Decision Making with Deep Neural Nets on a Test-Time
  Budget
Sequential Dynamic Decision Making with Deep Neural Nets on a Test-Time Budget
Henghui Zhu
Feng Nan
I. Paschalidis
Venkatesh Saligrama
11
2
0
31 May 2017
Continuous State-Space Models for Optimal Sepsis Treatment - a Deep
  Reinforcement Learning Approach
Continuous State-Space Models for Optimal Sepsis Treatment - a Deep Reinforcement Learning Approach
Aniruddh Raghu
Matthieu Komorowski
Leo Anthony Celi
Peter Szolovits
Marzyeh Ghassemi
OffRL
58
193
0
23 May 2017
Shallow Updates for Deep Reinforcement Learning
Shallow Updates for Deep Reinforcement Learning
Nir Levine
Tom Zahavy
D. Mankowitz
Aviv Tamar
Shie Mannor
OffRL
72
48
0
21 May 2017
Learning to Factor Policies and Action-Value Functions: Factored Action
  Space Representations for Deep Reinforcement learning
Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning
Sahil Sharma
A. Suresh
Rahul Ramesh
Balaraman Ravindran
OffRL
56
36
0
20 May 2017
Discrete Sequential Prediction of Continuous Actions for Deep RL
Discrete Sequential Prediction of Continuous Actions for Deep RL
Luke Metz
Julian Ibarz
Navdeep Jaitly
James Davidson
BDLOffRL
90
120
0
14 May 2017
Deep Episodic Value Iteration for Model-based Meta-Reinforcement
  Learning
Deep Episodic Value Iteration for Model-based Meta-Reinforcement Learning
Steven Hansen
OffRLVLM
38
5
0
09 May 2017
Reinforcement Learning with External Knowledge and Two-Stage Q-functions
  for Predicting Popular Reddit Threads
Reinforcement Learning with External Knowledge and Two-Stage Q-functions for Predicting Popular Reddit Threads
Ji He
Mari Ostendorf
Xiaodong He
OffRLLRM
45
10
0
20 Apr 2017
The Reactor: A fast and sample-efficient Actor-Critic agent for
  Reinforcement Learning
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
A. Gruslys
Will Dabney
M. G. Azar
Bilal Piot
Marc G. Bellemare
Rémi Munos
76
58
0
15 Apr 2017
Deep Q-learning from Demonstrations
Deep Q-learning from Demonstrations
Todd Hester
Matej Vecerík
Olivier Pietquin
Marc Lanctot
Tom Schaul
...
Gabriel Dulac-Arnold
Ian Osband
J. Agapiou
Joel Z Leibo
A. Gruslys
OffRL
94
157
0
12 Apr 2017
Deep Deterministic Policy Gradient for Urban Traffic Light Control
Deep Deterministic Policy Gradient for Urban Traffic Light Control
Noe Casas
79
168
0
27 Mar 2017
Deep Value Networks Learn to Evaluate and Iteratively Refine Structured
  Outputs
Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs
Michael Gygli
Mohammad Norouzi
A. Angelova
TDI
147
68
0
13 Mar 2017
Micro-Objective Learning : Accelerating Deep Reinforcement Learning
  through the Discovery of Continuous Subgoals
Micro-Objective Learning : Accelerating Deep Reinforcement Learning through the Discovery of Continuous Subgoals
Sungtae Lee
Sang-Woo Lee
Jinyoung Choi
Donghyun Kwak
Byoung-Tak Zhang
54
2
0
11 Mar 2017
Deep Robust Kalman Filter
Deep Robust Kalman Filter
Shirli Di-Castro Shashua
Shie Mannor
BDL
79
28
0
07 Mar 2017
Neural Episodic Control
Neural Episodic Control
Alexander Pritzel
Benigno Uria
Sriram Srinivasan
A. Badia
Oriol Vinyals
Demis Hassabis
Daan Wierstra
Charles Blundell
OffRLBDL
113
346
0
06 Mar 2017
Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Joshua Achiam
S. Shankar Sastry
80
238
0
06 Mar 2017
Count-Based Exploration with Neural Density Models
Count-Based Exploration with Neural Density Models
Georg Ostrovski
Marc G. Bellemare
Aaron van den Oord
Rémi Munos
102
626
0
03 Mar 2017
Scaffolding Networks: Incremental Learning and Teaching Through
  Questioning
Scaffolding Networks: Incremental Learning and Teaching Through Questioning
Asli Celikyilmaz
Li Deng
Lihong Li
Chong-Jun Wang
LRMCLL
48
2
0
28 Feb 2017
Learning Control for Air Hockey Striking using Deep Reinforcement
  Learning
Learning Control for Air Hockey Striking using Deep Reinforcement Learning
Ayal Taitler
N. Shimkin
55
10
0
26 Feb 2017
Online Meta-learning by Parallel Algorithm Competition
Online Meta-learning by Parallel Algorithm Competition
Stefan Elfwing
E. Uchibe
Kenji Doya
73
22
0
24 Feb 2017
Beating the World's Best at Super Smash Bros. with Deep Reinforcement
  Learning
Beating the World's Best at Super Smash Bros. with Deep Reinforcement Learning
Vlad Firoiu
William F. Whitney
J. Tenenbaum
94
36
0
21 Feb 2017
Learning to Multi-Task by Active Sampling
Learning to Multi-Task by Active Sampling
Sahil Sharma
Ashutosh Jha
Parikshit Hegde
Balaraman Ravindran
151
21
0
20 Feb 2017
Sigmoid-Weighted Linear Units for Neural Network Function Approximation
  in Reinforcement Learning
Sigmoid-Weighted Linear Units for Neural Network Function Approximation in Reinforcement Learning
Stefan Elfwing
E. Uchibe
Kenji Doya
145
1,761
0
10 Feb 2017
Deep Reinforcement Learning for Robotic Manipulation-The state of the
  art
Deep Reinforcement Learning for Robotic Manipulation-The state of the art
S. Amarjyoti
53
65
0
31 Jan 2017
Learning Light Transport the Reinforced Way
Learning Light Transport the Reinforced Way
Ken Dahm
A. Keller
75
64
0
25 Jan 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRLVLM
346
1,549
0
25 Jan 2017
Reinforcement Learning via Recurrent Convolutional Neural Networks
Reinforcement Learning via Recurrent Convolutional Neural Networks
Tanmay Shankar
S. K. Dwivedy
Prithwijit Guha
SSL
44
20
0
09 Jan 2017
Learning to predict where to look in interactive environments using deep
  recurrent q-learning
Learning to predict where to look in interactive environments using deep recurrent q-learning
Seyed Sajad Mousavi
Michael Schukat
Enda Howley
Ali Borji
N. Mozayani
59
31
0
17 Dec 2016
Deep Reinforcement Learning with Successor Features for Navigation
  across Similar Environments
Deep Reinforcement Learning with Successor Features for Navigation across Similar Environments
Jingwei Zhang
Jost Tobias Springenberg
Joschka Boedecker
Wolfram Burgard
85
295
0
16 Dec 2016
Deep Learning of Robotic Tasks without a Simulator using Strong and Weak
  Human Supervision
Deep Learning of Robotic Tasks without a Simulator using Strong and Weak Human Supervision
Bar Hilleli
Ran El-Yaniv
45
2
0
04 Dec 2016
Overcoming catastrophic forgetting in neural networks
Overcoming catastrophic forgetting in neural networks
J. Kirkpatrick
Razvan Pascanu
Neil C. Rabinowitz
J. Veness
Guillaume Desjardins
...
A. Grabska-Barwinska
Demis Hassabis
Claudia Clopath
D. Kumaran
R. Hadsell
CLL
389
7,619
0
02 Dec 2016
Transfer Learning Across Patient Variations with Hidden Parameter Markov
  Decision Processes
Transfer Learning Across Patient Variations with Hidden Parameter Markov Decision Processes
Taylor W. Killian
George Konidaris
Finale Doshi-Velez
OOD
44
9
0
01 Dec 2016
Improving Policy Gradient by Exploring Under-appreciated Rewards
Improving Policy Gradient by Exploring Under-appreciated Rewards
Ofir Nachum
Mohammad Norouzi
Dale Schuurmans
106
44
0
28 Nov 2016
Previous
123...444546
Next