ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.06461
  4. Cited By
Deep Reinforcement Learning with Double Q-learning
v1v2v3 (latest)

Deep Reinforcement Learning with Double Q-learning

22 September 2015
H. V. Hasselt
A. Guez
David Silver
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Deep Reinforcement Learning with Double Q-learning"

41 / 2,291 papers shown
Title
Nonparametric General Reinforcement Learning
Nonparametric General Reinforcement Learning
Jan Leike
OffRL
102
26
0
28 Nov 2016
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a
  GPU
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU
Mohammad Babaeizadeh
I. Frosio
Stephen Tyree
Jason Clemons
Jan Kautz
OffRL
80
259
0
18 Nov 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement
  Learning
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
143
776
0
15 Nov 2016
Playing SNES in the Retro Learning Environment
Playing SNES in the Retro Learning Environment
Nadav Bhonker
Shai Rozenberg
Itay Hubara
63
19
0
07 Nov 2016
Averaged-DQN: Variance Reduction and Stabilization for Deep
  Reinforcement Learning
Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning
Oron Anschel
Nir Baram
N. Shimkin
102
318
0
07 Nov 2016
Combining policy gradient and Q-learning
Combining policy gradient and Q-learning
Brendan O'Donoghue
Rémi Munos
Koray Kavukcuoglu
Volodymyr Mnih
OffRLOnRL
105
140
0
05 Nov 2016
Learning to Play in a Day: Faster Deep Reinforcement Learning by
  Optimality Tightening
Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening
Frank S. He
Yang Liu
Alex Schwing
Jian-wei Peng
91
84
0
05 Nov 2016
Towards Lifelong Self-Supervision: A Deep Learning Direction for
  Robotics
Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics
J. M. Wong
79
11
0
01 Nov 2016
Learning Runtime Parameters in Computer Systems with Delayed Experience
  Injection
Learning Runtime Parameters in Computer Systems with Delayed Experience Injection
Michael Schaarschmidt
Felix Gessert
Valentin Dalibard
Eiko Yoneki
30
9
0
31 Oct 2016
Particle Swarm Optimization for Generating Interpretable Fuzzy
  Reinforcement Learning Policies
Particle Swarm Optimization for Generating Interpretable Fuzzy Reinforcement Learning Policies
D. Hein
A. Hentschel
Thomas Runkler
Steffen Udluft
OffRL
150
80
0
19 Oct 2016
Multi-Objective Deep Reinforcement Learning
Multi-Objective Deep Reinforcement Learning
Hossam Mossalam
Yannis Assael
D. Roijers
Shimon Whiteson
83
154
0
09 Oct 2016
Information-Theoretic Methods for Planning and Learning in Partially
  Observable Markov Decision Processes
Information-Theoretic Methods for Planning and Learning in Partially Observable Markov Decision Processes
Roy Fox
29
0
0
24 Sep 2016
Playing FPS Games with Deep Reinforcement Learning
Playing FPS Games with Deep Reinforcement Learning
Guillaume Lample
Devendra Singh Chaplot
OffRLEgoV
100
588
0
18 Sep 2016
Interactive Spoken Content Retrieval by Deep Reinforcement Learning
Interactive Spoken Content Retrieval by Deep Reinforcement Learning
Yen-Chen Wu
Tzu-Hsiang Lin
Pei-Hung Chung
Hung-yi Lee
Tsung-Hsien Wen
27
12
0
16 Sep 2016
Episodic Exploration for Deep Deterministic Policies: An Application to
  StarCraft Micromanagement Tasks
Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks
Nicolas Usunier
Gabriel Synnaeve
Zeming Lin
Soumith Chintala
99
138
0
10 Sep 2016
Reward Augmented Maximum Likelihood for Neural Structured Prediction
Reward Augmented Maximum Likelihood for Neural Structured Prediction
Mohammad Norouzi
Samy Bengio
Zhiwen Chen
Navdeep Jaitly
M. Schuster
Yonghui Wu
Dale Schuurmans
118
253
0
01 Sep 2016
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for
  Task-Oriented Dialogue Systems
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Zachary Chase Lipton
Xiujun Li
Jianfeng Gao
Lihong Li
Faisal Ahmed
Li Deng
97
6
0
17 Aug 2016
Playing Atari Games with Deep Reinforcement Learning and Human
  Checkpoint Replay
Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay
Ionel-Alexandru Hosu
Traian Rebedea
88
97
0
18 Jul 2016
Deep Reinforcement Learning With Macro-Actions
Deep Reinforcement Learning With Macro-Actions
Ishan Durugkar
Clemens Rosenbaum
S. Dernbach
Sridhar Mahadevan
56
25
0
15 Jun 2016
Policy Networks with Two-Stage Training for Dialogue Systems
Policy Networks with Two-Stage Training for Dialogue Systems
Mehdi Fatemi
Layla El Asri
Hannes Schulz
Jing He
Kaheer Suleman
OffRL
88
108
0
10 Jun 2016
Towards End-to-End Learning for Dialog State Tracking and Management
  using Deep Reinforcement Learning
Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning
Tiancheng Zhao
M. Eskénazi
114
265
0
08 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
195
1,485
0
06 Jun 2016
Deep Reinforcement Learning Radio Control and Signal Detection with
  KeRLym, a Gym RL Agent
Deep Reinforcement Learning Radio Control and Signal Detection with KeRLym, a Gym RL Agent
Tim O'Shea
T. Clancy
53
19
0
30 May 2016
Learning from the memory of Atari 2600
Learning from the memory of Atari 2600
Jakub Sygnowski
Henryk Michalewski
116
12
0
04 May 2016
Classifying Options for Deep Reinforcement Learning
Classifying Options for Deep Reinforcement Learning
Kai Arulkumaran
Nat Dilokthanakul
Murray Shanahan
Anil Anthony Bharath
69
20
0
27 Apr 2016
A Deep Hierarchical Approach to Lifelong Learning in Minecraft
A Deep Hierarchical Approach to Lifelong Learning in Minecraft
Chen Tessler
Shahar Givony
Tom Zahavy
D. Mankowitz
Shie Mannor
CLL
175
381
0
25 Apr 2016
Easy Monotonic Policy Iteration
Easy Monotonic Policy Iteration
Joshua Achiam
OffRL
49
0
0
29 Feb 2016
Learning values across many orders of magnitude
Learning values across many orders of magnitude
H. V. Hasselt
A. Guez
Matteo Hessel
Volodymyr Mnih
David Silver
88
170
0
24 Feb 2016
Deep Exploration via Bootstrapped DQN
Deep Exploration via Bootstrapped DQN
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
127
1,315
0
15 Feb 2016
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent
  Q-Networks
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks
Jakob N. Foerster
Yannis Assael
Nando de Freitas
Shimon Whiteson
85
147
0
08 Feb 2016
Graying the black box: Understanding DQNs
Graying the black box: Understanding DQNs
Tom Zahavy
Nir Ben-Zrihem
Shie Mannor
84
263
0
08 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
223
8,893
0
04 Feb 2016
Taming the Noise in Reinforcement Learning via Soft Updates
Taming the Noise in Reinforcement Learning via Soft Updates
Roy Fox
Ari Pakman
Naftali Tishby
112
341
0
28 Dec 2015
Increasing the Action Gap: New Operators for Reinforcement Learning
Increasing the Action Gap: New Operators for Reinforcement Learning
Marc G. Bellemare
Georg Ostrovski
A. Guez
Philip S. Thomas
Rémi Munos
78
157
0
15 Dec 2015
How to Discount Deep Reinforcement Learning: Towards New Dynamic
  Strategies
How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies
Vincent François-Lavet
R. Fonteneau
D. Ernst
87
111
0
07 Dec 2015
Deep Attention Recurrent Q-Network
Deep Attention Recurrent Q-Network
Ivan Sorokin
Alexey Seleznev
Mikhail Pavlov
A. Fedorov
Anastasiia Ignateva
75
152
0
05 Dec 2015
State of the Art Control of Atari Games Using Shallow Reinforcement
  Learning
State of the Art Control of Atari Games Using Shallow Reinforcement Learning
Yitao Liang
Marlos C. Machado
Erik Talvitie
Michael Bowling
105
113
0
04 Dec 2015
Dueling Network Architectures for Deep Reinforcement Learning
Dueling Network Architectures for Deep Reinforcement Learning
Ziyun Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
112
3,780
0
20 Nov 2015
Policy Distillation
Policy Distillation
Andrei A. Rusu
Sergio Gomez Colmenarejo
Çağlar Gülçehre
Guillaume Desjardins
J. Kirkpatrick
Razvan Pascanu
Volodymyr Mnih
Koray Kavukcuoglu
R. Hadsell
137
698
0
19 Nov 2015
Prioritized Experience Replay
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
248
3,807
0
18 Nov 2015
Deep Reinforcement Learning in Parameterized Action Space
Deep Reinforcement Learning in Parameterized Action Space
Matthew J. Hausknecht
Peter Stone
78
308
0
13 Nov 2015
Previous
123...444546