Deep Reinforcement Learning with Double Q-learning

22 September 2015

David Silver

Papers citing "Deep Reinforcement Learning with Double Q-learning"

41 / 2,291 papers shown

Title
Nonparametric General Reinforcement Learning Jan Leike OffRL 102 26 0 28 Nov 2016
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU Mohammad Babaeizadeh I. Frosio Stephen Tyree Jason Clemons Jan Kautz OffRL 80 259 0 18 Nov 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning Haoran Tang Rein Houthooft Davis Foote Adam Stooke Xi Chen Yan Duan John Schulman F. Turck Pieter Abbeel OffRL 143 776 0 15 Nov 2016
Playing SNES in the Retro Learning Environment Nadav Bhonker Shai Rozenberg Itay Hubara 63 19 0 07 Nov 2016
Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning Oron Anschel Nir Baram N. Shimkin 102 318 0 07 Nov 2016
Combining policy gradient and Q-learning Brendan O'Donoghue Rémi Munos Koray Kavukcuoglu Volodymyr Mnih OffRL OnRL 105 140 0 05 Nov 2016
Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening Frank S. He Yang Liu Alex Schwing Jian-wei Peng 91 84 0 05 Nov 2016
Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics J. M. Wong 79 11 0 01 Nov 2016
Learning Runtime Parameters in Computer Systems with Delayed Experience Injection Michael Schaarschmidt Felix Gessert Valentin Dalibard Eiko Yoneki 30 9 0 31 Oct 2016
Particle Swarm Optimization for Generating Interpretable Fuzzy Reinforcement Learning Policies D. Hein A. Hentschel Thomas Runkler Steffen Udluft OffRL 150 80 0 19 Oct 2016
Multi-Objective Deep Reinforcement Learning Hossam Mossalam Yannis Assael D. Roijers Shimon Whiteson 83 154 0 09 Oct 2016
Information-Theoretic Methods for Planning and Learning in Partially Observable Markov Decision Processes Roy Fox 29 0 0 24 Sep 2016
Playing FPS Games with Deep Reinforcement Learning Guillaume Lample Devendra Singh Chaplot OffRL EgoV 100 588 0 18 Sep 2016
Interactive Spoken Content Retrieval by Deep Reinforcement Learning Yen-Chen Wu Tzu-Hsiang Lin Pei-Hung Chung Hung-yi Lee Tsung-Hsien Wen 27 12 0 16 Sep 2016
Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks Nicolas Usunier Gabriel Synnaeve Zeming Lin Soumith Chintala 99 138 0 10 Sep 2016
Reward Augmented Maximum Likelihood for Neural Structured Prediction Mohammad Norouzi Samy Bengio Zhiwen Chen Navdeep Jaitly M. Schuster Yonghui Wu Dale Schuurmans 118 253 0 01 Sep 2016
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems Zachary Chase Lipton Xiujun Li Jianfeng Gao Lihong Li Faisal Ahmed Li Deng 97 6 0 17 Aug 2016
Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay Ionel-Alexandru Hosu Traian Rebedea 88 97 0 18 Jul 2016
Deep Reinforcement Learning With Macro-Actions Ishan Durugkar Clemens Rosenbaum S. Dernbach Sridhar Mahadevan 56 25 0 15 Jun 2016
Policy Networks with Two-Stage Training for Dialogue Systems Mehdi Fatemi Layla El Asri Hannes Schulz Jing He Kaheer Suleman OffRL 88 108 0 10 Jun 2016
Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning Tiancheng Zhao M. Eskénazi 114 265 0 08 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation Marc G. Bellemare S. Srinivasan Georg Ostrovski Tom Schaul D. Saxton Rémi Munos 195 1,485 0 06 Jun 2016
Deep Reinforcement Learning Radio Control and Signal Detection with KeRLym, a Gym RL Agent Tim O'Shea T. Clancy 53 19 0 30 May 2016
Learning from the memory of Atari 2600 Jakub Sygnowski Henryk Michalewski 116 12 0 04 May 2016
Classifying Options for Deep Reinforcement Learning Kai Arulkumaran Nat Dilokthanakul Murray Shanahan Anil Anthony Bharath 69 20 0 27 Apr 2016
A Deep Hierarchical Approach to Lifelong Learning in Minecraft Chen Tessler Shahar Givony Tom Zahavy D. Mankowitz Shie Mannor CLL 175 381 0 25 Apr 2016
Easy Monotonic Policy Iteration Joshua Achiam OffRL 49 0 0 29 Feb 2016
Learning values across many orders of magnitude H. V. Hasselt A. Guez Matteo Hessel Volodymyr Mnih David Silver 88 170 0 24 Feb 2016
Deep Exploration via Bootstrapped DQN Ian Osband Charles Blundell Alexander Pritzel Benjamin Van Roy 127 1,315 0 15 Feb 2016
Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks Jakob N. Foerster Yannis Assael Nando de Freitas Shimon Whiteson 85 147 0 08 Feb 2016
Graying the black box: Understanding DQNs Tom Zahavy Nir Ben-Zrihem Shie Mannor 84 263 0 08 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning Volodymyr Mnih Adria Puigdomenech Badia M. Berk Mirza Alex Graves Timothy Lillicrap Tim Harley David Silver Koray Kavukcuoglu 223 8,893 0 04 Feb 2016
Taming the Noise in Reinforcement Learning via Soft Updates Roy Fox Ari Pakman Naftali Tishby 112 341 0 28 Dec 2015
Increasing the Action Gap: New Operators for Reinforcement Learning Marc G. Bellemare Georg Ostrovski A. Guez Philip S. Thomas Rémi Munos 78 157 0 15 Dec 2015
How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies Vincent François-Lavet R. Fonteneau D. Ernst 87 111 0 07 Dec 2015
Deep Attention Recurrent Q-Network Ivan Sorokin Alexey Seleznev Mikhail Pavlov A. Fedorov Anastasiia Ignateva 75 152 0 05 Dec 2015
State of the Art Control of Atari Games Using Shallow Reinforcement Learning Yitao Liang Marlos C. Machado Erik Talvitie Michael Bowling 105 113 0 04 Dec 2015
Dueling Network Architectures for Deep Reinforcement Learning Ziyun Wang Tom Schaul Matteo Hessel H. V. Hasselt Marc Lanctot Nando de Freitas OffRL 112 3,780 0 20 Nov 2015
Policy Distillation Andrei A. Rusu Sergio Gomez Colmenarejo Çağlar Gülçehre Guillaume Desjardins J. Kirkpatrick Razvan Pascanu Volodymyr Mnih Koray Kavukcuoglu R. Hadsell 137 698 0 19 Nov 2015
Prioritized Experience Replay Tom Schaul John Quan Ioannis Antonoglou David Silver OffRL 248 3,807 0 18 Nov 2015
Deep Reinforcement Learning in Parameterized Action Space Matthew J. Hausknecht Peter Stone 78 308 0 13 Nov 2015