Asynchronous Methods for Deep Reinforcement Learning

4 February 2016

Volodymyr Mnih

Adria Puigdomenech Badia

David Silver

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown

Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue Policy Learning Baolin Peng Xiujun Li Jianfeng Gao Jingjing Liu Yun-Nung Chen Kam-Fai Wong 91 65 0 31 Oct 2017
Action-depedent Control Variates for Policy Optimization via Stein's Identity Hao Liu Yihao Feng Yi Mao Dengyong Zhou Jian-wei Peng Qiang Liu 94 4 0 30 Oct 2017
Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach Yuhang Song Mai Xu Jianyi Wang Minglang Qiao Liangyu Huo Zulin Wang 105 207 0 30 Oct 2017
Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning Sergio Valcarcel Macua Aleksi Tukiainen D. Hernández David Baldazo Enrique Munoz de Cote S. Zazo 114 29 0 28 Oct 2017
Generalization Tower Network: A Novel Deep Neural Network Architecture for Multi-Task Learning Yuhang Song Mai Xu Songyang Zhang Liangyu Huo 57 3 0 27 Oct 2017
Understanding Early Word Learning in Situated Artificial Agents Felix Hill S. Clark Karl Moritz Hermann Phil Blunsom LM&Ro 93 32 0 26 Oct 2017
DoShiCo Challenge: Domain Shift in Control Prediction Klaas Kelchtermans Tinne Tuytelaars 22 0 0 26 Oct 2017
Meta Learning Shared Hierarchies Kevin Frans Jonathan Ho Xi Chen Pieter Abbeel John Schulman 84 355 0 26 Oct 2017
Consequentialist conditional cooperation in social dilemmas with imperfect information A. Peysakhovich Adam Lerer 89 65 0 19 Oct 2017
Map-based Multi-Policy Reinforcement Learning: Enhancing Adaptability of Robots by Deep Reinforcement Learning A. Kume Eiichi Matsumoto K. Takahashi W. Ko Jethro Tan 69 11 0 17 Oct 2017
Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation Tianhao Zhang Zoe McCarthy Owen Jow Dennis Lee Xi Chen Ken Goldberg Pieter Abbeel SSL 156 663 0 12 Oct 2017
Arguing Machines: Human Supervision of Black Box AI Systems That Make Life-Critical Decisions Alex Fridman Li Ding Benedikt Jenik B. Reimer 51 14 0 12 Oct 2017
AMBER: Adaptive Multi-Batch Experience Replay for Continuous Action Control Seungyul Han Y. Sung OffRL 33 8 0 12 Oct 2017
Emergent Complexity via Multi-Agent Competition Trapit Bansal J. Pachocki Szymon Sidor Ilya Sutskever Igor Mordatch 86 392 0 10 Oct 2017
MSC: A Dataset for Macro-Management in StarCraft II Huikai Wu Yanqi Zong Junge Zhang Kaiqi Huang 59 16 0 09 Oct 2017
Recurrent Deterministic Policy Gradient Method for Bipedal Locomotion on Rough Terrain Challenge Doo Re Song Chuanyu Yang C. McGreavy Zhibin Li 167 30 0 08 Oct 2017
Socially Compliant Navigation through Raw Depth Inputs with Generative Adversarial Imitation Learning L. Tai Jingwei Zhang Ming-Yuan Liu Wolfram Burgard GAN 67 180 0 06 Oct 2017
Rainbow: Combining Improvements in Deep Reinforcement Learning Matteo Hessel Joseph Modayil H. V. Hasselt Tom Schaul Georg Ostrovski Will Dabney Dan Horgan Bilal Piot M. G. Azar David Silver OffRL 112 2,283 0 06 Oct 2017
Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight Yen-Chen Lin Ming-Yuan Liu Min Sun Jia-Bin Huang AAML 104 49 0 02 Oct 2017
Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning Xiangxiang Chu Hangjun Ye 74 56 0 01 Oct 2017
Vision-based deep execution monitoring Francesco Puja S. Grazioso A. Tammaro Valsamis Ntouskos Marta Sanzari F. Pirri 34 1 0 29 Sep 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces Garrett A. Warnell Nicholas R. Waytowich Vernon J. Lawhern Peter Stone 86 273 0 28 Sep 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations Ashvin Nair Bob McGrew Marcin Andrychowicz Wojciech Zaremba Pieter Abbeel OffRL 188 791 0 28 Sep 2017
Towards continuous control of flippers for a multi-terrain robot using deep reinforcement learning Giuseppe Paolo L. Tai Ming-Yuan Liu 17 7 0 25 Sep 2017
Learning Unmanned Aerial Vehicle Control for Autonomous Target Following Siyi Li Tianbo Liu Fangqiu Yi Dit-Yan Yeung Shaojie Shen 53 38 0 24 Sep 2017
Expanding Motor Skills through Relay Neural Networks Visak C. V. Kumar Sehoon Ha Chenxi Liu 27 2 0 22 Sep 2017
Avoidance of Manual Labeling in Robotic Autonomous Navigation Through Multi-Sensory Semi-Supervised Learning Junhong Xu Shangyue Zhu Hanqing Guo Shaoen Wu SSL 31 3 0 22 Sep 2017
Learning Human Behaviors for Robot-Assisted Dressing Alexander Clegg Wenhao Yu Jie Tan Charles C. Kemp Greg Turk Chenxi Liu 38 3 0 20 Sep 2017
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning Peter Henderson Wei-Di Chang Pierre-Luc Bacon David Meger Joelle Pineau Doina Precup GAN 77 73 0 20 Sep 2017
Deep Reinforcement Learning that Matters Peter Henderson Riashat Islam Philip Bachman Joelle Pineau Doina Precup David Meger OffRL 155 1,970 0 19 Sep 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents Marlos C. Machado Marc G. Bellemare Erik Talvitie J. Veness Matthew J. Hausknecht Michael Bowling 114 558 0 18 Sep 2017
Memory Augmented Control Networks Arbaaz Khan Clark Zhang Nikolay Atanasov Konstantinos Karydis Vijay Kumar Daniel D. Lee 82 77 0 17 Sep 2017
The Uncertainty Bellman Equation and Exploration Brendan O'Donoghue Ian Osband Rémi Munos Volodymyr Mnih 90 193 0 15 Sep 2017
Transforming Cooling Optimization for Green Data Center via Deep Reinforcement Learning Yuanlong Li Yonggang Wen K. Guan Dacheng Tao AI4CE 88 180 0 15 Sep 2017
Shared Learning : Enhancing Reinforcement in $Q$-Ensembles Rakesh R Menon Balaraman Ravindran 33 0 0 14 Sep 2017
A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping Debang Li Huikai Wu Junge Zhang Kaiqi Huang OffRL 56 9 0 14 Sep 2017
When Waiting is not an Option : Learning Options with a Deliberation Cost J. Harb Pierre-Luc Bacon Martin Klissarov Doina Precup 71 150 0 14 Sep 2017
A Study of AI Population Dynamics with Million-agent Reinforcement Learning Yaodong Yang Lantao Yu Yiwei Bai Jun Wang Weinan Zhang Ying Wen Yong Yu 64 7 0 13 Sep 2017
Pre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning G. V. D. L. Cruz Yunshu Du Matthew E. Taylor 3DH OffRL 84 58 0 12 Sep 2017
Deep Reinforcement Learning with Surrogate Agent-Environment Interface Songli Wang Yutao Jing 27 1 0 12 Sep 2017
TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow Danijar Hafner James Davidson Vincent Vanhoucke OffRL 59 49 0 08 Sep 2017
Prosocial learning agents solve generalized Stag Hunts better than selfish ones A. Peysakhovich Adam Lerer 114 109 0 08 Sep 2017
BOOK: Storing Algorithm-Invariant Episodes for Deep Reinforcement Learning Simyung Chang Y. Yoo Jaeseok Choi Nojun Kwak OffRL 15 1 0 05 Sep 2017
Mean Actor Critic Cameron Allen Kavosh Asadi Melrose Roderick Abdel-rahman Mohamed George Konidaris Michael Littman 92 45 0 01 Sep 2017
Deep Learning for Video Game Playing Niels Justesen Philip Bontrager Julian Togelius S. Risi VLM 101 208 0 25 Aug 2017
Learning the Enigma with Recurrent Neural Networks S. Greydanus 81 39 0 24 Aug 2017
Reinforcement Learning in POMDPs with Memoryless Options and Option-Observation Initiation Sets Denis Steckelmacher D. Roijers Anna Harutyunyan Peter Vrancx Hélène Plisnier A. Nowé 113 20 0 22 Aug 2017
Teaching UAVs to Race: End-to-End Regression of Agile Controls in Simulation Matthias Mueller Vincent Casser Neil G. Smith D. L. Michels Guohao Li 84 10 0 19 Aug 2017
Sim4CV: A Photo-Realistic Simulator for Computer Vision Applications Matthias Muller Vincent Casser Jean Lahoud Neil G. Smith Guohao Li VGen 74 181 0 19 Aug 2017
A Brief Survey of Deep Reinforcement Learning Kai Arulkumaran M. Deisenroth Miles Brundage Anil Anthony Bharath OffRL 173 2,830 0 19 Aug 2017