v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016

Volodymyr Mnih

Adria Puigdomenech Badia

David Silver

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown

Title
Sequential Dynamic Decision Making with Deep Neural Nets on a Test-Time Budget Henghui Zhu Feng Nan I. Paschalidis Venkatesh Saligrama 11 2 0 31 May 2017
Experience Replay Using Transition Sequences Thommen George Karimpanal Roland Bouffanais OffRL 39 14 0 30 May 2017
End-to-end Active Object Tracking via Reinforcement Learning Wenhan Luo Peng Sun Fangwei Zhong Wei Liu Yadong Mu Yizhou Wang 95 86 0 30 May 2017
Constrained Policy Optimization Joshua Achiam David Held Aviv Tamar Pieter Abbeel 199 1,339 0 30 May 2017
Learning End-to-end Multimodal Sensor Policies for Autonomous Navigation Guan-Horng Liu Avinash Siravuru Sai P. Selvaraj Manuela Veloso George Kantor 89 69 0 30 May 2017
Convergent Tree Backup and Retrace with Function Approximation Ahmed Touati Pierre-Luc Bacon Doina Precup Pascal Vincent 106 40 0 25 May 2017
The Marginal Value of Adaptive Gradient Methods in Machine Learning Ashia Wilson Rebecca Roelofs Mitchell Stern Nathan Srebro Benjamin Recht ODL 125 1,035 0 23 May 2017
Enhanced Experience Replay Generation for Efficient Reinforcement Learning Vincent Huang Tobias Ley Martha Vlachou-Konchylaki Wenfeng Hu OnRL GAN SyDa 41 10 0 23 May 2017
Visual Semantic Planning using Deep Successor Representations Yuke Zhu Daniel Gordon Eric Kolve Dieter Fox Li Fei-Fei Abhinav Gupta Roozbeh Mottaghi Ali Farhadi 112 142 0 23 May 2017
Neural Network Memory Architectures for Autonomous Robot Navigation Steven W. Chen Nikolay Atanasov Arbaaz Khan Konstantinos Karydis Daniel D. Lee Vijay Kumar 51 7 0 23 May 2017
Pairwise Confusion for Fine-Grained Visual Classification Abhimanyu Dubey O. Gupta Pei Guo Ramesh Raskar Ryan Farrell Nikhil Naik 54 10 0 22 May 2017
A unified view of entropy-regularized Markov decision processes Gergely Neu Anders Jonsson Vicencc Gómez 121 264 0 22 May 2017
Guide Actor-Critic for Continuous Control Voot Tangkaratt A. Abdolmaleki Masashi Sugiyama 67 17 0 22 May 2017
Shallow Updates for Deep Reinforcement Learning Nir Levine Tom Zahavy D. Mankowitz Aviv Tamar Shie Mannor OffRL 72 48 0 21 May 2017
Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning Sahil Sharma J. GirishRaguvir S. Ramesh Balaraman Ravindran 38 6 0 21 May 2017
Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning Sahil Sharma A. Suresh Rahul Ramesh Balaraman Ravindran OffRL 56 36 0 20 May 2017
Relaxed Wasserstein with Applications to GANs Xin Guo Johnny Hong Tianyi Lin Nan Yang GAN 114 35 0 19 May 2017
Atari games and Intel processors R. Adamski T. Grel Maciek Klimek Henryk Michalewski 34 5 0 19 May 2017
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning Nat Dilokthanakul Christos Kaplanis Nick Pawlowski Murray Shanahan 87 92 0 18 May 2017
Delving into adversarial attacks on deep policies Jernej Kos Basel Alomair AAML 72 228 0 18 May 2017
Probabilistically Safe Policy Transfer David Held Zoe McCarthy Michael Zhang Fred Shentu Pieter Abbeel 86 19 0 15 May 2017
Curiosity-driven Exploration by Self-supervised Prediction Deepak Pathak Pulkit Agrawal Alexei A. Efros Trevor Darrell LRM SSL 183 2,456 0 15 May 2017
Discrete Sequential Prediction of Continuous Actions for Deep RL Luke Metz Julian Ibarz Navdeep Jaitly James Davidson BDL OffRL 92 121 0 14 May 2017
Efficient Parallel Methods for Deep Reinforcement Learning Alfredo V. Clemente Humberto Nicolás Castejón Martínez A. Chandra 85 115 0 13 May 2017
Metacontrol for Adaptive Imagination-Based Optimization Jessica B. Hamrick A. J. Ballard Razvan Pascanu Oriol Vinyals N. Heess Peter W. Battaglia 76 69 0 07 May 2017
Toward Low-Flying Autonomous MAV Trail Navigation using Deep Neural Networks for Environmental Awareness Nikolai Smolyanskiy A. Kamenev Jeffrey Smith Stan Birchfield 144 223 0 07 May 2017
Traffic Light Control Using Deep Policy-Gradient and Value-Function Based Reinforcement Learning Seyed Sajad Mousavi Michael Schukat Enda Howley 95 309 0 28 Apr 2017
Mapping Instructions and Visual Observations to Actions with Reinforcement Learning Dipendra Kumar Misra John Langford Yoav Artzi 86 247 0 28 Apr 2017
General Video Game AI: Learning from Screen Capture Kamolwan Kunanusont Simon Lucas Diego Perez-Liebana 60 20 0 23 Apr 2017
Equivalence Between Policy Gradients and Soft Q-Learning John Schulman Xi Chen Pieter Abbeel OffRL 132 349 0 21 Apr 2017
Beating Atari with Natural Language Guided Reinforcement Learning Russell Kaplan Chris Sauer A. Sosa LM&Ro 86 69 0 18 Apr 2017
Investigating Recurrence and Eligibility Traces in Deep Q-Networks J. Harb Doina Precup 54 21 0 18 Apr 2017
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning A. Gruslys Will Dabney M. G. Azar Bilal Piot Marc G. Bellemare Rémi Munos 76 58 0 15 Apr 2017
Virtual to Real Reinforcement Learning for Autonomous Driving Xinlei Pan Yurong You Ziyan Wang Cewu Lu OffRL 121 338 0 13 Apr 2017
Deep Reinforcement Learning-based Image Captioning with Embedding Reward Zhou Ren Xiaoyu Wang Ning Zhang Xutao Lv Li Li 65 324 0 12 Apr 2017
Deep Q-learning from Demonstrations Todd Hester Matej Vecerík Olivier Pietquin Marc Lanctot Tom Schaul ... Gabriel Dulac-Arnold Ian Osband J. Agapiou Joel Z Leibo A. Gruslys OffRL 94 157 0 12 Apr 2017
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation I. Popov N. Heess Timothy Lillicrap Roland Hafner Gabriel Barth-Maron Matej Vecerík Thomas Lampe Yuval Tassa Tom Erez Martin Riedmiller OffRL 99 265 0 10 Apr 2017
Stein Variational Policy Gradient Yang Liu Prajit Ramachandran Qiang Liu Jian-wei Peng 80 141 0 07 Apr 2017
Recurrent Environment Simulators Silvia Chiappa S. Racanière Daan Wierstra S. Mohamed 85 211 0 07 Apr 2017
Learned Watershed: End-to-End Learning of Seeded Segmentation Steffen Wolf Lukas Schott Ullrich Kothe Fred Hamprecht 56 35 0 07 Apr 2017
Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games Peng Peng Ying Wen Yaodong Yang Quan Yuan Zhenkun Tang Haitao Long Jun Wang 110 336 0 29 Mar 2017
Socially Aware Motion Planning with Deep Reinforcement Learning Yu Fan Chen Michael Everett Miao Liu Jonathan P. How 121 683 0 26 Mar 2017
Combining Neural Networks and Tree Search for Task and Motion Planning in Challenging Environments Chris Paxton Vasumathi Raman Gregory Hager Marin Kobilarov 78 123 0 22 Mar 2017
An End-to-End Approach to Natural Language Object Retrieval via Context-Aware Deep Reinforcement Learning Fan Wu Zhongwen Xu Yi Yang ObjD 60 11 0 22 Mar 2017
Learning to Navigate Cloth using Haptics Alexander Clegg Wenhao Yu Zackory M. Erickson Jie Tan Chenxi Liu Greg Turk 86 23 0 20 Mar 2017
Evolution Strategies as a Scalable Alternative to Reinforcement Learning Tim Salimans Jonathan Ho Xi Chen Szymon Sidor Ilya Sutskever 174 1,545 0 10 Mar 2017
Tactics of Adversarial Attack on Deep Reinforcement Learning Agents Yen-Chen Lin Zhang-Wei Hong Yuan-Hong Liao Meng-Li Shih Ming-Yuan Liu Min Sun AAML 141 419 0 08 Mar 2017
Combining Self-Supervised Learning and Imitation for Vision-Based Rope Manipulation Ashvin Nair Dian Chen Pulkit Agrawal Phillip Isola Pieter Abbeel Jitendra Malik Sergey Levine SSL 83 312 0 06 Mar 2017
Neural Episodic Control Alexander Pritzel Benigno Uria Sriram Srinivasan A. Badia Oriol Vinyals Demis Hassabis Daan Wierstra Charles Blundell OffRL BDL 113 346 0 06 Mar 2017
Context-Based Concurrent Experience Sharing in Multiagent Systems Dan Garant Bruno Castro da Silva V. Lesser Chongjie Zhang 22 4 0 06 Mar 2017