v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016

Volodymyr Mnih

Adria Puigdomenech Badia

David Silver

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown

Title
The Actor-Advisor: Policy Gradient With Off-Policy Advice Hélène Plisnier Denis Steckelmacher D. Roijers A. Nowé CML OffRL 27 6 0 07 Feb 2019
Decentralized Multi-Agents by Imitation of a Centralized Controller A. Lin Mark J. Debord Katia Estabridis G. Hewer Guido Montufar Stanley Osher 65 6 0 06 Feb 2019
Distilling Policy Distillation Wojciech M. Czarnecki Razvan Pascanu Simon Osindero Siddhant M. Jayakumar G. Swirszcz Max Jaderberg 85 134 0 06 Feb 2019
Neural Fictitious Self-Play on ELF Mini-RTS Keigo Kawamura Yoshimasa Tsuruoka 59 7 0 06 Feb 2019
Separating value functions across time-scales Joshua Romoff Peter Henderson Ahmed Touati Emma Brunskill Joelle Pineau Yann Ollivier 87 25 0 05 Feb 2019
Learning to Schedule Communication in Multi-agent Reinforcement Learning Daewoo Kim Sang-chul Moon D. Hostallero Wan Ju Kang Taeyoung Lee Kyunghwan Son Yung Yi 80 208 0 05 Feb 2019
Embodied Multimodal Multitask Learning Devendra Singh Chaplot Lisa Lee Ruslan Salakhutdinov Devi Parikh Dhruv Batra LM&Ro 96 24 0 04 Feb 2019
Obstacle Tower: A Generalization Challenge in Vision, Control, and Planning Arthur Juliani Ahmed Khalifa Vincent-Pierre Berges Jonathan Harper Ervin Teng Hunter Henry A. Crespi Julian Togelius Danny Lange 87 144 0 04 Feb 2019
Incremental Learning with Maximum Entropy Regularization: Rethinking Forgetting and Intransigence Dahyun Kim Jihwan Bae Yeonsik Jo Jonghyun Choi OOD CLL 75 20 0 03 Feb 2019
Certified Reinforcement Learning with Logic Guidance Mohammadhosein Hasanbeig Daniel Kroening Alessandro Abate 127 57 0 02 Feb 2019
Visual Rationalizations in Deep Reinforcement Learning for Atari Games L. Weitkamp Elise van der Pol Zeynep Akata 84 27 0 01 Feb 2019
Competitive Experience Replay Hao Liu Alexander R. Trott R. Socher Caiming Xiong OffRL 128 53 0 01 Feb 2019
The Hanabi Challenge: A New Frontier for AI Research Nolan Bard Jakob N. Foerster A. Chandar Neil Burch Marc Lanctot ... Iain Dunning Shibl Mourad Hugo Larochelle Marc G. Bellemare Michael Bowling LLMAG 126 355 0 01 Feb 2019
TF-Replicator: Distributed Machine Learning for Researchers P. Buchlovsky David Budden Dominik Grewe Chris Jones John Aslanides ... Aidan Clark Sergio Gomez Colmenarejo Aedan Pope Fabio Viola Dan Belov GNN OffRL AI4CE 81 20 0 01 Feb 2019
Tsallis Reinforcement Learning: A Unified Framework for Maximum Entropy Reinforcement Learning Kyungjae Lee Sungyub Kim Sungbin Lim Sungjoon Choi Songhwai Oh 150 28 0 31 Jan 2019
A Theory of Regularized Markov Decision Processes Matthieu Geist B. Scherrer Olivier Pietquin 147 333 0 31 Jan 2019
Go-Explore: a New Approach for Hard-Exploration Problems Adrien Ecoffet Joost Huizinga Joel Lehman Kenneth O. Stanley Jeff Clune AI4TS 134 370 0 30 Jan 2019
Benchmarking Classic and Learned Navigation in Complex 3D Environments Dmytro Mishkin Alexey Dosovitskiy V. Koltun 137 75 0 30 Jan 2019
InfoBot: Transfer and Exploration via the Information Bottleneck Anirudh Goyal Riashat Islam Daniel Strouse Zafarali Ahmed M. Botvinick Hugo Larochelle Yoshua Bengio Sergey Levine OffRL 131 167 0 30 Jan 2019
Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning Casey Chu Jose H. Blanchet Peter Glynn GAN 75 26 0 30 Jan 2019
Privacy-preserving Q-Learning with Functional Noise in Continuous State Spaces Baoxiang Wang N. Hegde 88 65 0 30 Jan 2019
Trust Region-Guided Proximal Policy Optimization Yuhui Wang Hao He Xiaoyang Tan Yaozhong Gan OffRL 89 57 0 29 Jan 2019
Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks Dongqi Han Kenji Doya Jun Tani AI4CE 126 20 0 29 Jan 2019
A Regulation Enforcement Solution for Multi-agent Reinforcement Learning Fan-Yun Sun Yen-Yu Chang Yueh-hua Wu Shou-De Lin 25 2 0 29 Jan 2019
Making Deep Q-learning methods robust to time discretization Corentin Tallec Léonard Blier Yann Ollivier OOD OffRL 67 91 0 28 Jan 2019
Q-learning with UCB Exploration is Sample Efficient for Infinite-Horizon MDP Kefan Dong Yuanhao Wang Xiaoyu Chen Liwei Wang OffRL 81 97 0 27 Jan 2019
Model-based Deep Reinforcement Learning for Dynamic Portfolio Optimization Pengqian Yu J. Lee Ilya Kulyatin Zekun Shi Sakyasingha Dasgupta 74 64 0 25 Jan 2019
Ablation Studies in Artificial Neural Networks Richard Meyes Melanie Lu Constantin Waubert de Puiseau Tobias Meisen 69 218 0 24 Jan 2019
Distributed Learning of Decentralized Control Policies for Articulated Mobile Robots Guillaume Sartoretti William Paivine Yunfei Shi Yue Wu Howie Choset 54 55 0 24 Jan 2019
Never Forget: Balancing Exploration and Exploitation via Learning Optical Flow Hsuan-Kung Yang Po-Han Chiang Kuan-Wei Ho Min-Fong Hong Chun-Yi Lee 45 7 0 24 Jan 2019
Combinational Q-Learning for Dou Di Zhu Yang You Liangwei Li B. Guo Weiming Wang Cewu Lu OffRL 61 13 0 24 Jan 2019
Causal Reasoning from Meta-reinforcement Learning Ishita Dasgupta Jane X. Wang Silvia Chiappa Jovana Mitrović Pedro A. Ortega David Raposo Edward Hughes Peter W. Battaglia M. Botvinick Z. Kurth-Nelson CML LRM 79 122 0 23 Jan 2019
Machine Learning for Wireless Communications in the Internet of Things: A Comprehensive Survey Jithin Jagannath Nicholas Polosky Anu Jagannath Francesco Restuccia Tommaso Melodia 106 232 0 23 Jan 2019
Trust Region Value Optimization using Kalman Filtering Shirli Di-Castro Shashua Shie Mannor 61 8 0 23 Jan 2019
Towards Non-saturating Recurrent Units for Modelling Long-term Dependencies A. Chandar Chinnadhurai Sankar Eugene Vorontsov Samira Ebrahimi Kahou Yoshua Bengio 101 56 0 22 Jan 2019
Towards Physically Safe Reinforcement Learning under Supervision Yinan Zhang Devin J. Balkcom Haoxiang Li OffRL 17 4 0 19 Jan 2019
Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems Boyi Liu Lujia Wang Ming-Yuan Liu 98 253 0 19 Jan 2019
On-Policy Trust Region Policy Optimisation with Replay Buffers D. Kangin N. Pugeault OffRL 23 3 0 18 Jan 2019
Amplifying the Imitation Effect for Reinforcement Learning of UCAV's Mission Execution G. Lee Chang Ouk Kim 33 4 0 17 Jan 2019
Learning Autonomous Exploration and Mapping with Semantic Vision Xiangyang Zhi Xuming He Sören Schwertfeger 128 9 0 15 Jan 2019
AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning Ameer Haj-Ali Qijing Huang William S. Moses J. Xiang Ion Stoica Krste Asanović J. Wawrzynek 52 36 0 15 Jan 2019
A Deep Recurrent Q Network towards Self-adapting Distributed Microservices architecture Basel Magableh 60 16 0 13 Jan 2019
Neural network gradient-based learning of black-box function interfaces Alon Jacovi Guy Hadash Einat Kermany Boaz Carmeli Ofer Lavi George Kour Jonathan Berant 48 13 0 13 Jan 2019
An investigation of model-free planning A. Guez M. Berk Mirza Karol Gregor Rishabh Kabra S. Racanière ... Laurent Orseau Tom Eccles Greg Wayne David Silver Timothy Lillicrap OffRL 106 117 0 11 Jan 2019
Motion Perception in Reinforcement Learning with Dynamic Objects Artemij Amiranashvili Alexey Dosovitskiy V. Koltun Thomas Brox 74 35 0 10 Jan 2019
Model-Predictive Policy Learning with Uncertainty Regularization for Driving in Dense Traffic Mikael Henaff A. Canziani Yann LeCun OOD 118 123 0 08 Jan 2019
Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions Rui Wang Joel Lehman Jeff Clune Kenneth O. Stanley 131 250 0 07 Jan 2019
Recurrent Control Nets for Deep Reinforcement Learning Vincent Liu Ademi Adeniji Nathaniel Lee Jason Zhao Mario Srouji 18 3 0 06 Jan 2019
Exploring applications of deep reinforcement learning for real-world autonomous driving systems V. Talpaert Ibrahim Sobh Ravi Kiran Patrick Mannion S. Yogamani Ahmad El-Sallab P. Pérez 70 74 0 06 Jan 2019
What Should I Do Now? Marrying Reinforcement Learning and Symbolic Planning Daniel Gordon Dieter Fox Ali Farhadi 78 20 0 06 Jan 2019