
Asynchronous Methods for Deep Reinforcement Learning

4 February 2016

Volodymyr Mnih

Adria Puigdomenech Badia

David Silver

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown

Title
Multi-Agent Meta-Reinforcement Learning for Self-Powered and Sustainable Edge Computing Systems M. S. Munir N. H. Tran Walid Saad Choong Seon Hong 143 21 0 20 Feb 2020
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer Tianpei Yang Jianye Hao Zhaopeng Meng Zongzhang Zhang Yujing Hu ... Changjie Fan Weixun Wang Wulong Liu Zhaodong Wang J. Peng OffRL 89 12 0 19 Feb 2020
Multi-Issue Bargaining With Deep Reinforcement Learning Ho-Chun Herbert Chang 42 2 0 18 Feb 2020
MoTiAC: Multi-Objective Actor-Critics for Real-Time Bidding Haolin Zhou Chaoqi Yang Xiaofeng Gao Qiong Chen Gongshen Liu Guihai Chen 71 6 0 18 Feb 2020
Symbolic Network: Generalized Neural Policies for Relational MDPs Sankalp Garg Aniket Bajpai Mausam 34 5 0 18 Feb 2020
Kalman meets Bellman: Improving Policy Evaluation through Value Tracking Shirli Di-Castro Shashua Shie Mannor OffRL 76 12 0 17 Feb 2020
Adaptive Experience Selection for Policy Gradient S. Mohamad Giovanni Montana 106 0 0 17 Feb 2020
Reinforcement learning for the privacy preservation and manipulation of eye tracking data Wolfgang Fuhl Efe Bozkir Enkelejda Kasneci 60 1 0 17 Feb 2020
First Order Constrained Optimization in Policy Space Yiming Zhang Q. Vuong George Andriopoulos 46 4 0 16 Feb 2020
Deep RL Agent for a Real-Time Action Strategy Game Michal Warchalski Dimitrije Radojević M. Milosevic 18 0 0 15 Feb 2020
Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning Navid Naderializadeh J. Sydir M. Simsek Hosein Nikopour 79 129 0 14 Feb 2020
Stable Training of DNN for Speech Enhancement based on Perceptually-Motivated Black-Box Cost Function M. Kawanaka Yuma Koizumi Ryoichi Miyazaki Kohei Yatabe AAML 70 23 0 14 Feb 2020
Hoplite: Efficient and Fault-Tolerant Collective Communication for Task-Based Distributed Systems Siyuan Zhuang Zhuohan Li Danyang Zhuo Stephanie Wang Eric Liang Robert Nishihara Philipp Moritz Ion Stoica 40 24 0 13 Feb 2020
Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic Yangang Ren Jingliang Duan Shengbo Eben Li Yang Guan Qi Sun OffRL 60 30 0 13 Feb 2020
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription Olivier Francon Santiago Gonzalez Babak Hodjat Elliot Meyerson Risto Miikkulainen Xin Qiu Hormoz Shahrzad 80 17 0 13 Feb 2020
Learning to Generate Levels From Nothing Philip Bontrager Julian Togelius GAN 61 22 0 12 Feb 2020
Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing Ge Liu Rui Wu Heng-Tze Cheng Jing Wang Jayden Ooi Lihong Li Ang Li Wai Lok Sibon Li Craig Boutilier Ed H. Chi OffRL 36 4 0 12 Feb 2020
Intrinsic Motivation for Encouraging Synergistic Behavior Rohan Chitnis Shubham Tulsiani Saurabh Gupta Abhinav Gupta 50 28 0 12 Feb 2020
Regret Bounds for Discounted MDPs Shuang Liu H. Su OffRL 80 19 0 12 Feb 2020
SparseIDS: Learning Packet Sampling with Reinforcement Learning Maximilian Bachl Fares Meghdouri J. Fabini Tanja Zseby 46 6 0 10 Feb 2020
Discrete Action On-Policy Learning with Action-Value Critic Yuguang Yue Yunhao Tang Mingzhang Yin Mingyuan Zhou OffRL 78 5 0 10 Feb 2020
Self-Attentive Associative Memory Hung Le T. Tran Svetha Venkatesh 101 56 0 10 Feb 2020
Capsule Network Performance with Autonomous Navigation Tom Molnar Eugenio Culurciello 3DPC 25 2 0 08 Feb 2020
Dynamic Energy Dispatch Based on Deep Reinforcement Learning in IoT-Driven Smart Isolated Microgrids Lei Lei Yue Tan Glenn Dahlenburg W. Xiang K. Zheng 76 71 0 07 Feb 2020
Social diversity and social preferences in mixed-motive reinforcement learning Kevin R. McKee I. Gemp Brian McWilliams Edgar A. Duénez-Guzmán Edward Hughes Joel Z Leibo 97 85 0 06 Feb 2020
Attractive or Faithful? Popularity-Reinforced Learning for Inspired Headline Generation Yun-Zhu Song Hong-Han Shuai Sung-Lin Yeh Yi-Lun Wu Lun-Wei Ku Chao-Han Huck Yang 81 21 0 06 Feb 2020
Temporal-adaptive Hierarchical Reinforcement Learning Wen-Ji Zhou Yang Yu 55 3 0 06 Feb 2020
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making C. Shi Runzhe Wan R. Song Wenbin Lu Ling Leng 82 39 0 05 Feb 2020
Compositional Languages Emerge in a Neural Iterated Learning Model Yi Ren Shangmin Guo Matthieu Labeau Shay B. Cohen S. Kirby 164 98 0 04 Feb 2020
Learning rewards for robotic ultrasound scanning using probabilistic temporal ranking Michael G. Burke Katie Lu Daniel Angelov Artūras Straižys Craig Innes Kartic Subr S. Ramamoorthy 58 11 0 04 Feb 2020
Unsupervised Domain Adaptive Object Detection using Forward-Backward Cyclic Adaptation Siqi Yang Lin Wu Arnold Wiliem Brian C. Lovell ObjD 60 19 0 03 Feb 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey B. R. Kiran Ibrahim Sobh V. Talpaert Patrick Mannion A. A. Sallab S. Yogamani P. Pérez 367 1,710 0 02 Feb 2020
Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV based Random Access IoT Networks with NOMA Sami Khairy Prasanna Balaprakash L. Cai Y. Cheng 31 73 0 31 Jan 2020
Locally Private Distributed Reinforcement Learning Hajime Ono Tsubasa Takahashi OffRL 69 23 0 31 Jan 2020
Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning Peter Henderson Jie Hu Joshua Romoff Emma Brunskill Dan Jurafsky Joelle Pineau 118 459 0 31 Jan 2020
Preventing Imitation Learning with Adversarial Policy Ensembles Albert Zhan Stas Tiomkin Pieter Abbeel 40 3 0 31 Jan 2020
Using Fractal Neural Networks to Play SimCity 1 and Conway's Game of Life at Variable Scales Sam Earle AI4CE 76 18 0 29 Jan 2020
MEMO: A Deep Network for Flexible Combination of Episodic Memories Andrea Banino Adria Puigdomenech Badia Raphael Köster Martin Chadwick V. Zambaldi Demis Hassabis Caswell Barry M. Botvinick D. Kumaran Charles Blundell KELM 87 35 0 29 Jan 2020
Variational Autoencoders for Opponent Modeling in Multi-Agent Systems Georgios Papoudakis Stefano V. Albrecht BDL DRL 64 29 0 29 Jan 2020
Robust Multimodal Image Registration Using Deep Recurrent Reinforcement Learning Shanhui Sun Jing Hu Mingqing Yao Jinrong Hu Xiaodong Yang Qi Song Xi Wu 77 24 0 29 Jan 2020
Towards Learning Multi-agent Negotiations via Self-Play Yichuan Tang 77 33 0 28 Jan 2020
Rotation, Translation, and Cropping for Zero-Shot Generalization Chang Ye Ahmed Khalifa Philip Bontrager Julian Togelius 104 38 0 27 Jan 2020
Challenges and Countermeasures for Adversarial Attacks on Deep Reinforcement Learning Inaam Ilahi Muhammad Usama Junaid Qadir M. Janjua Ala I. Al-Fuqaha D. Hoang Dusit Niyato AAML 147 137 0 27 Jan 2020
PCGRL: Procedural Content Generation via Reinforcement Learning Ahmed Khalifa Philip Bontrager Sam Earle Julian Togelius 80 146 0 24 Jan 2020
EgoMap: Projective mapping and structured egocentric memory for Deep RL E. Beeching Christian Wolf J. Dibangoye Olivier Simonin EgoV 89 27 0 24 Jan 2020
Graph Constrained Reinforcement Learning for Natural Language Action Spaces Prithviraj Ammanabrolu Matthew J. Hausknecht AI4CE LLMAG 111 129 0 23 Jan 2020
Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning Jianyu Chen Shengbo Eben Li Masayoshi Tomizuka 155 246 0 23 Jan 2020
Q-Learning in enormous action spaces via amortized approximate maximization T. Wiele David Warde-Farley A. Mnih Volodymyr Mnih 78 60 0 22 Jan 2020
On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning Ameya Pore G. Aragon-Camarasa 61 11 0 22 Jan 2020
Reinforcement Learning Based Vehicle-cell Association Algorithm for Highly Mobile Millimeter Wave Communication Hamza Khan Anis Elgabli S. Samarakoon M. Bennis Choong Seon Hong 45 33 0 22 Jan 2020