v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016

Volodymyr Mnih

Adria Puigdomenech Badia

David Silver

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown

Title
On The Transferability of Deep-Q Networks M. Sabatelli Pierre Geurts 83 2 0 06 Oct 2021
Approximate Newton policy gradient algorithms Haoya Li Samarth Gupta Hsiangfu Yu Lexing Ying Inderjit Dhillon 70 3 0 05 Oct 2021
Deep reinforcement learning for guidewire navigation in coronary artery phantom Jihoon Kweon Kyunghwan Kim Chaehyuk Lee Hwi Kwon Jinwoo Park ... Inwook Back J. Roh Y. Moon Jaesoon Choi Young-Hak Kim OnRL 65 34 0 05 Oct 2021
Mapless Navigation: Learning UAVs Motion forExploration of Unknown Environments Sunggoo Jung David Hyunchul Shim 40 0 0 04 Oct 2021
Automating Privilege Escalation with Deep Reinforcement Learning Kalle Kujanpää Willie Victor Alexander Ilin AAML 41 16 0 04 Oct 2021
Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley Values Alexandre Heuillet Fabien Couthouis Natalia Díaz Rodríguez 87 65 0 04 Oct 2021
Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations Chi Zhang S. Kuppannagari Viktor Prasanna OffRL 34 8 0 03 Oct 2021
Batch size-invariance for policy optimization Jacob Hilton K. Cobbe John Schulman 120 14 0 01 Oct 2021
Learning the Markov Decision Process in the Sparse Gaussian Elimination Yingshi Chen 36 1 0 30 Sep 2021
Reinforcement Learning for Classical Planning: Viewing Heuristics as Dense Reward Generators Clement Gehring Masataro Asai Rohan Chitnis Tom Silver L. Kaelbling Shirin Sohrabi Michael Katz OffRL 84 38 0 30 Sep 2021
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates Romain Laroche Rémi Tachet des Combes 91 8 0 29 Sep 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey Amjad Yousef Majid Serge Saaybi Tomas van Rietbergen Vincent François-Lavet R. V. Prasad Chris Verhoeven OffRL 135 60 0 28 Sep 2021
Exploring More When It Needs in Deep Reinforcement Learning Youtian Guo Qitong Gao 31 0 0 28 Sep 2021
Deep Reinforcement Learning with Adjustments H. Khorasgani Haiyan Wang Chetan Gupta Susumu Serita 25 2 0 28 Sep 2021
Runtime Safety Assurance for Learning-enabled Control of Autonomous Driving Vehicles Shengduo Chen Yao Sun Dachuan Li Qiang Wang Qi Hao J. Sifakis 82 18 0 28 Sep 2021
The Role of Lookahead and Approximate Policy Evaluation in Reinforcement Learning with Linear Value Function Approximation Anna Winnicki Joseph Lubars Michael Livesay R. Srikant 74 3 0 28 Sep 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research Mikayel Samvelyan Robert Kirk Vitaly Kurin Jack Parker-Holder Minqi Jiang Eric Hambro Fabio Petroni Heinrich Küttler Edward Grefenstette Tim Rocktaschel OffRL 319 91 0 27 Sep 2021
Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled Exploration Zhaorun Chen Binhao Chen S. Xie Liang Gong Chengliang Liu Zhengfeng Zhang Junping Zhang OffRL 25 2 0 27 Sep 2021
Applying supervised and reinforcement learning methods to create neural-network-based agents for playing StarCraft II Michal Opanowicz 30 0 0 26 Sep 2021
Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms Liyuan Zheng Tanner Fiez Zane Alumbaugh Benjamin J. Chasnov Lillian J. Ratliff OffRL 99 42 0 25 Sep 2021
NICE: Robust Scheduling through Reinforcement Learning-Guided Integer Programming Luke Kenworthy Siddharth Nayak Christopher R. Chin H. Balakrishnan 116 8 0 24 Sep 2021
A Graph Policy Network Approach for Volt-Var Control in Power Distribution Systems Xian Yeow Lee Soumik Sarkar Yubo Wang 35 31 0 24 Sep 2021
The $f$ -Divergence Reinforcement Learning Framework Chen Gong Qiang He Yunpeng Bai Zhouyi Yang Xiaoyu Chen Xinwen Hou Xianjie Zhang Yu Liu Guoliang Fan 68 3 0 24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience C. Banerjee Zhiyong Chen N. Noman 56 34 0 24 Sep 2021
ADVERSARIALuscator: An Adversarial-DRL Based Obfuscator and Metamorphic Malware SwarmGenerator Mohit Sewak S. K. Sahay Hemant Rathore AAML 58 8 0 23 Sep 2021
On Bonus-Based Exploration Methods in the Arcade Learning Environment Adrien Ali Taïga W. Fedus Marlos C. Machado Aaron Courville Marc G. Bellemare 57 61 0 22 Sep 2021
Real Robot Challenge: A Robotics Competition in the Cloud Stefan Bauer Felix Widmaier M. Wuthrich Annika Buchholz Sebastian Stark ... David Córdova Bulens Kevin McGuinness Noel E. O'Connor S. Redmond Bernhard Schölkopf 61 12 0 22 Sep 2021
Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods Baturay Saglam Enes Duran Dogan C. Cicek Furkan B. Mutlu Suleyman S. Kozat OffRL 73 12 0 22 Sep 2021
Benchmarking Lane-changing Decision-making for Deep Reinforcement Learning Junjie Wang Qichao Zhang Dongbin Zhao OffRL 26 1 0 22 Sep 2021
Context-Specific Representation Abstraction for Deep Option Learning Marwa Abdulhai Dong-Ki Kim Matthew D Riemer Miao Liu Gerald Tesauro Jonathan P. How OffRL 92 10 0 20 Sep 2021
Multi-Agent Embodied Visual Semantic Navigation with Scene Prior Knowledge Xinzhu Liu Di Guo Huaping Liu F. Sun EgoV 77 25 0 20 Sep 2021
A Survey of Text Games for Reinforcement Learning informed by Natural Language P. Osborne Heido Nomm André Freitas AI4CE 101 24 0 20 Sep 2021
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research Chris Cummins Bram Wasti Jiadong Guo Brandon Cui Jason Ansel ... Jia-Wei Liu O. Teytaud Benoit Steiner Yuandong Tian Hugh Leather 78 76 0 17 Sep 2021
Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual Patterns Prasanth Buddareddygari Travis Zhang Yezhou Yang Yi Ren AAML 61 15 0 16 Sep 2021
DROMO: Distributionally Robust Offline Model-based Policy Optimization Ruizhen Liu Dazhi Zhong Zhi-Cong Chen OffRL 60 3 0 15 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain Jianye Hao Tianpei Yang Hongyao Tang Chenjia Bai Jinyi Liu Zhaopeng Meng Peng Liu Zhen Wang OffRL 86 103 0 14 Sep 2021
DSDF: An approach to handle stochastic agents in collaborative multi-agent reinforcement learning S. K. Perepu Kaushik Dey 29 0 0 14 Sep 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods Xin Guo Anran Hu Junzi Zhang OffRL 86 6 0 13 Sep 2021
Direct Advantage Estimation Hsiao-Ru Pan Nico Gürtler Alexander Neitz Bernhard Schölkopf OffRL CML 62 13 0 13 Sep 2021
Computation Rate Maximum for Mobile Terminals in UAV-assisted Wireless Powered MEC Networks with Fairness Constraint Xiaoyi Zhou Liang Huang Tong Ye Weiqiang Sun 29 1 0 13 Sep 2021
Robust Stability of Neural Network-controlled Nonlinear Systems with Parametric Variability Soumyabrata Talukder Ratnesh Kumar 46 8 0 13 Sep 2021
Reinforcement Learning for Load-balanced Parallel Particle Tracing Jiayi Xu Hanqi Guo Han-Wei Shen Mukund Raj Skylar W. Wurster Tom Peterka 32 6 0 13 Sep 2021
Direct Random Search for Fine Tuning of Deep Reinforcement Learning Policies Sean Gillen Asutay Ozmen Katie Byl 32 0 0 12 Sep 2021
Learning Selective Communication for Multi-Agent Path Finding Ziyuan Ma Yudong Luo Jia Pan AI4CE 83 52 0 12 Sep 2021
Incentivizing an Unknown Crowd Jing Dong Shuai Li Baoxiang Wang OffRL 30 0 0 09 Sep 2021
On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC) Washim Uddin Mondal Mridul Agarwal Vaneet Aggarwal S. Ukkusuri 134 44 0 09 Sep 2021
PowerGym: A Reinforcement Learning Environment for Volt-Var Control in Power Distribution Systems Ting-Han Fan Xian Yeow Lee Yubo Wang 178 24 0 08 Sep 2021
On the impact of MDP design for Reinforcement Learning agents in Resource Management Renato Luiz de Freitas Cunha Luiz Chaimowicz 22 3 0 07 Sep 2021
Guiding Global Placement With Reinforcement Learning Robert M. Kirby Kolby Nottingham Rajarshi Roy Saad Godil Bryan Catanzaro 28 2 0 06 Sep 2021
Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning Ning Wei Jiahua Liang Di Xie Shiliang Pu 50 0 0 06 Sep 2021