v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016

Volodymyr Mnih

Adria Puigdomenech Badia

David Silver

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,592 papers shown

Title
Learning for Visual Navigation by Imagining the Success M. Moghaddam Ehsan Abbasnejad Qi Wu Javen Qinfeng Shi Anton Van Den Hengel 66 3 0 28 Feb 2021
Multi-agent Reinforcement Learning in OpenSpiel: A Reproduction Report Michael Walton Viliam Lisý 70 5 0 27 Feb 2021
Low-Precision Reinforcement Learning: Running Soft Actor-Critic in Half Precision Johan Bjorck Xiangyu Chen Christopher De Sa Carla P. Gomes Kilian Q. Weinberger 137 6 0 26 Feb 2021
Synthetic Returns for Long-Term Credit Assignment David Raposo Samuel Ritter Adam Santoro Greg Wayne T. Weber M. Botvinick H. V. Hasselt Francis Song AI4TS 107 35 0 24 Feb 2021
Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments Dmitry Ivanov Vladimir Egorov A. Shpilman 68 5 0 24 Feb 2021
FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive Parallelism Jenny Yang Seongmin Hong Joo-Young Kim 61 18 0 24 Feb 2021
Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL Siddharth Mysore B. Mabsout R. Mancuso Kate Saenko OffRL 47 9 0 23 Feb 2021
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality Tengyu Xu Zhuoran Yang Zhaoran Wang Yingbin Liang OffRL 114 25 0 23 Feb 2021
Differentiable Logic Machines Matthieu Zimmer Xuening Feng Claire Glanois Zhaohui Jiang Jianyi Zhang Paul Weng Li Dong Hao Jianye Liu Wulong AI4CE 101 23 0 23 Feb 2021
Communication Efficient Parallel Reinforcement Learning Mridul Agarwal Bhargav Ganguly Vaneet Aggarwal 77 11 0 22 Feb 2021
Delayed Rewards Calibration via Reward Empirical Sufficiency Yixuan Liu Hu Wang Xiaowei Wang Xiaoyue Sun Liuyue Jiang Minhui Xue 88 0 0 21 Feb 2021
How To Train Your HERON Antoine Richard Stéphanie Aravecchia Thomas Schillaci Matthieu Geist C´edric Pradalier 41 3 0 20 Feb 2021
Decoupling Value and Policy for Generalization in Reinforcement Learning Roberta Raileanu Rob Fergus DRL OffRL 121 99 0 20 Feb 2021
On Proximal Policy Optimization's Heavy-tailed Gradients Saurabh Garg Joshua Zhanson Emilio Parisotto Adarsh Prasad J. Zico Kolter Zachary Chase Lipton Sivaraman Balakrishnan Ruslan Salakhutdinov Pradeep Ravikumar 100 13 0 20 Feb 2021
Training a Resilient Q-Network against Observational Interference Chao-Han Huck Yang I-Te Danny Hung Ouyang Yi Pin-Yu Chen OOD 73 15 0 18 Feb 2021
State Entropy Maximization with Random Encoders for Efficient Exploration Younggyo Seo Lili Chen Jinwoo Shin Honglak Lee Pieter Abbeel Kimin Lee 119 129 0 18 Feb 2021
Adaptive Rational Activations to Boost Deep Reinforcement Learning Quentin Delfosse P. Schramowski Martin Mundt Alejandro Molina Kristian Kersting 141 15 0 18 Feb 2021
Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving Haochen Liu Zhiyu Huang Jingda Wu Chen Lv 102 74 0 18 Feb 2021
Learning Memory-Dependent Continuous Control from Demonstrations Siqing Hou Dongqi Han Jun Tani 30 0 0 18 Feb 2021
Multi-Agent Reinforcement Learning of 3D Furniture Layout Simulation in Indoor Graphics Scenes Xinhan Di Pengqian Yu AI4CE 3DV 92 12 0 18 Feb 2021
On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method Junyu Zhang Chengzhuo Ni Zheng Yu Csaba Szepesvári Mengdi Wang 130 69 0 17 Feb 2021
TradeR: Practical Deep Hierarchical Reinforcement Learning for Trade Execution Karush Suri Xiaolong Shi Konstantinos Plataniotis Y. Lawryshyn OffRL 47 4 0 16 Feb 2021
Transferring Domain Knowledge with an Adviser in Continuous Tasks Rukshan Wijesinghe Kasun Vithanage Dumindu Tissera A. Xavier Subha Fernando Jayathu Samarawickrama CLL 50 0 0 16 Feb 2021
A Survey of Machine Learning for Computer Architecture and Systems Nan Wu Yuan Xie AI4TS AI4CE 108 153 0 16 Feb 2021
Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing Filippos Christianos Georgios Papoudakis Arrasy Rahman Stefano V. Albrecht 94 121 0 15 Feb 2021
Tractable structured natural gradient descent using local parameterizations Wu Lin Frank Nielsen Mohammad Emtiyaz Khan Mark Schmidt 146 30 0 15 Feb 2021
Reinforcement Learning for IoT Security: A Comprehensive Survey Aashma Uprety D. Rawat AAML 93 124 0 14 Feb 2021
Resilient Machine Learning for Networked Cyber Physical Systems: A Survey for Machine Learning Security to Securing Machine Learning for CPS Felix O. Olowononi D. Rawat Chunmei Liu 95 138 0 14 Feb 2021
Interactive Learning from Activity Description Khanh Nguyen Dipendra Kumar Misra Robert Schapire Miroslav Dudík Patrick Shafto 115 35 0 13 Feb 2021
Modelling Cooperation in Network Games with Spatio-Temporal Complexity Michiel A. Bakker Richard Everett Laura Weidinger Iason Gabriel William S. Isaac Joel Z Leibo Edward Hughes 69 5 0 13 Feb 2021
Q-Value Weighted Regression: Reinforcement Learning with Limited Data Piotr Kozakowski Lukasz Kaiser Henryk Michalewski Afroz Mohiuddin Katarzyna Kañska OffRL 81 5 0 12 Feb 2021
Deep Reinforcement Agent for Scheduling in HPC Yuping Fan Z. Lan T. Childers Paul M. Rich W. Allcock M. Papka 57 37 0 11 Feb 2021
Derivative-Free Reinforcement Learning: A Review Hong Qian Yang Yu OffRL 144 42 0 10 Feb 2021
Improving Model-Based Reinforcement Learning with Internal State Representations through Self-Supervision Julien Scholz C. Weber Muhammad Burhan Hafez S. Wermter 46 3 0 10 Feb 2021
Learning Equational Theorem Proving Jelle Piepenbrock Tom Heskes Mikolávs Janota Josef Urban AIMat LRM 38 4 0 10 Feb 2021
Policy Augmentation: An Exploration Strategy for Faster Convergence of Deep Reinforcement Learning Algorithms A. Mahyari 22 1 0 10 Feb 2021
Adaptive Pairwise Weights for Temporal Credit Assignment Zeyu Zheng Risto Vuorio Richard L. Lewis Satinder Singh 68 5 0 09 Feb 2021
Learning State Representations from Random Deep Action-conditional Predictions Zeyu Zheng Vivek Veeriah Risto Vuorio Richard L. Lewis Satinder Singh 70 5 0 09 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency Florian E. Dorner 55 13 0 09 Feb 2021
How to Stay Curious while Avoiding Noisy TVs using Aleatoric Uncertainty Estimation Augustine N. Mavor-Parker K. Young Caswell Barry Lewis D. Griffin 95 22 0 08 Feb 2021
Adversarially Guided Actor-Critic Yannis Flet-Berliac Johan Ferret Olivier Pietquin Philippe Preux Matthieu Geist 77 73 0 08 Feb 2021
Towards Hierarchical Task Decomposition using Deep Reinforcement Learning for Pick and Place Subtasks Luca Marzari Ameya Pore Diego DallÁlba G. Aragon-Camarasa Alessandro Farinelli Paolo Fiorini 87 29 0 08 Feb 2021
Concentrated Document Topic Model Hao Lei Ying Chen 31 1 0 06 Feb 2021
Rethinking the Implementation Tricks and Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning Jian Hu Siyang Jiang Seth Austin Harding Haibin Wu Shihua Liao 217 91 0 06 Feb 2021
An advantage actor-critic algorithm for robotic motion planning in dense and dynamic scenarios Chengmin Zhou Bingding Huang Pasi Fränti 47 1 0 05 Feb 2021
Experience-Based Heuristic Search: Robust Motion Planning with Deep Q-Learning Julian Bernhard Robert Gieselmann Klemens Esterle Alois Knoll 42 17 0 05 Feb 2021
A review of motion planning algorithms for intelligent robotics Chengmin Zhou Bingding Huang Pasi Fränti 77 4 0 04 Feb 2021
Proactive and AoI-aware Failure Recovery for Stateful NFV-enabled Zero-Touch 6G Networks: Model-Free DRL Approach Amirhossein Shaghaghi Abolfazl Zakeri Nader Mokari M. Javan M. Behdadfar Eduard Axel Jorswieck 36 20 0 02 Feb 2021
DRLDO: A novel DRL based De-ObfuscationSystem for Defense against Metamorphic Malware Mohit Sewak S. K. Sahay Hemant Rathore 43 13 0 01 Feb 2021
Deep Reinforcement Learning Aided Monte Carlo Tree Search for MIMO Detection Tz-Wei Mo Ronald Y. Chang Te-Yi Kan 64 1 0 30 Jan 2021