v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016

Volodymyr Mnih

Adria Puigdomenech Badia

David Silver

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown

Title
Mastering Atari Games with Limited Data Weirui Ye Shao-Wei Liu Thanard Kurutach Pieter Abbeel Yang Gao VLM 155 242 0 30 Oct 2021
Adaptive Discretization in Online Reinforcement Learning Sean R. Sinclair Siddhartha Banerjee Chao Yu OffRL 87 17 0 29 Oct 2021
Understanding the Effect of Stochasticity in Policy Optimization Jincheng Mei Bo Dai Chenjun Xiao Csaba Szepesvári Dale Schuurmans 75 19 0 29 Oct 2021
Learning to Ground Multi-Agent Communication with Autoencoders Toru Lin Minyoung Huh C. Stauffer Ser-Nam Lim Phillip Isola AI4CE 55 56 0 28 Oct 2021
Bayesian Sequential Optimal Experimental Design for Nonlinear Models Using Policy Gradient Reinforcement Learning Wanggang Shen Xun Huan 66 40 0 28 Oct 2021
URLB: Unsupervised Reinforcement Learning Benchmark Michael Laskin Denis Yarats Hao Liu Kimin Lee Albert Zhan Kevin Lu Catherine Cang Lerrel Pinto Pieter Abbeel SSL OffRL 86 140 0 28 Oct 2021
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection Matteo Papini Andrea Tirinzoni Aldo Pacchiano Marcello Restelli A. Lazaric Matteo Pirotta 88 20 0 27 Oct 2021
Distributed Multi-Agent Deep Reinforcement Learning Framework for Whole-building HVAC Control Vinay Hanumaiah Sahika Genc AI4CE 64 6 0 26 Oct 2021
EnTRPO: Trust Region Policy Optimization Method with Entropy Regularization Sahar Roostaie M. Ebadzadeh 54 3 0 26 Oct 2021
Multi-Agent Advisor Q-Learning Sriram Ganapathi Subramanian Matthew E. Taylor Kate Larson Mark Crowley OffRL 114 10 0 26 Oct 2021
History Aware Multimodal Transformer for Vision-and-Language Navigation Shizhe Chen Pierre-Louis Guhur Cordelia Schmid Ivan Laptev LM&Ro 84 236 0 25 Oct 2021
Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning Kibeom Kim Min Whoo Lee Yoonsung Kim Je-hwan Ryu Minsu Lee Byoung-Tak Zhang 71 8 0 25 Oct 2021
Learning What to Memorize: Using Intrinsic Motivation to Form Useful Memory in Partially Observable Reinforcement Learning Alper Demir 128 3 0 25 Oct 2021
A Deep Reinforcement Learning Approach for Audio-based Navigation and Audio Source Localization in Multi-speaker Environments Petros Giannakopoulos Aggelos Pikrakis Y. Cotronis 143 3 0 25 Oct 2021
Fully Distributed Actor-Critic Architecture for Multitask Deep Reinforcement Learning John Harwell Angel Sylvester Aleksi Tukiainen Enrique Munoz de Cote 56 4 0 23 Oct 2021
A Reinforcement Learning Approach to Parameter Selection for Distributed Optimal Power Flow Tai-Yin Chiu Alyssa Kody Youngdae Kim Kibaek Kim Daniel K. Molzahn 41 21 0 22 Oct 2021
An Economy of Neural Networks: Learning from Heterogeneous Experiences A. Kuriksha 47 8 0 22 Oct 2021
Statistical discrimination in learning agents Edgar A. Duénez-Guzmán Kevin R. McKee Yiran Mao Ben Coppin Silvia Chiappa ... Yoram Bachrach Suzanne Sadedin William S. Isaac K. Tuyls Joel Z Leibo 77 7 0 21 Oct 2021
On games and simulators as a platform for development of artificial intelligence for command and control Vinicius G. Goecks Nicholas R. Waytowich Derrik E. Asher Song Jun Park Mark R. Mittrick ... Anne Logie Mark S. Dennison T. Trout Priya Narayanan Alexander Kott 90 26 0 21 Oct 2021
Actor-critic is implicitly biased towards high entropy optimal policies Yuzheng Hu Ziwei Ji Matus Telgarsky 106 11 0 21 Oct 2021
Neuro-Symbolic Reinforcement Learning with First-Order Logic Daiki Kimura Masaki Ono Subhajit Chaudhury Ryosuke Kohita Akifumi Wachi Don Joven Agravante Michiaki Tatsubori Asim Munawar Alexander G. Gray NAI 97 37 0 21 Oct 2021
Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning Wenzhuo Zhou Ruoqing Zhu Annie Qu 83 22 0 20 Oct 2021
CIM-PPO:Proximal Policy Optimization with Liu-Correntropy Induced Metric Yunxiao Guo Han Long Xiaojun Duan Kaiyuan Feng Maochu Li Xiaying Ma 36 0 0 20 Oct 2021
Faster Algorithm and Sharper Analysis for Constrained Markov Decision Process Tianjiao Li Ziwei Guan Shaofeng Zou Tengyu Xu Yingbin Liang Guanghui Lan 68 30 0 20 Oct 2021
Beyond Exact Gradients: Convergence of Stochastic Soft-Max Policy Gradient Methods with Entropy Regularization Yuhao Ding Junzi Zhang Hyunin Lee Javad Lavaei 123 19 0 19 Oct 2021
Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm Raghuram Bharadwaj Diddigi Prateek Jain P. J S. Bhatnagar CML OffRL 90 3 0 19 Oct 2021
In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications Borja G. Leon Murray Shanahan Francesco Belardinelli AI4CE 100 16 0 18 Oct 2021
RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System Kai Wang Zhene Zou Minghao Zhao Qilin Deng Yue Shang Yile Liang Runze Wu Xudong Shen Tangjie Lyu Changjie Fan OffRL 61 9 0 18 Oct 2021
Electric Vehicle Automatic Charging System Based on Vision-force Fusion Dashun Guo Liang Xie Hongxiang Yu Yue Wang R. Xiong 61 4 0 18 Oct 2021
Improving Robustness of Reinforcement Learning for Power System Control with Adversarial Training Alexander Pan Yongkyun Lee Huan Zhang Yize Chen Yuanyuan Shi AAML 62 17 0 18 Oct 2021
A Dual Approach to Constrained Markov Decision Processes with Entropy Regularization Donghao Ying Yuhao Ding Javad Lavaei 83 34 0 17 Oct 2021
Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation Yang Wu Shirui Feng Guanbin Li Liang Lin 21 0 0 16 Oct 2021
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind Yuan-Fang Wang Fangwei Zhong Jing Xu Yizhou Wang LLMAG 116 70 0 15 Oct 2021
Containerized Distributed Value-Based Multi-Agent Reinforcement Learning Siyang Wu Tonghan Wang Chenghao Li Yang Hu Chongjie Zhang OffRL 50 1 0 15 Oct 2021
Effects of Different Optimization Formulations in Evolutionary Reinforcement Learning on Diverse Behavior Generation Victor Villin Naoki Masuyama Yusuke Nojima 79 2 0 15 Oct 2021
SaLinA: Sequential Learning of Agents Ludovic Denoyer Alfredo De la Fuente S. Duong Jean-Baptiste Gaya Pierre-Alexandre Kamienny Daniel H. Thompson 94 11 0 15 Oct 2021
EdgeML: Towards Network-Accelerated Federated Learning over Wireless Edge Pinyarash Pinyoanuntapong Prabhu Janakaraj Ravikumar Balakrishnan Minwoo Lee Chong Chen Pu Wang 79 13 0 14 Oct 2021
A Framework for Learning to Request Rich and Contextually Useful Information from Humans Khanh Nguyen Yonatan Bisk Hal Daumé 117 16 0 14 Oct 2021
NeurIPS 2021 Competition IGLU: Interactive Grounded Language Understanding in a Collaborative Environment Julia Kiseleva Ziming Li Mohammad Aliannejadi Shrestha Mohanty Maartje ter Hoeve ... Arthur Szlam Yuxuan Sun Katja Hofmann Michel Galley Ahmed Hassan Awadallah LLMAG 133 15 0 13 Oct 2021
Evaluation of Abstractive Summarisation Models with Machine Translation in Deliberative Processes Miguel Arana Catania Rob Procter Yulan He Maria Liakata 21 3 0 12 Oct 2021
Learning to Coordinate in Multi-Agent Systems: A Coordinated Actor-Critic Algorithm and Finite-Time Guarantees Siliang Zeng Tianyi Chen Alfredo García Mingyi Hong 92 11 0 11 Oct 2021
Learning a subspace of policies for online adaptation in Reinforcement Learning Jean-Baptiste Gaya Laure Soulier Ludovic Denoyer OffRL 95 15 0 11 Oct 2021
REIN-2: Giving Birth to Prepared Reinforcement Learning Agents Using Reinforcement Learning Agents A. Lazaridis I. Vlahavas OffRL 60 2 0 11 Oct 2021
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs Tianwei Ni Benjamin Eysenbach Ruslan Salakhutdinov 83 110 0 11 Oct 2021
Reinforcement Learning for Systematic FX Trading Gabriel Borrageiro Nikan B. Firoozye P. Barucca 97 7 0 10 Oct 2021
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization S. Gu Manfred Diaz Daniel Freeman Hiroki Furuta Seyed Kamyar Seyed Ghasemipour Anton Raichuk Byron David Erik Frey Erwin Coumans Olivier Bachem 80 14 0 10 Oct 2021
Situated Dialogue Learning through Procedural Environment Generation Prithviraj Ammanabrolu Renee Jia Mark O. Riedl 158 14 0 07 Oct 2021
Offline RL With Resource Constrained Online Deployment Jayanth Reddy Regatti A. Deshmukh Frank Cheng Young Hun Jung Abhishek Gupta Ürün Dogan OffRL 74 14 0 07 Oct 2021
Hybrid Pointer Networks for Traveling Salesman Problems Optimization Ahmed Stohy Heba-Tullah Abdelhakam Sayed Ali Mohammed Elhenawy Abdallah A. Hassan Mahmoud Masoud Sébastien Glaser A. Rakotonirainy 60 14 0 06 Oct 2021
Optimized Recommender Systems with Deep Reinforcement Learning Lucas Farris OffRL 25 0 0 06 Oct 2021