1602.01783
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing "Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Mastering Atari Games with Limited Data
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
VLM
155
242
0
30 Oct 2021
Adaptive Discretization in Online Reinforcement Learning
Sean R. Sinclair
Siddhartha Banerjee
Chao Yu
OffRL
87
17
0
29 Oct 2021
Understanding the Effect of Stochasticity in Policy Optimization
Jincheng Mei
Bo Dai
Chenjun Xiao
Csaba Szepesvári
Dale Schuurmans
75
19
0
29 Oct 2021
Learning to Ground Multi-Agent Communication with Autoencoders
Toru Lin
Minyoung Huh
C. Stauffer
Ser-Nam Lim
Phillip Isola
AI4CE
55
56
0
28 Oct 2021
Bayesian Sequential Optimal Experimental Design for Nonlinear Models Using Policy Gradient Reinforcement Learning
Wanggang Shen
Xun Huan
66
40
0
28 Oct 2021
URLB: Unsupervised Reinforcement Learning Benchmark
Michael Laskin
Denis Yarats
Hao Liu
Kimin Lee
Albert Zhan
Kevin Lu
Catherine Cang
Lerrel Pinto
Pieter Abbeel
SSL
OffRL
86
140
0
28 Oct 2021
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Matteo Papini
Andrea Tirinzoni
Aldo Pacchiano
Marcello Restelli
A. Lazaric
Matteo Pirotta
88
20
0
27 Oct 2021
Distributed Multi-Agent Deep Reinforcement Learning Framework for Whole-building HVAC Control
Vinay Hanumaiah
Sahika Genc
AI4CE
64
6
0
26 Oct 2021
EnTRPO: Trust Region Policy Optimization Method with Entropy Regularization
Sahar Roostaie
M. Ebadzadeh
54
3
0
26 Oct 2021
Multi-Agent Advisor Q-Learning
Sriram Ganapathi Subramanian
Matthew E. Taylor
Kate Larson
Mark Crowley
OffRL
114
10
0
26 Oct 2021
History Aware Multimodal Transformer for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Cordelia Schmid
Ivan Laptev
LM&Ro
84
236
0
25 Oct 2021
Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning
Kibeom Kim
Min Whoo Lee
Yoonsung Kim
Je-hwan Ryu
Minsu Lee
Byoung-Tak Zhang
71
8
0
25 Oct 2021
Learning What to Memorize: Using Intrinsic Motivation to Form Useful Memory in Partially Observable Reinforcement Learning
Alper Demir
128
3
0
25 Oct 2021
A Deep Reinforcement Learning Approach for Audio-based Navigation and Audio Source Localization in Multi-speaker Environments
Petros Giannakopoulos
Aggelos Pikrakis
Y. Cotronis
143
3
0
25 Oct 2021
Fully Distributed Actor-Critic Architecture for Multitask Deep Reinforcement Learning
John Harwell
Angel Sylvester
Aleksi Tukiainen
Enrique Munoz de Cote
56
4
0
23 Oct 2021
A Reinforcement Learning Approach to Parameter Selection for Distributed Optimal Power Flow
Tai-Yin Chiu
Alyssa Kody
Youngdae Kim
Kibaek Kim
Daniel K. Molzahn
41
21
0
22 Oct 2021
An Economy of Neural Networks: Learning from Heterogeneous Experiences
A. Kuriksha
47
8
0
22 Oct 2021
Statistical discrimination in learning agents
Edgar A. Duéñez-Guzmán
Kevin R. McKee
Yiran Mao
Ben Coppin
Silvia Chiappa
...
Yoram Bachrach
Suzanne Sadedin
William S. Isaac
K. Tuyls
Joel Z Leibo
77
7
0
21 Oct 2021
On games and simulators as a platform for development of artificial intelligence for command and control
Vinicius G. Goecks
Nicholas R. Waytowich
Derrik E. Asher
Song Jun Park
Mark R. Mittrick
...
Anne Logie
Mark S. Dennison
T. Trout
Priya Narayanan
Alexander Kott
90
26
0
21 Oct 2021
Actor-critic is implicitly biased towards high entropy optimal policies
Yuzheng Hu
Ziwei Ji
Matus Telgarsky
106
11
0
21 Oct 2021
Neuro-Symbolic Reinforcement Learning with First-Order Logic
Daiki Kimura
Masaki Ono
Subhajit Chaudhury
Ryosuke Kohita
Akifumi Wachi
Don Joven Agravante
Michiaki Tatsubori
Asim Munawar
Alexander G. Gray
NAI
97
37
0
21 Oct 2021
Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning
Wenzhuo Zhou
Ruoqing Zhu
Annie Qu
83
22
0
20 Oct 2021
CIM-PPO: Proximal Policy Optimization with Liu-Correntropy Induced Metric
Yunxiao Guo
Han Long
Xiaojun Duan
Kaiyuan Feng
Maochu Li
Xiaying Ma
36
0
0
20 Oct 2021
Faster Algorithm and Sharper Analysis for Constrained Markov Decision Process
Tianjiao Li
Ziwei Guan
Shaofeng Zou
Tengyu Xu
Yingbin Liang
Guanghui Lan
68
30
0
20 Oct 2021
Beyond Exact Gradients: Convergence of Stochastic Soft-Max Policy Gradient Methods with Entropy Regularization
Yuhao Ding
Junzi Zhang
Hyunin Lee
Javad Lavaei
123
19
0
19 Oct 2021
Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm
Raghuram Bharadwaj Diddigi
Prateek Jain
Prabuchandran K. J.
S. Bhatnagar
CML
OffRL
90
3
0
19 Oct 2021
In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications
Borja G. Leon
Murray Shanahan
Francesco Belardinelli
AI4CE
100
16
0
18 Oct 2021
RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System
Kai Wang
Zhene Zou
Minghao Zhao
Qilin Deng
Yue Shang
Yile Liang
Runze Wu
Xudong Shen
Tangjie Lyu
Changjie Fan
OffRL
61
9
0
18 Oct 2021
Electric Vehicle Automatic Charging System Based on Vision-force Fusion
Dashun Guo
Liang Xie
Hongxiang Yu
Yue Wang
R. Xiong
61
4
0
18 Oct 2021
Improving Robustness of Reinforcement Learning for Power System Control with Adversarial Training
Alexander Pan
Yongkyun Lee
Huan Zhang
Yize Chen
Yuanyuan Shi
AAML
62
17
0
18 Oct 2021
A Dual Approach to Constrained Markov Decision Processes with Entropy Regularization
Donghao Ying
Yuhao Ding
Javad Lavaei
83
34
0
17 Oct 2021
Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation
Yang Wu
Shirui Feng
Guanbin Li
Liang Lin
21
0
0
16 Oct 2021
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind
Yuan-Fang Wang
Fangwei Zhong
Jing Xu
Yizhou Wang
LLMAG
116
70
0
15 Oct 2021
Containerized Distributed Value-Based Multi-Agent Reinforcement Learning
Siyang Wu
Tonghan Wang
Chenghao Li
Yang Hu
Chongjie Zhang
OffRL
50
1
0
15 Oct 2021
Effects of Different Optimization Formulations in Evolutionary Reinforcement Learning on Diverse Behavior Generation
Victor Villin
Naoki Masuyama
Yusuke Nojima
79
2
0
15 Oct 2021
SaLinA: Sequential Learning of Agents
Ludovic Denoyer
Alfredo De la Fuente
S. Duong
Jean-Baptiste Gaya
Pierre-Alexandre Kamienny
Daniel H. Thompson
94
11
0
15 Oct 2021
EdgeML: Towards Network-Accelerated Federated Learning over Wireless Edge
Pinyarash Pinyoanuntapong
Prabhu Janakaraj
Ravikumar Balakrishnan
Minwoo Lee
Chong Chen
Pu Wang
79
13
0
14 Oct 2021
A Framework for Learning to Request Rich and Contextually Useful Information from Humans
Khanh Nguyen
Yonatan Bisk
Hal Daumé
117
16
0
14 Oct 2021
NeurIPS 2021 Competition IGLU: Interactive Grounded Language Understanding in a Collaborative Environment
Julia Kiseleva
Ziming Li
Mohammad Aliannejadi
Shrestha Mohanty
Maartje ter Hoeve
...
Arthur Szlam
Yuxuan Sun
Katja Hofmann
Michel Galley
Ahmed Hassan Awadallah
LLMAG
133
15
0
13 Oct 2021
Evaluation of Abstractive Summarisation Models with Machine Translation in Deliberative Processes
Miguel Arana Catania
Rob Procter
Yulan He
Maria Liakata
21
3
0
12 Oct 2021
Learning to Coordinate in Multi-Agent Systems: A Coordinated Actor-Critic Algorithm and Finite-Time Guarantees
Siliang Zeng
Tianyi Chen
Alfredo García
Mingyi Hong
92
11
0
11 Oct 2021
Learning a subspace of policies for online adaptation in Reinforcement Learning
Jean-Baptiste Gaya
Laure Soulier
Ludovic Denoyer
OffRL
95
15
0
11 Oct 2021
REIN-2: Giving Birth to Prepared Reinforcement Learning Agents Using Reinforcement Learning Agents
A. Lazaridis
I. Vlahavas
OffRL
60
2
0
11 Oct 2021
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Tianwei Ni
Benjamin Eysenbach
Ruslan Salakhutdinov
83
110
0
11 Oct 2021
Reinforcement Learning for Systematic FX Trading
Gabriel Borrageiro
Nikan B. Firoozye
P. Barucca
97
7
0
10 Oct 2021
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization
S. Gu
Manfred Diaz
Daniel Freeman
Hiroki Furuta
Seyed Kamyar Seyed Ghasemipour
Anton Raichuk
Byron David
Erik Frey
Erwin Coumans
Olivier Bachem
80
14
0
10 Oct 2021
Situated Dialogue Learning through Procedural Environment Generation
Prithviraj Ammanabrolu
Renee Jia
Mark O. Riedl
158
14
0
07 Oct 2021
Offline RL With Resource Constrained Online Deployment
Jayanth Reddy Regatti
A. Deshmukh
Frank Cheng
Young Hun Jung
Abhishek Gupta
Ürün Dogan
OffRL
74
14
0
07 Oct 2021
Hybrid Pointer Networks for Traveling Salesman Problems Optimization
Ahmed Stohy
Heba-Tullah Abdelhakam
Sayed Ali
Mohammed Elhenawy
Abdallah A. Hassan
Mahmoud Masoud
Sébastien Glaser
A. Rakotonirainy
60
14
0
06 Oct 2021
Optimized Recommender Systems with Deep Reinforcement Learning
Lucas Farris
OffRL
25
0
0
06 Oct 2021