Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 1,575 papers shown
Title
Reinforcement Learning for Feedback-Enabled Cyber Resilience
Yunhan Huang
Linan Huang
Quanyan Zhu
23
67
0
02 Jul 2021
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Elliot Chane-Sane
Cordelia Schmid
Ivan Laptev
30
141
0
01 Jul 2021
Applications of the Free Energy Principle to Machine Learning and Neuroscience
Beren Millidge
DRL
33
7
0
30 Jun 2021
Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in Edge Industrial IoT
Wanlu Lei
Yu Ye
Ming Xiao
Mikael Skoglund
Zhu Han
28
1
0
30 Jun 2021
Core Challenges in Embodied Vision-Language Planning
Jonathan M Francis
Nariaki Kitamura
Felix Labelle
Xiaopeng Lu
Ingrid Navarro
Jean Oh
LM&Ro
54
45
0
26 Jun 2021
Policy Smoothing for Provably Robust Reinforcement Learning
Aounon Kumar
Alexander Levine
S. Feizi
AAML
20
56
0
21 Jun 2021
Distributed Heuristic Multi-Agent Path Finding with Communication
Ziyuan Ma
Yudong Luo
Hang Ma
27
69
0
21 Jun 2021
Prediction-Free, Real-Time Flexible Control of Tidal Lagoons through Proximal Policy Optimisation: A Case Study for the Swansea Lagoon
Túlio Marcondes Moreira
Jackson Geraldo de Faria
Pedro O. S. Vaz de Melo
Luiz Chaimowicz
G. Medeiros-Ribeiro
33
10
0
18 Jun 2021
Towards Distraction-Robust Active Visual Tracking
Fangwei Zhong
Peng Sun
Wenhan Luo
Tingyun Yan
Yizhou Wang
AAML
30
33
0
18 Jun 2021
A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings
Eugene Vinitsky
Raphael Köster
J. Agapiou
Edgar A. Duénez-Guzmán
A. Vezhnevets
Joel Z Leibo
32
37
0
16 Jun 2021
Vision-Language Navigation with Random Environmental Mixup
Chong Liu
Fengda Zhu
Xiaojun Chang
Xiaodan Liang
Zongyuan Ge
Yi-Dong Shen
LM&Ro
56
86
0
15 Jun 2021
Unsupervised Learning of Visual 3D Keypoints for Control
Boyuan Chen
Pieter Abbeel
Deepak Pathak
3DPC
SSL
27
39
0
14 Jun 2021
Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation
Anas Barakat
Pascal Bianchi
Julien Lehmann
32
9
0
14 Jun 2021
Characterizing the Gap Between Actor-Critic and Policy Gradient
Junfeng Wen
Saurabh Kumar
Ramki Gummadi
Dale Schuurmans
34
15
0
13 Jun 2021
A New Formalism, Method and Open Issues for Zero-Shot Coordination
Johannes Treutlein
Michael Dennis
Caspar Oesterheld
Jakob N. Foerster
OffRL
29
35
0
11 Jun 2021
A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement Learning
Yonggan Fu
Yongan Zhang
Chaojian Li
Zhongzhi Yu
Yingyan Lin
39
6
0
11 Jun 2021
Going Beyond Linear Transformers with Recurrent Fast Weight Programmers
Kazuki Irie
Imanol Schlag
Róbert Csordás
Jürgen Schmidhuber
33
57
0
11 Jun 2021
Taylor Expansion of Discount Factors
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
34
5
0
11 Jun 2021
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Xia Hu
Ji Liu
25
117
0
11 Jun 2021
Simplifying Deep Reinforcement Learning via Self-Supervision
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
SSL
54
15
0
10 Jun 2021
Eye of the Beholder: Improved Relation Generalization for Text-based Reinforcement Learning Agents
K. Murugesan
Subhajit Chaudhury
Kartik Talamadupula
41
5
0
09 Jun 2021
Towards Practical Credit Assignment for Deep Reinforcement Learning
Vyacheslav Alipov
Riley Simmons-Edler
N.Yu. Putintsev
Pavel Kalinin
Dmitry Vetrov
OffRL
35
11
0
08 Jun 2021
Hierarchical Representation Learning for Markov Decision Processes
Lorenzo Steccanella
Simone Totaro
Anders Jonsson
28
4
0
03 Jun 2021
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
Changnan Xiao
Haosen Shi
Jiajun Fan
Shihong Deng
26
5
0
01 Jun 2021
Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning
Jiahui Li
Kun Kuang
Baoxiang Wang
Furui Liu
Long Chen
Fei Wu
Jun Xiao
OffRL
27
60
0
01 Jun 2021
From Motor Control to Team Play in Simulated Humanoid Football
Siqi Liu
Guy Lever
Zhe Wang
J. Merel
S. M. Ali Eslami
...
Tuomas Haarnoja
Brendan D. Tracey
K. Tuyls
T. Graepel
N. Heess
31
130
0
25 May 2021
A Heuristically Assisted Deep Reinforcement Learning Approach for Network Slice Placement
José Jurandir Alves Esteves
Amina Boubendir
Fabrice Michel Guillemin
Pierre Sens
31
32
0
14 May 2021
A Survey on Reinforcement Learning-Aided Caching in Mobile Edge Networks
Nikolaos Nomikos
Spyros Zoupanos
Themistoklis Charalambous
I. Krikidis
Athina P. Petropulu
31
1
0
12 May 2021
Hierarchical RNNs-Based Transformers MADDPG for Mixed Cooperative-Competitive Environments
Xiaolong Wei
Lifang Yang
Xianglin Huang
Gang Cao
Zhulin Tao
Zhengyang Du
Jing An
34
6
0
11 May 2021
A Deep Reinforcement Learning Approach to Audio-Based Navigation in a Multi-Speaker Environment
Petros Giannakopoulos
A. Pikrakis
Y. Cotronis
24
7
0
10 May 2021
Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey
Jinjie Ni
Tom Young
Vlad Pandelea
Fuzhao Xue
Min Zhang
66
270
0
10 May 2021
Pervasive AI for IoT applications: A Survey on Resource-efficient Distributed Artificial Intelligence
Emna Baccour
N. Mhaisen
A. Abdellatif
A. Erbad
Amr M. Mohamed
Mounir Hamdi
Mohsen Guizani
37
87
0
04 May 2021
Reinforcement Learning for Ridesharing: An Extended Survey
Zhiwei Qin
Hongtu Zhu
Jieping Ye
44
84
0
03 May 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
22
8
0
03 May 2021
End-to-End Intersection Handling using Multi-Agent Deep Reinforcement Learning
Alessandro Paolo Capasso
Paolo Maramotti
Anthony DellÉva
A. Broggi
73
18
0
28 Apr 2021
Learning Latent Graph Dynamics for Visual Manipulation of Deformable Objects
Xiao Ma
David Hsu
W. Lee
AI4CE
39
28
0
25 Apr 2021
Safe Chance Constrained Reinforcement Learning for Batch Process Control
M. Mowbray
Panagiotis Petsagkourakis
Ehecatl Antonio del Rio Chanona
Dongda Zhang
OffRL
37
34
0
23 Apr 2021
Graph Neural Network Reinforcement Learning for Autonomous Mobility-on-Demand Systems
Daniele Gammelli
Kaidi Yang
James Harrison
Filipe Rodrigues
Francisco Câmara Pereira
Marco Pavone
GNN
64
46
0
23 Apr 2021
Formula RL: Deep Reinforcement Learning for Autonomous Racing using Telemetry Data
Adrian Remonda
Sarah Krebs
Eduardo E. Veas
Granit Luzhnica
Roman Kern
OffRL
37
23
0
22 Apr 2021
CVLight: Decentralized Learning for Adaptive Traffic Signal Control with Connected Vehicles
Zhaobin Mo
Wangzhi Li
Yongjie Fu
Kangrui Ruan
Xuan Di
24
40
0
21 Apr 2021
Training Value-Aligned Reinforcement Learning Agents Using a Normative Prior
Md Sultan al Nahian
Spencer Frazier
Brent Harrison
Mark O. Riedl
32
18
0
19 Apr 2021
Two-stage training algorithm for AI robot soccer
Taeyoung Kim
L. Vecchietti
Kyujin Choi
Sanem Sariel
Dongsoo Har
21
7
0
13 Apr 2021
Survey on reinforcement learning for language processing
Víctor Uc Cetina
Nicolás Navarro-Guerrero
A. Martín-González
C. Weber
S. Wermter
OffRL
33
101
0
12 Apr 2021
Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation
Emilio Parisotto
Ruslan Salakhutdinov
44
44
0
04 Apr 2021
A Dynamics Perspective of Pursuit-Evasion Games of Intelligent Agents with the Ability to Learn
Hao Xiong
Huanhui Cao
Lin Zhang
Wenjie Lu
24
3
0
03 Apr 2021
Storchastic: A Framework for General Stochastic Automatic Differentiation
Emile van Krieken
Jakub M. Tomczak
A. T. Teije
ODL
OffRL
31
15
0
01 Apr 2021
SOON: Scenario Oriented Object Navigation with Graph-based Exploration
Fengda Zhu
Xiwen Liang
Yi Zhu
Xiaojun Chang
Xiaodan Liang
27
122
0
31 Mar 2021
Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
Florian Laurent
Manuel Schneider
Christian Scheller
J. Watson
Jiaoyang Li
...
Nilabha Bhattacharya
Shivam Agarwal
A. Egli
Erik Nygren
Sharada Mohanty
36
27
0
30 Mar 2021
Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved Complexity
Shaocong Ma
Ziyi Chen
Yi Zhou
Shaofeng Zou
19
11
0
30 Mar 2021
MaAST: Map Attention with Semantic Transformersfor Efficient Visual Navigation
Zachary Seymour
Kowshik Thopalli
Niluthpol Chowdhury Mithun
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
3DPC
24
18
0
21 Mar 2021
Previous
1
2
3
...
15
16
17
...
30
31
32
Next