Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Hierarchical Object-to-Zone Graph for Object Navigation
Sixian Zhang
Xinhang Song
Yubing Bai
Weijie Li
Yakui Chu
Shuqiang Jiang
131
69
0
05 Sep 2021
Soft Hierarchical Graph Recurrent Networks for Many-Agent Partially Observable Environments
Zhenhui Ye
Xiaohong Jiang
Guang-hua Song
Bowei Yang
35
1
0
05 Sep 2021
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms
Ruizhi Chen
Xiaoyu Wu
Yansong Pan
Kaizhao Yuan
Ling Li
...
Shaohui Peng
Xishan Zhang
Zidong Du
Qi Guo
Yunji Chen
OffRL
61
3
0
04 Sep 2021
Event-Based Communication in Distributed Q-Learning
Daniel Jarne Ornia
M. Mazo
65
2
0
03 Sep 2021
A Comparative Study of Algorithms for Intelligent Traffic Signal Control
Hrishit Chaudhuri
Vibha Masti
Vishruth Veerendranath
Dr. S Natarajan
52
9
0
02 Sep 2021
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation
Tiantian Zhang
Xueqian Wang
Bin Liang
Bo Yuan
OffRL
80
18
0
01 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
100
84
0
01 Sep 2021
WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU
Tian-Shing Lan
Sunil Srinivasa
Huan Wang
Stephan Zheng
AI4CE
68
13
0
31 Aug 2021
Phy-Q as a measure for physical reasoning intelligence
Cheng Xue
Vimukthini Pinto
C. Gamage
Ekaterina Nikonova
Peng Zhang
Jochen Renz
LRM
77
12
0
31 Aug 2021
Learning Practically Feasible Policies for Online 3D Bin Packing
Hang Zhao
Chenyang Zhu
Xin Xu
Hui Huang
Kai Xu
OffRL
89
84
0
31 Aug 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
190
680
0
30 Aug 2021
A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning
Tianchi Cai
Wenpeng Zhang
Lihong Gu
Xiaodong Zeng
Jinjie Gu
21
0
0
29 Aug 2021
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges
Jiaju Qi
Qihao Zhou
Lei Lei
Kan Zheng
FedML
111
160
0
26 Aug 2021
Vision-Language Navigation: A Survey and Taxonomy
Wansen Wu
Tao Chang
Xinmeng Li
LM&Ro
71
24
0
26 Aug 2021
Responsive Regulation of Dynamic UAV Communication Networks Based on Deep Reinforcement Learning
Ran Zhang
Duc Minh Nguyen
Nguyen
Miao Wang
L. Cai
Xuemin
X. Shen
36
3
0
25 Aug 2021
Entropy-Aware Model Initialization for Effective Exploration in Deep Reinforcement Learning
Sooyoung Jang
Hyungil Kim
55
5
0
24 Aug 2021
MimicBot: Combining Imitation and Reinforcement Learning to win in Bot Bowl
Nicola Pezzotti
63
1
0
21 Aug 2021
Settling the Variance of Multi-Agent Policy Gradients
J. Kuba
Muning Wen
Yaodong Yang
Linghui Meng
Shangding Gu
Haifeng Zhang
D. Mguni
Jun Wang
79
67
0
19 Aug 2021
End-to-End Urban Driving by Imitating a Reinforcement Learning Coach
Zhejun Zhang
Alexander Liniger
Dengxin Dai
Feng Yu
Luc Van Gool
116
211
0
18 Aug 2021
Using Cyber Terrain in Reinforcement Learning for Penetration Testing
Rohit Gangupantulu
Tyler Cody
Paul Park
Abdul Rahman
Logan Eisenbeiser
Dan Radke
Ryan Clark
56
38
0
16 Aug 2021
Continual Backprop: Stochastic Gradient Descent with Persistent Randomness
Shibhansh Dohare
R. Sutton
A. R. Mahmood
CLL
151
82
0
13 Aug 2021
Low-level Pose Control of Tilting Multirotor for Wall Perching Tasks Using Reinforcement Learning
Hyungyu Lee
Myeongwoo Jeong
Chanyoung Kim
Hyungtae Lim
Changgue Park
Sungwon Hwang
Hyun Myung
64
4
0
11 Aug 2021
DQ-GAT: Towards Safe and Efficient Autonomous Driving with Deep Q-Learning and Graph Attention Networks
Peide Cai
Hengli Wang
Yuxiang Sun
Ming-Yuan Liu
GNN
97
39
0
11 Aug 2021
A Survey on Deep Reinforcement Learning for Data Processing and Analytics
Qingpeng Cai
Can Cui
Yiyuan Xiong
Wei Wang
Zhongle Xie
Meihui Zhang
OffRL
58
32
0
10 Aug 2021
Deep Reinforcement Learning for Demand Driven Services in Logistics and Transportation Systems: A Survey
Zefang Zong
Tao Feng
Tong Xia
Depeng Jin
Yong Li
57
3
0
10 Aug 2021
Meta-Reinforcement Learning in Broad and Non-Parametric Environments
Zhenshan Bing
Lukas Knak
F. O. Morin
Kai-Qi Huang
Alois C. Knoll
OffRL
74
19
0
08 Aug 2021
Towards real-world navigation with deep differentiable planners
Shu Ishida
João F. Henriques
OffRL
46
6
0
08 Aug 2021
Rethinking of AlphaStar
Ruoxi Liu
57
2
0
07 Aug 2021
Semantic Tracklets: An Object-Centric Representation for Visual Multi-Agent Reinforcement Learning
Iou-Jen Liu
Zhongzheng Ren
Raymond A. Yeh
Alex Schwing
74
15
0
06 Aug 2021
Deep Reinforcement Learning for Intelligent Reflecting Surface-assisted D2D Communications
K. Nguyen
Antonino Masaracchia
Cheng Yin
L. Nguyen
O. Dobre
T. Duong
33
11
0
06 Aug 2021
RIS-assisted UAV Communications for IoT with Wireless Power Transfer Using Deep Reinforcement Learning
K. Nguyen
Antonino Masaracchia
Vishal Sharma
H. Vincent Poor
T. Duong
31
89
0
05 Aug 2021
Reinforcement Learning for Intelligent Healthcare Systems: A Comprehensive Survey
A. Abdellatif
N. Mhaisen
Z. Chkirbene
Amr M. Mohamed
A. Erbad
Mohsen Guizani
OffRL
AI4TS
65
23
0
05 Aug 2021
On the Robustness of Controlled Deep Reinforcement Learning for Slice Placement
José Jurandir Alves Esteves
Amina Boubendir
Fabrice Michel Guillemin
Pierre Sens
OOD
OffRL
37
5
0
05 Aug 2021
DRL-based Slice Placement Under Non-Stationary Conditions
José Jurandir Alves Esteves
Amina Boubendir
Fabrice Michel Guillemin
Pierre Sens
OffRL
31
6
0
05 Aug 2021
Active Reinforcement Learning over MDPs
Qi Yang
Peng Yang
K. Tang
89
0
0
05 Aug 2021
Parallelized Reverse Curriculum Generation
Zih-Yun Chiu
Yi-Lin Tuan
Hung-yi Lee
Li-Chen Fu
41
1
0
04 Aug 2021
Policy Gradients Incorporating the Future
David Venuto
Elaine Lau
Doina Precup
Ofir Nachum
OffRL
97
9
0
04 Aug 2021
High Performance Across Two Atari Paddle Games Using the Same Perceptual Control Architecture Without Training
T. Gulrez
W. Mansell
26
0
0
04 Aug 2021
Emergent Discrete Communication in Semantic Spaces
Mycal Tucker
Huao Li
Siddharth Agrawal
Dana Hughes
Katia Sycara
Michael Lewis
J. Shah
54
29
0
04 Aug 2021
Variational Actor-Critic Algorithms
Yuhua Zhu
Lexing Ying
OffRL
39
0
0
03 Aug 2021
Sequoia: A Software Framework to Unify Continual Learning Research
Fabrice Normandin
Florian Golemo
O. Ostapenko
Pau Rodríguez López
Matthew D Riemer
...
Dominic Zhao
Timothée Lesort
Laurent Charlin
Irina Rish
Massimo Caccia
CLL
105
21
0
02 Aug 2021
Risk Adversarial Learning System for Connected and Autonomous Vehicle Charging
M. S. Munir
Ki Tae Kim
K. Thar
Dusit Niyato
Choong Seon Hong
44
5
0
02 Aug 2021
Anomaly Detection with Neural Parsers That Never Reject
Alexander Grushin
Walt Woods
46
3
0
30 Jul 2021
Maximum Entropy Dueling Network Architecture in Atari Domain
Alireza Nadali
M. Ebadzadeh
45
0
0
30 Jul 2021
Survey of Recent Multi-Agent Reinforcement Learning Algorithms Utilizing Centralized Training
P. Sharma
Rolando Fernandez
Erin G. Zaroukian
M. Dorothy
Anjon Basak
Derrik E. Asher
72
42
0
29 Jul 2021
Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning
Pedro Tsividis
J. Loula
Jake Burga
Nathan Foss
Andres Campero
Thomas Pouncy
S. Gershman
J. Tenenbaum
LM&Ro
59
48
0
27 Jul 2021
Hindsight Value Function for Variance Reduction in Stochastic Dynamic Environment
Jiaming Guo
Rui Zhang
Xishan Zhang
Shaohui Peng
Qiaomin Yi
Zidong Du
Xing Hu
Qi Guo
Yunji Chen
59
7
0
26 Jul 2021
Predicting Game Engagement and Difficulty Using AI Players
Shaghayegh Roohi
Christian Guckelsberger
Asko Relas
Henri Heiskanen
Jari Takatalo
Perttu Hämäläinen
54
15
0
26 Jul 2021
The Impact of Negative Sampling on Contrastive Structured World Models
Ondrej Biza
Elise van der Pol
Thomas Kipf
DRL
OffRL
70
2
0
24 Jul 2021
A general sample complexity analysis of vanilla policy gradient
Rui Yuan
Robert Mansel Gower
A. Lazaric
138
64
0
23 Jul 2021
Previous
1
2
3
...
30
31
32
...
70
71
72
Next