Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,592 papers shown
Title
Podracer architectures for scalable Reinforcement Learning
Matteo Hessel
M. Kroiss
Aidan Clark
Iurii Kemaev
John Quan
Thomas Keck
Fabio Viola
H. V. Hasselt
76
39
0
13 Apr 2021
Muesli: Combining Improvements in Policy Optimization
Matteo Hessel
Ivo Danihelka
Fabio Viola
A. Guez
Simon Schmitt
Laurent Sifre
T. Weber
David Silver
H. V. Hasselt
113
66
0
13 Apr 2021
Two-stage training algorithm for AI robot soccer
Taeyoung Kim
L. Vecchietti
Kyujin Choi
Sanem Sariel
Dongsoo Har
120
7
0
13 Apr 2021
Bi-level Off-policy Reinforcement Learning for Volt/VAR Control Involving Continuous and Discrete Devices
Haotian Liu
Wenchuan Wu
OffRL
34
7
0
13 Apr 2021
A coevolutionary approach to deep multi-agent reinforcement learning
Daan Klijn
A. E. Eiben
58
8
0
12 Apr 2021
Survey on reinforcement learning for language processing
Víctor Uc Cetina
Nicolás Navarro-Guerrero
A. Martín-González
C. Weber
S. Wermter
OffRL
97
111
0
12 Apr 2021
A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control
Zahra Gharaee
Karl Holmquist
Linbo He
Michael Felsberg
BDL
51
4
0
08 Apr 2021
A Reinforcement Learning Environment For Job-Shop Scheduling
Pierre Tassel
Martin Gebser
Konstantin Schekotihin
OffRL
58
50
0
08 Apr 2021
Data-Driven Simulation of Ride-Hailing Services using Imitation and Reinforcement Learning
H. Jayasinghe
Tarindu Jayatilaka
Ravin Gunawardena
Uthayasanker Thayasivam
144
0
0
06 Apr 2021
Progressive extension of reinforcement learning action dimension for asymmetric assembly tasks
Yuhang Gai
Jiuming Guo
Dan Wu
Ken Chen
37
0
0
06 Apr 2021
Probabilistic Programming Bots in Intuitive Physics Game Play
Fahad Alhasoun
Sarah Alnegheimish
J. Tenenbaum
39
1
0
05 Apr 2021
Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation
Emilio Parisotto
Ruslan Salakhutdinov
108
46
0
04 Apr 2021
A Dynamics Perspective of Pursuit-Evasion Games of Intelligent Agents with the Ability to Learn
Hao Xiong
Huanhui Cao
Lin Zhang
Wenjie Lu
28
3
0
03 Apr 2021
Touch-based Curiosity for Sparse-Reward Tasks
Sai Rajeswar
Cyril Ibrahim
Nitin Surya
Florian Golemo
David Vazquez
Rameswar Panda
Pedro H. O. Pinheiro
67
6
0
01 Apr 2021
Storchastic: A Framework for General Stochastic Automatic Differentiation
Emile van Krieken
Jakub M. Tomczak
A. T. Teije
ODL
OffRL
103
16
0
01 Apr 2021
Bounding the Inefficiency of Route Control in Intelligent Transport Systems
Charlotte Roman
P. Turrini
16
0
0
01 Apr 2021
SOON: Scenario Oriented Object Navigation with Graph-based Exploration
Fengda Zhu
Xiwen Liang
Yi Zhu
Xiaojun Chang
Xiaodan Liang
79
127
0
31 Mar 2021
Solving Heterogeneous General Equilibrium Economic Models with Deep Reinforcement Learning
Edward W. Hill
M. Bardoscia
A. Turrell
67
26
0
31 Mar 2021
Visual Room Rearrangement
Luca Weihs
Matt Deitke
Aniruddha Kembhavi
Roozbeh Mottaghi
LM&Ro
79
139
0
30 Mar 2021
Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
Florian Laurent
Manuel Schneider
Christian Scheller
J. Watson
Jiaoyang Li
...
Nilabha Bhattacharya
Shivam Agarwal
A. Egli
Erik Nygren
Sharada Mohanty
77
29
0
30 Mar 2021
Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved Complexity
Shaocong Ma
Ziyi Chen
Yi Zhou
Shaofeng Zou
62
11
0
30 Mar 2021
Co-Imitation Learning without Expert Demonstration
Hai-Jian Ke
Hu Xu
Kun Zhu
Sheng-Jun Huang
OffRL
57
3
0
27 Mar 2021
Composable Learning with Sparse Kernel Representations
Ekaterina V. Tolstaya
Ethan Stump
Alec Koppel
Alejandro Ribeiro
44
0
0
26 Mar 2021
Hierarchical Program-Triggered Reinforcement Learning Agents For Automated Driving
Briti Gangopadhyay
Harshit Soora
P. Dasgupta
70
35
0
25 Mar 2021
The Gradient Convergence Bound of Federated Multi-Agent Reinforcement Learning with Efficient Communication
Xing Xu
Rongpeng Li
Zhifeng Zhao
Honggang Zhang
86
12
0
24 Mar 2021
Automated and Autonomous Experiment in Electron and Scanning Probe Microscopy
Sergei V. Kalinin
M. Ziatdinov
Jacob D. Hinkle
S. Jesse
Ayana Ghosh
K. Kelley
A. Lupini
B. Sumpter
Rama K Vasudevan
81
3
0
22 Mar 2021
Improving Actor-Critic Reinforcement Learning via Hamiltonian Monte Carlo Method
Duo Xu
Faramarz Fekri
67
8
0
22 Mar 2021
Provably Correct Optimization and Exploration with Non-linear Policies
Fei Feng
W. Yin
Alekh Agarwal
Lin F. Yang
156
13
0
22 Mar 2021
MaAST: Map Attention with Semantic Transformersfor Efficient Visual Navigation
Zachary Seymour
Kowshik Thopalli
Niluthpol Chowdhury Mithun
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
3DPC
69
18
0
21 Mar 2021
Bayesian Distributional Policy Gradients
Luchen Li
A. Faisal
BDL
OffRL
69
9
0
20 Mar 2021
Local Patch AutoAugment with Multi-Agent Collaboration
Shiqi Lin
Tao Yu
Ruoyu Feng
Xin Li
Xin Jin
Zhibo Chen
51
16
0
20 Mar 2021
Reward Signal Design for Autonomous Racing
Benjamin Evans
H. Engelbrecht
H. W. Jordaan
43
6
0
18 Mar 2021
Weakly Supervised Reinforcement Learning for Autonomous Highway Driving via Virtual Safety Cages
Sampo Kuutti
Richard Bowden
Saber Fallah
81
14
0
17 Mar 2021
Decentralized Reinforcement Learning for Multi-Target Search and Detection by a Team of Drones
Roi Yehoshua
Juan Heredia Juesas
Yushu Wu
Chris Amato
J. Martinez-Lorenzo
31
2
0
17 Mar 2021
Self-Organizing mmWave MIMO Cell-Free Networks With Hybrid Beamforming: A Hierarchical DRL-Based Design
Yasser F. Al-Eryani
Ekram Hossain
18
27
0
17 Mar 2021
Goal-constrained Sparse Reinforcement Learning for End-to-End Driving
Pranav Agarwal
Pierre de Beaucorps
Raoul de Charette
63
3
0
16 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
52
17
0
15 Mar 2021
A Quadratic Actor Network for Model-Free Reinforcement Learning
Matthias Weissenbacher
Yoshinobu Kawahara
30
0
0
11 Mar 2021
Hard Attention Control By Mutual Information Maximization
Himanshu Sahni
Charles Isbell
33
0
0
10 Mar 2021
Iterative Shrinking for Referring Expression Grounding Using Deep Reinforcement Learning
Mingjie Sun
Jimin Xiao
Eng Gee Lim
ObjD
86
35
0
09 Mar 2021
Model-free Policy Learning with Reward Gradients
Qingfeng Lan
Samuele Tosatto
Homayoon Farrahi
Rupam Mahmood
51
6
0
09 Mar 2021
A multi-agent reinforcement learning model of reputation and cooperation in human groups
Kevin R. McKee
Edward Hughes
Tina Zhu
Martin Chadwick
Raphael Köster
Antonio García Castañeda
Charlie Beattie
T. Graepel
M. Botvinick
Joel Z Leibo
75
9
0
08 Mar 2021
Self-Supervised Online Reward Shaping in Sparse-Reward Environments
F. Memarian
Wonjoon Goo
Rudolf Lioutikov
S. Niekum
Ufuk Topcu
OffRL
110
49
0
08 Mar 2021
MetaView: Few-shot Active Object Recognition
Wei Wei
Haonan Yu
Haichao Zhang
Wenyuan Xu
Ying Nian Wu
85
4
0
07 Mar 2021
Visual Explanation using Attention Mechanism in Actor-Critic-based Deep Reinforcement Learning
Hidenori Itaya
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
K. Sugiura
78
19
0
06 Mar 2021
Deep reinforcement learning in medical imaging: A literature review
S. Kevin Zhou
Hoang Ngan Le
Khoa Luu
Hien V Nguyen
N. Ayache
LM&MA
OffRL
MedIm
86
149
0
05 Mar 2021
Routing algorithms as tools for integrating social distancing with emergency evacuation
Yi-Lin Tsai
Chetanya Rastogi
P. Kitanidis
C. Field
56
10
0
05 Mar 2021
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Chong Chen
Yaodong Yang
Lu Zhang
Wulong Liu
Zhaopeng Meng
OffRL
138
4
0
03 Mar 2021
Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph
Xin Ye
Yezhou Yang
104
25
0
01 Mar 2021
Decision Making in Monopoly using a Hybrid Deep Reinforcement Learning Approach
Trevor Bonjour
Marina Haliem
A. Alsalem
Shilpa Thomas
Hongyu Li
Vaneet Aggarwal
Mayank Kejriwal
Bharat K. Bhargava
108
15
0
01 Mar 2021
Previous
1
2
3
...
34
35
36
...
70
71
72
Next