Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
An Information-theoretic On-line Learning Principle for Specialization in Hierarchical Decision-Making Systems
Heinke Hihn
Sebastian Gottwald
Daniel A. Braun
97
16
0
26 Jul 2019
Action Guidance with MCTS for Deep Reinforcement Learning
Bilal Kartal
Pablo Hernandez-Leal
Matthew E. Taylor
56
18
0
25 Jul 2019
Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning
Bilal Kartal
Pablo Hernandez-Leal
Matthew E. Taylor
173
29
0
24 Jul 2019
Modeling question asking using neural program generation
ZiYun Wang
Brenden M. Lake
64
7
0
23 Jul 2019
Variance Reduction in Actor Critic Methods (ACM)
Eric Benhamou
OffRL
57
4
0
23 Jul 2019
Metalearned Neural Memory
Tsendsuren Munkhdalai
Alessandro Sordoni
Tong Wang
Adam Trischler
KELM
63
62
0
23 Jul 2019
Agent Modeling as Auxiliary Task for Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
45
51
0
22 Jul 2019
Deep Reinforcement Learning for Clinical Decision Support: A Brief Survey
Siqi Liu
K. Ngiam
Mengling Feng
LM&MA
OffRL
49
19
0
22 Jul 2019
VRLS: A Unified Reinforcement Learning Scheduler for Vehicle-to-Vehicle Communications
T. Şahin
R. Khalili
Mate Boban
A. Wolisz
31
7
0
22 Jul 2019
Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges
Lei Lei
Yue Tan
Kan Zheng
Shiwen Liu
K. Zheng
Xuemin Shen
Shen
OffRL
89
205
0
22 Jul 2019
Characterizing Attacks on Deep Reinforcement Learning
Xinlei Pan
Chaowei Xiao
Warren He
Shuang Yang
Jian Peng
...
Jinfeng Yi
Zijiang Yang
Mingyan D. Liu
Yue Liu
Basel Alomair
AAML
104
70
0
21 Jul 2019
Potential-Based Advice for Stochastic Policy Learning
Baicen Xiao
Bhaskar Ramasubramanian
Andrew Clark
Hannaneh Hajishirzi
L. Bushnell
Radha Poovendran
OffRL
29
5
0
20 Jul 2019
An Actor-Critic-Attention Mechanism for Deep Reinforcement Learning in Multi-view Environments
Elaheh Barati
Xuewen Chen
69
13
0
19 Jul 2019
Accelerating Reinforcement Learning through GPU Atari Emulation
Steven Dalton
I. Frosio
M. Garland
ELM
58
9
0
19 Jul 2019
Convergence of Edge Computing and Deep Learning: A Comprehensive Survey
Xiaofei Wang
Yiwen Han
Victor C. M. Leung
Dusit Niyato
Xueqiang Yan
Xu Chen
104
1,006
0
19 Jul 2019
Convolutional Reservoir Computing for World Models
Hanten Chang
K. Futagami
50
4
0
18 Jul 2019
Prioritized Guidance for Efficient Multi-Agent Reinforcement Learning Exploration
Qisheng Wang
Qichao Wang
29
1
0
18 Jul 2019
OmniNet: A unified architecture for multi-modal multi-task learning
Subhojeet Pramanik
Priyanka Agrawal
A. Hussain
70
41
0
17 Jul 2019
Federated Reinforcement Distillation with Proxy Experience Memory
Han Cha
Jihong Park
Hyesung Kim
Seong-Lyun Kim
M. Bennis
92
16
0
15 Jul 2019
Proximal Policy Optimization with Mixed Distributed Training
Zhenyu Zhang
Xiangfeng Luo
Tong Liu
Shaorong Xie
Jianshu Wang
Wei Wang
Yongbin Li
Yan Peng
OffRL
41
21
0
15 Jul 2019
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
Zhuoran Yang
Yongxin Chen
Mingyi Hong
Zhaoran Wang
118
40
0
14 Jul 2019
Provably Efficient Reinforcement Learning with Linear Function Approximation
Chi Jin
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
113
561
0
11 Jul 2019
An Optimistic Perspective on Offline Reinforcement Learning
Rishabh Agarwal
Dale Schuurmans
Mohammad Norouzi
OffRL
OnRL
113
70
0
10 Jul 2019
DOB-Net: Actively Rejecting Unknown Excessive Time-Varying Disturbances
Tianming Wang
Wenjie Lu
Zheng Yan
Dikai Liu
59
4
0
10 Jul 2019
Neural Input Search for Large Scale Recommendation Models
Manas R. Joglekar
Cong Li
Jay K. Adams
Pranav Khaitan
Quoc V. Le
59
116
0
10 Jul 2019
Graph Policy Gradients for Large Scale Robot Control
Arbaaz Khan
Ekaterina V. Tolstaya
Alejandro Ribeiro
Vijay Kumar
71
93
0
08 Jul 2019
Deep Learning based Wireless Resource Allocation with Application to Vehicular Networks
Le Liang
Hao Ye
Guanding Yu
Geoffrey Ye Li
78
200
0
07 Jul 2019
A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms
Oliver Kroemer
S. Niekum
George Konidaris
153
369
0
06 Jul 2019
Playing Flappy Bird via Asynchronous Advantage Actor Critic Algorithm
E. Alp
M. Güzel
13
1
0
06 Jul 2019
Dependency-aware Attention Control for Unconstrained Face Recognition with Image Sets
Xiaofeng Liu
B. Kumar
Chao Yang
Qingming Tang
J. You
CVBM
95
42
0
05 Jul 2019
Attentive Multi-Task Deep Reinforcement Learning
Timo Bram
Gino Brunner
Oliver Richter
Roger Wattenhofer
CLL
146
18
0
05 Jul 2019
Interactive-Predictive Neural Machine Translation through Reinforcement and Imitation
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
AI4CE
67
18
0
04 Jul 2019
Integration of Imitation Learning using GAIL and Reinforcement Learning using Task-achievement Rewards via Probabilistic Graphical Model
Akira Kinose
T. Taniguchi
100
20
0
03 Jul 2019
Benchmarking Model-Based Reinforcement Learning
Tingwu Wang
Xuchan Bao
I. Clavera
Jerrick Hoang
Yeming Wen
Eric D. Langlois
Matthew Shunshi Zhang
Guodong Zhang
Pieter Abbeel
Jimmy Ba
OffRL
114
365
0
03 Jul 2019
On the Weaknesses of Reinforcement Learning for Neural Machine Translation
Leshem Choshen
Lior Fox
Zohar Aizenbud
Omri Abend
133
110
0
03 Jul 2019
Generalizing from a few environments in safety-critical reinforcement learning
Zachary Kenton
Angelos Filos
Owain Evans
Y. Gal
87
16
0
02 Jul 2019
Modified Actor-Critics
Erinc Merdivan
S. Hanke
Matthieu Geist
45
2
0
02 Jul 2019
Dynamic Face Video Segmentation via Reinforcement Learning
Yujiang Wang
Mingzhi Dong
Jie Shen
Yang Wu
Shiyang Cheng
Maja Pantic
CVBM
77
22
0
02 Jul 2019
MULEX: Disentangling Exploitation from Exploration in Deep RL
Lucas Beyer
Damien Vincent
O. Teboul
Sylvain Gelly
Matthieu Geist
Olivier Pietquin
50
14
0
01 Jul 2019
Learning World Graphs to Accelerate Hierarchical Reinforcement Learning
Wenling Shang
Alexander R. Trott
Stephan Zheng
Caiming Xiong
R. Socher
92
18
0
01 Jul 2019
FiDi-RL: Incorporating Deep Reinforcement Learning with Finite-Difference Policy Search for Efficient Learning of Continuous Control
Longxiang Shi
Shijian Li
LongBing Cao
Long Yang
Gang Zheng
Gang Pan
26
5
0
01 Jul 2019
Variational Quantum Circuits for Deep Reinforcement Learning
Samuel Yen-Chi Chen
Chao-Han Huck Yang
Jun Qi
Pin-Yu Chen
Xiaoli Ma
H. Goan
110
314
0
30 Jun 2019
Growing Action Spaces
Gregory Farquhar
Laura Gustafson
Zeming Lin
Shimon Whiteson
Nicolas Usunier
Gabriel Synnaeve
77
38
0
28 Jun 2019
Supervise Thyself: Examining Self-Supervised Representations in Interactive Environments
Evan Racah
C. Pal
SSL
108
2
0
27 Jun 2019
Learning Policies through Quantile Regression
Oliver Richter
Roger Wattenhofer
51
0
0
27 Jun 2019
Toward Simulating Environments in Reinforcement Learning Based Recommendations
Xiangyu Zhao
Long Xia
Zhuoye Ding
D. Yin
Jiliang Tang
82
25
0
27 Jun 2019
Generalization to Novel Objects using Prior Relational Knowledge
V. Vijay
Abhinav Ganesh
Hanlin Tang
Arjun K. Bansal
GNN
52
6
0
26 Jun 2019
Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives
Anirudh Goyal
Shagun Sodhani
Jonathan Binas
Xue Bin Peng
Sergey Levine
Yoshua Bengio
91
49
0
25 Jun 2019
Policy Optimization with Stochastic Mirror Descent
Long Yang
Yu Zhang
Gang Zheng
Qian Zheng
Pengfei Li
Jianhang Huang
Jun Wen
Gang Pan
128
34
0
25 Jun 2019
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy
Boyi Liu
Qi Cai
Zhuoran Yang
Zhaoran Wang
103
111
0
25 Jun 2019
Previous
1
2
3
...
52
53
54
...
70
71
72
Next