1602.01783
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation
Bingqian Lin
Yi Zhu
Yanxin Long
Xiaodan Liang
QiXiang Ye
Liang Lin
AAML
89
16
0
23 Jul 2021
Structured second-order methods via natural gradient descent
Wu Lin
Frank Nielsen
Mohammad Emtiyaz Khan
Mark Schmidt
ODL
60
10
0
22 Jul 2021
DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation
Wentao Bao
Qi Yu
Yu Kong
FAtt
75
41
0
21 Jul 2021
MarsExplorer: Exploration of Unknown Terrains via Deep Reinforcement Learning and Procedurally Generated Environments
Dimitrios I. Koutras
Athanasios Ch. Kapoutsis
A. Amanatiadis
Elias B. Kosmatopoulos
59
10
0
21 Jul 2021
Proximal Policy Optimization for Tracking Control Exploiting Future Reference Information
Jana Mayer
Johannes Westermann
Juan Pedro Gutiérrez H. Muriedas
Uwe Mettin
A. Lampe
OffRL
23
0
0
20 Jul 2021
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning
Denis Yarats
Rob Fergus
A. Lazaric
Lerrel Pinto
OffRL
133
353
0
20 Jul 2021
Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration
Lukas Schäfer
Filippos Christianos
Josiah P. Hanna
Stefano V. Albrecht
92
23
0
19 Jul 2021
Structured World Belief for Reinforcement Learning in POMDP
Gautam Singh
Skand Peri
Junghyun Kim
Hyunseok Kim
Sungjin Ahn
OCL
81
28
0
19 Jul 2021
Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
Alan Chan
Hugo Silva
Sungsu Lim
Tadashi Kozuno
A. R. Mahmood
Martha White
90
31
0
17 Jul 2021
High-Accuracy Model-Based Reinforcement Learning, a Survey
Aske Plaat
W. Kosters
Mike Preuss
OffRL
75
37
0
17 Jul 2021
Neighbor-view Enhanced Model for Vision and Language Navigation
Dongyan An
Yuankai Qi
Yan Huang
Qi Wu
Liang Wang
Tieniu Tan
LM&Ro
82
71
0
15 Jul 2021
NeuSaver: Neural Adaptive Power Consumption Optimization for Mobile Video Streaming
Kyoungjun Park
Myungchul Kim
Laihyuk Park
21
3
0
15 Jul 2021
The Benchmark Lottery
Mostafa Dehghani
Yi Tay
A. Gritsenko
Zhe Zhao
N. Houlsby
Fernando Diaz
Donald Metzler
Oriol Vinyals
117
92
0
14 Jul 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Joel Z Leibo
Edgar A. Duéñez-Guzmán
A. Vezhnevets
J. Agapiou
P. Sunehag
Raphael Köster
Jayd Matyas
Charlie Beattie
Igor Mordatch
T. Graepel
OffRL
93
111
0
14 Jul 2021
ReLLIE: Deep Reinforcement Learning for Customized Low-Light Image Enhancement
Rongkai Zhang
Lanqing Guo
Siyu Huang
Bihan Wen
OffRL
74
51
0
13 Jul 2021
Cautious Policy Programming: Exploiting KL Regularization in Monotonic Policy Improvement for Reinforcement Learning
Lingwei Zhu
Toshinori Kitamura
Takamitsu Matsubara
OffRL
46
1
0
13 Jul 2021
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
106
85
0
12 Jul 2021
R3L: Connecting Deep Reinforcement Learning to Recurrent Neural Networks for Image Denoising via Residual Recovery
Rongkai Zhang
Jiang Zhu
Zhiyuan Zha
Justin Dauwels
Bihan Wen
75
6
0
12 Jul 2021
Stabilizing Neural Control Using Self-Learned Almost Lyapunov Critics
Ya-Chien Chang
Sicun Gao
94
59
0
11 Jul 2021
Coordinate-wise Control Variates for Deep Policy Gradients
Yuanyi Zhong
Yuanshuo Zhou
Jian-wei Peng
BDL
88
1
0
11 Jul 2021
ARC: Adversarially Robust Control Policies for Autonomous Vehicles
Sampo Kuutti
Saber Fallah
Richard Bowden
AAML
65
5
0
09 Jul 2021
Intelligent Link Adaptation for Grant-Free Access Cellular Networks: A Distributed Deep Reinforcement Learning Approach
Joao V. C. Evangelista
Zeeshan Sattar
Georges Kaddoum
Bassant Selim
Aydin Sarraf
27
2
0
08 Jul 2021
RMA: Rapid Motor Adaptation for Legged Robots
Ashish Kumar
Zipeng Fu
Deepak Pathak
Jitendra Malik
186
584
0
08 Jul 2021
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning
Yuexiang Zhai
Christina Baek
Zhengyuan Zhou
Jiantao Jiao
Yi-An Ma
85
23
0
08 Jul 2021
Analytically Tractable Hidden-States Inference in Bayesian Neural Networks
L. Nguyen
J. Goulet
BDL
35
6
0
08 Jul 2021
Deep Learning for Embodied Vision Navigation: A Survey
Fengda Zhu
Yi Zhu
Vincent CS Lee
Xiaodan Liang
Xiaojun Chang
EgoV
LM&Ro
101
0
0
07 Jul 2021
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning
Erdun Gao
Fan Feng
Chaochao Lu
Sara Magliacane
Kun Zhang
106
69
0
06 Jul 2021
Effects of Smart Traffic Signal Control on Air Quality
P. Fazzini
M. Torre
V. Rizza
F. Petracchini
18
4
0
06 Jul 2021
Collaborative Visual Navigation
Haiyang Wang
Wenguan Wang
Xizhou Zhu
Jifeng Dai
Liwei Wang
EgoV
100
20
0
02 Jul 2021
User Role Discovery and Optimization Method based on K-means + Reinforcement learning in Mobile Applications
Yuanbang Li
OffRL
19
2
0
02 Jul 2021
Reinforcement Learning for Feedback-Enabled Cyber Resilience
Yunhan Huang
Linan Huang
Quanyan Zhu
99
71
0
02 Jul 2021
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Elliot Chane-Sane
Cordelia Schmid
Ivan Laptev
87
144
0
01 Jul 2021
Applications of the Free Energy Principle to Machine Learning and Neuroscience
Beren Millidge
DRL
117
8
0
30 Jun 2021
Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in Edge Industrial IoT
Wanlu Lei
Yu Ye
Ming Xiao
Mikael Skoglund
Zhu Han
45
1
0
30 Jun 2021
Understanding Adversarial Attacks on Observations in Deep Reinforcement Learning
You Qiaoben
Chengyang Ying
Xinning Zhou
Hang Su
Jun Zhu
Bo Zhang
AAML
109
17
0
30 Jun 2021
Deep Multiagent Reinforcement Learning: Challenges and Directions
Annie Wong
Thomas Bäck
Anna V. Kononova
Aske Plaat
AI4CE
116
97
0
29 Jun 2021
Globally Optimal Hierarchical Reinforcement Learning for Linearly-Solvable Markov Decision Processes
Guillermo Infante
Anders Jonsson
Vicenç Gómez
23
7
0
29 Jun 2021
Policy Regularization via Noisy Advantage Values for Cooperative Multi-agent Actor-Critic methods
Jian Hu
Siyue Hu
Shih-Wei Liao
118
15
0
27 Jun 2021
Graph Convolutional Memory using Topological Priors
Steven D. Morad
Stephan Liwicki
Ryan Kortvelesy
R. Mecca
Amanda Prorok
27
0
0
27 Jun 2021
Core Challenges in Embodied Vision-Language Planning
Jonathan M Francis
Nariaki Kitamura
Felix Labelle
Xiaopeng Lu
Ingrid Navarro
Jean Oh
LM&Ro
144
48
0
26 Jun 2021
A nonlinear hidden layer enables actor-critic agents to learn multiple paired association navigation
M Ganesh Kumar
Cheston Tan
C. Libedinsky
S. Yen
A. Tan
44
5
0
25 Jun 2021
Mix and Mask Actor-Critic Methods
Dom Huh
29
1
0
24 Jun 2021
Policy Smoothing for Provably Robust Reinforcement Learning
Aounon Kumar
Alexander Levine
Soheil Feizi
AAML
127
59
0
21 Jun 2021
Distributed Heuristic Multi-Agent Path Finding with Communication
Ziyuan Ma
Yudong Luo
Hang Ma
79
72
0
21 Jun 2021
Analytically Tractable Bayesian Deep Q-Learning
Luong Ha
L. Nguyen
J. Goulet
BDL
OffRL
35
2
0
21 Jun 2021
Learning to Reach, Swim, Walk and Fly in One Trial: Data-Driven Control with Scarce Data and Side Information
Franck Djeumou
Ufuk Topcu
43
4
0
19 Jun 2021
A Condense-then-Select Strategy for Text Summarization
Hou Pong Chan
Irwin King
45
13
0
19 Jun 2021
Prediction-Free, Real-Time Flexible Control of Tidal Lagoons through Proximal Policy Optimisation: A Case Study for the Swansea Lagoon
Túlio Marcondes Moreira
Jackson Geraldo de Faria
Pedro O. S. Vaz de Melo
Luiz Chaimowicz
G. Medeiros-Ribeiro
35
10
0
18 Jun 2021
MADE: Exploration via Maximizing Deviation from Explored Regions
Tianjun Zhang
Paria Rashidinejad
Jiantao Jiao
Yuandong Tian
Joseph E. Gonzalez
Stuart J. Russell
OffRL
96
44
0
18 Jun 2021
Towards Distraction-Robust Active Visual Tracking
Fangwei Zhong
Peng Sun
Wenhan Luo
Tingyun Yan
Yizhou Wang
AAML
55
38
0
18 Jun 2021