1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
The Actor-Advisor: Policy Gradient With Off-Policy Advice
Hélène Plisnier
Denis Steckelmacher
D. Roijers
A. Nowé
CML
OffRL
27
6
0
07 Feb 2019
Decentralized Multi-Agents by Imitation of a Centralized Controller
A. Lin
Mark J. Debord
Katia Estabridis
G. Hewer
Guido Montufar
Stanley Osher
65
6
0
06 Feb 2019
Distilling Policy Distillation
Wojciech M. Czarnecki
Razvan Pascanu
Simon Osindero
Siddhant M. Jayakumar
G. Swirszcz
Max Jaderberg
85
134
0
06 Feb 2019
Neural Fictitious Self-Play on ELF Mini-RTS
Keigo Kawamura
Yoshimasa Tsuruoka
59
7
0
06 Feb 2019
Separating value functions across time-scales
Joshua Romoff
Peter Henderson
Ahmed Touati
Emma Brunskill
Joelle Pineau
Yann Ollivier
87
25
0
05 Feb 2019
Learning to Schedule Communication in Multi-agent Reinforcement Learning
Daewoo Kim
Sang-chul Moon
D. Hostallero
Wan Ju Kang
Taeyoung Lee
Kyunghwan Son
Yung Yi
80
208
0
05 Feb 2019
Embodied Multimodal Multitask Learning
Devendra Singh Chaplot
Lisa Lee
Ruslan Salakhutdinov
Devi Parikh
Dhruv Batra
LM&Ro
96
24
0
04 Feb 2019
Obstacle Tower: A Generalization Challenge in Vision, Control, and Planning
Arthur Juliani
Ahmed Khalifa
Vincent-Pierre Berges
Jonathan Harper
Ervin Teng
Hunter Henry
A. Crespi
Julian Togelius
Danny Lange
87
144
0
04 Feb 2019
Incremental Learning with Maximum Entropy Regularization: Rethinking Forgetting and Intransigence
Dahyun Kim
Jihwan Bae
Yeonsik Jo
Jonghyun Choi
OOD
CLL
75
20
0
03 Feb 2019
Certified Reinforcement Learning with Logic Guidance
Mohammadhosein Hasanbeig
Daniel Kroening
Alessandro Abate
127
57
0
02 Feb 2019
Visual Rationalizations in Deep Reinforcement Learning for Atari Games
L. Weitkamp
Elise van der Pol
Zeynep Akata
84
27
0
01 Feb 2019
Competitive Experience Replay
Hao Liu
Alexander R. Trott
R. Socher
Caiming Xiong
OffRL
128
53
0
01 Feb 2019
The Hanabi Challenge: A New Frontier for AI Research
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
...
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
126
355
0
01 Feb 2019
TF-Replicator: Distributed Machine Learning for Researchers
P. Buchlovsky
David Budden
Dominik Grewe
Chris Jones
John Aslanides
...
Aidan Clark
Sergio Gomez Colmenarejo
Aedan Pope
Fabio Viola
Dan Belov
GNN
OffRL
AI4CE
81
20
0
01 Feb 2019
Tsallis Reinforcement Learning: A Unified Framework for Maximum Entropy Reinforcement Learning
Kyungjae Lee
Sungyub Kim
Sungbin Lim
Sungjoon Choi
Songhwai Oh
150
28
0
31 Jan 2019
A Theory of Regularized Markov Decision Processes
Matthieu Geist
B. Scherrer
Olivier Pietquin
147
333
0
31 Jan 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
134
370
0
30 Jan 2019
Benchmarking Classic and Learned Navigation in Complex 3D Environments
Dmytro Mishkin
Alexey Dosovitskiy
V. Koltun
137
75
0
30 Jan 2019
InfoBot: Transfer and Exploration via the Information Bottleneck
Anirudh Goyal
Riashat Islam
Daniel Strouse
Zafarali Ahmed
M. Botvinick
Hugo Larochelle
Yoshua Bengio
Sergey Levine
OffRL
131
167
0
30 Jan 2019
Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning
Casey Chu
Jose H. Blanchet
Peter Glynn
GAN
75
26
0
30 Jan 2019
Privacy-preserving Q-Learning with Functional Noise in Continuous State Spaces
Baoxiang Wang
N. Hegde
88
65
0
30 Jan 2019
Trust Region-Guided Proximal Policy Optimization
Yuhui Wang
Hao He
Xiaoyang Tan
Yaozhong Gan
OffRL
89
57
0
29 Jan 2019
Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks
Dongqi Han
Kenji Doya
Jun Tani
AI4CE
126
20
0
29 Jan 2019
A Regulation Enforcement Solution for Multi-agent Reinforcement Learning
Fan-Yun Sun
Yen-Yu Chang
Yueh-hua Wu
Shou-De Lin
25
2
0
29 Jan 2019
Making Deep Q-learning methods robust to time discretization
Corentin Tallec
Léonard Blier
Yann Ollivier
OOD
OffRL
67
91
0
28 Jan 2019
Q-learning with UCB Exploration is Sample Efficient for Infinite-Horizon MDP
Kefan Dong
Yuanhao Wang
Xiaoyu Chen
Liwei Wang
OffRL
81
97
0
27 Jan 2019
Model-based Deep Reinforcement Learning for Dynamic Portfolio Optimization
Pengqian Yu
J. Lee
Ilya Kulyatin
Zekun Shi
Sakyasingha Dasgupta
74
64
0
25 Jan 2019
Ablation Studies in Artificial Neural Networks
Richard Meyes
Melanie Lu
Constantin Waubert de Puiseau
Tobias Meisen
69
218
0
24 Jan 2019
Distributed Learning of Decentralized Control Policies for Articulated Mobile Robots
Guillaume Sartoretti
William Paivine
Yunfei Shi
Yue Wu
Howie Choset
54
55
0
24 Jan 2019
Never Forget: Balancing Exploration and Exploitation via Learning Optical Flow
Hsuan-Kung Yang
Po-Han Chiang
Kuan-Wei Ho
Min-Fong Hong
Chun-Yi Lee
45
7
0
24 Jan 2019
Combinational Q-Learning for Dou Di Zhu
Yang You
Liangwei Li
B. Guo
Weiming Wang
Cewu Lu
OffRL
61
13
0
24 Jan 2019
Causal Reasoning from Meta-reinforcement Learning
Ishita Dasgupta
Jane X. Wang
Silvia Chiappa
Jovana Mitrović
Pedro A. Ortega
David Raposo
Edward Hughes
Peter W. Battaglia
M. Botvinick
Z. Kurth-Nelson
CML
LRM
79
122
0
23 Jan 2019
Machine Learning for Wireless Communications in the Internet of Things: A Comprehensive Survey
Jithin Jagannath
Nicholas Polosky
Anu Jagannath
Francesco Restuccia
Tommaso Melodia
106
232
0
23 Jan 2019
Trust Region Value Optimization using Kalman Filtering
Shirli Di-Castro Shashua
Shie Mannor
61
8
0
23 Jan 2019
Towards Non-saturating Recurrent Units for Modelling Long-term Dependencies
A. Chandar
Chinnadhurai Sankar
Eugene Vorontsov
Samira Ebrahimi Kahou
Yoshua Bengio
101
56
0
22 Jan 2019
Towards Physically Safe Reinforcement Learning under Supervision
Yinan Zhang
Devin J. Balkcom
Haoxiang Li
OffRL
17
4
0
19 Jan 2019
Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems
Boyi Liu
Lujia Wang
Ming-Yuan Liu
98
253
0
19 Jan 2019
On-Policy Trust Region Policy Optimisation with Replay Buffers
D. Kangin
N. Pugeault
OffRL
23
3
0
18 Jan 2019
Amplifying the Imitation Effect for Reinforcement Learning of UCAV's Mission Execution
G. Lee
Chang Ouk Kim
33
4
0
17 Jan 2019
Learning Autonomous Exploration and Mapping with Semantic Vision
Xiangyang Zhi
Xuming He
Sören Schwertfeger
128
9
0
15 Jan 2019
AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning
Ameer Haj-Ali
Qijing Huang
William S. Moses
J. Xiang
Ion Stoica
Krste Asanović
J. Wawrzynek
52
36
0
15 Jan 2019
A Deep Recurrent Q Network towards Self-adapting Distributed Microservices architecture
Basel Magableh
60
16
0
13 Jan 2019
Neural network gradient-based learning of black-box function interfaces
Alon Jacovi
Guy Hadash
Einat Kermany
Boaz Carmeli
Ofer Lavi
George Kour
Jonathan Berant
48
13
0
13 Jan 2019
An investigation of model-free planning
A. Guez
Mehdi Mirza
Karol Gregor
Rishabh Kabra
S. Racanière
...
Laurent Orseau
Tom Eccles
Greg Wayne
David Silver
Timothy Lillicrap
OffRL
106
117
0
11 Jan 2019
Motion Perception in Reinforcement Learning with Dynamic Objects
Artemij Amiranashvili
Alexey Dosovitskiy
V. Koltun
Thomas Brox
74
35
0
10 Jan 2019
Model-Predictive Policy Learning with Uncertainty Regularization for Driving in Dense Traffic
Mikael Henaff
A. Canziani
Yann LeCun
OOD
118
123
0
08 Jan 2019
Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions
Rui Wang
Joel Lehman
Jeff Clune
Kenneth O. Stanley
131
250
0
07 Jan 2019
Recurrent Control Nets for Deep Reinforcement Learning
Vincent Liu
Ademi Adeniji
Nathaniel Lee
Jason Zhao
Mario Srouji
18
3
0
06 Jan 2019
Exploring applications of deep reinforcement learning for real-world autonomous driving systems
V. Talpaert
Ibrahim Sobh
Ravi Kiran
Patrick Mannion
S. Yogamani
Ahmad El-Sallab
P. Pérez
70
74
0
06 Jan 2019
What Should I Do Now? Marrying Reinforcement Learning and Symbolic Planning
Daniel Gordon
Dieter Fox
Ali Farhadi
78
20
0
06 Jan 2019