1602.01783
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Joshua Achiam
S. Shankar Sastry
80
238
0
06 Mar 2017
Third-Person Imitation Learning
Bradly C. Stadie
Pieter Abbeel
Ilya Sutskever
107
234
0
06 Mar 2017
FeUdal Networks for Hierarchical Reinforcement Learning
A. Vezhnevets
Simon Osindero
Tom Schaul
N. Heess
Max Jaderberg
David Silver
Koray Kavukcuoglu
FedML
112
910
0
03 Mar 2017
Virtual-to-real Deep Reinforcement Learning: Continuous Control of Mobile Robots for Mapless Navigation
L. Tai
Giuseppe Paolo
Ming-Yuan Liu
105
714
0
01 Mar 2017
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
205
478
0
28 Feb 2017
Learning What Data to Learn
Yang Fan
Fei Tian
Tao Qin
Jiang Bian
Tie-Yan Liu
87
79
0
28 Feb 2017
Neural Map: Structured Memory for Deep Reinforcement Learning
Emilio Parisotto
Ruslan Salakhutdinov
101
261
0
27 Feb 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
124
1,351
0
27 Feb 2017
Learning Control for Air Hockey Striking using Deep Reinforcement Learning
Ayal Taitler
N. Shimkin
55
10
0
26 Feb 2017
Online Meta-learning by Parallel Algorithm Competition
Stefan Elfwing
E. Uchibe
Kenji Doya
73
22
0
24 Feb 2017
Deep Models Under the GAN: Information Leakage from Collaborative Deep Learning
Briland Hitaj
G. Ateniese
Fernando Perez-Cruz
FedML
157
1,416
0
24 Feb 2017
Active One-shot Learning
Mark P. Woodward
Chelsea Finn
VLM
OffRL
83
130
0
21 Feb 2017
Towards a Common Implementation of Reinforcement Learning for Multiple Robotic Tasks
Angel Martínez-Tenor
Juan-Antonio Fernández-Madrigal
A. Cruz-Martín
Javier González Jiménez
OffRL
34
32
0
21 Feb 2017
Real-time visual tracking by deep reinforced decision making
Janghoon Choi
Junseok Kwon
Kyoung Mu Lee
75
41
0
21 Feb 2017
Beating the World's Best at Super Smash Bros. with Deep Reinforcement Learning
Vlad Firoiu
William F. Whitney
J. Tenenbaum
94
36
0
21 Feb 2017
Learning to Multi-Task by Active Sampling
Sahil Sharma
Ashutosh Jha
Parikshit Hegde
Balaraman Ravindran
151
21
0
20 Feb 2017
Collaborative Deep Reinforcement Learning
Kaixiang Lin
Shu Wang
Jiayu Zhou
77
21
0
19 Feb 2017
Cognitive Mapping and Planning for Visual Navigation
Saurabh Gupta
Varun Tolani
James Davidson
Sergey Levine
Rahul Sukthankar
Jitendra Malik
131
715
0
13 Feb 2017
Sigmoid-Weighted Linear Units for Neural Network Function Approximation in Reinforcement Learning
Stefan Elfwing
E. Uchibe
Kenji Doya
145
1,760
0
10 Feb 2017
Preparing for the Unknown: Learning a Universal Policy with Online System Identification
Wenhao Yu
Jie Tan
Chenxi Liu
Greg Turk
OffRL
110
309
0
08 Feb 2017
Adversarial Attacks on Neural Network Policies
Sandy Huang
Nicolas Papernot
Ian Goodfellow
Yan Duan
Pieter Abbeel
MLAU
AAML
123
841
0
08 Feb 2017
DeepNav: Learning to Navigate Large Cities
Samarth Brahmbhatt
James Hays
SSL
HAI
74
54
0
31 Jan 2017
PathNet: Evolution Channels Gradient Descent in Super Neural Networks
Chrisantha Fernando
Dylan Banarse
Charles Blundell
Yori Zwols
David R Ha
Andrei A. Rusu
Alexander Pritzel
Daan Wierstra
77
882
0
30 Jan 2017
Wasserstein GAN
Martín Arjovsky
Soumith Chintala
Léon Bottou
GAN
199
4,836
0
26 Jan 2017
Learning Light Transport the Reinforced Way
Ken Dahm
A. Keller
75
64
0
25 Jan 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
346
1,550
0
25 Jan 2017
Regularizing Neural Networks by Penalizing Confident Output Distributions
Gabriel Pereyra
George Tucker
J. Chorowski
Lukasz Kaiser
Geoffrey E. Hinton
NoLa
229
1,142
0
23 Jan 2017
A K-fold Method for Baseline Estimation in Policy Gradient Algorithms
N. Kota
Abhishek Mishra
Sunil Srinivasa
Xi Chen
Pieter Abbeel
OffRL
42
0
0
03 Jan 2017
The Predictron: End-To-End Learning and Planning
David Silver
H. V. Hasselt
Matteo Hessel
Tom Schaul
A. Guez
...
Gabriel Dulac-Arnold
David P. Reichert
Neil C. Rabinowitz
André Barreto
T. Degris
90
292
0
28 Dec 2016
Loss is its own Reward: Self-Supervision for Reinforcement Learning
Evan Shelhamer
Parsa Mahmoudieh
Max Argus
Trevor Darrell
SSL
105
186
0
21 Dec 2016
A Survey of Deep Network Solutions for Learning Control in Robotics: From Reinforcement to Imitation
L. Tai
Jingwei Zhang
Ming-Yuan Liu
Joschka Boedecker
Wolfram Burgard
OffRL
147
78
0
21 Dec 2016
DeepMind Lab
Charlie Beattie
Joel Z Leibo
Denis Teplyashin
Tom Ward
Marcus Wainwright
...
Stephen Gaffney
Helen King
Demis Hassabis
Shane Legg
Stig Petersen
66
241
0
12 Dec 2016
Towards better decoding and language model integration in sequence to sequence models
J. Chorowski
Navdeep Jaitly
106
370
0
08 Dec 2016
Combining Deep Reinforcement Learning and Safety Based Control for Autonomous Driving
Xincheng Xiong
Jianqiang Wang
Fang Zhang
Keqiang Li
88
66
0
01 Dec 2016
Neural Combinatorial Optimization with Reinforcement Learning
Irwan Bello
Hieu H. Pham
Quoc V. Le
Mohammad Norouzi
Samy Bengio
175
1,499
0
29 Nov 2016
Improving Policy Gradient by Exploring Under-appreciated Rewards
Ofir Nachum
Mohammad Norouzi
Dale Schuurmans
106
44
0
28 Nov 2016
Nonparametric General Reinforcement Learning
Jan Leike
OffRL
105
26
0
28 Nov 2016
Training an Interactive Humanoid Robot Using Multimodal Deep Reinforcement Learning
Heriberto Cuayáhuitl
G. Couly
Clément Olalainty
52
3
0
26 Nov 2016
Dense Captioning with Joint Inference and Visual Context
L. Yang
K. Tang
Jianchao Yang
Li Li
VLM
103
170
0
21 Nov 2016
Local minima in training of neural networks
G. Swirszcz
Wojciech M. Czarnecki
Razvan Pascanu
ODL
83
73
0
19 Nov 2016
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU
Mohammad Babaeizadeh
I. Frosio
Stephen Tyree
Jason Clemons
Jan Kautz
OffRL
82
259
0
18 Nov 2016
Learning to reinforcement learn
Jane X. Wang
Z. Kurth-Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z Leibo
Rémi Munos
Charles Blundell
D. Kumaran
M. Botvinick
OffRL
108
985
0
17 Nov 2016
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
121
1,229
0
16 Nov 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
143
777
0
15 Nov 2016
How to scale distributed deep learning?
Peter H. Jin
Qiaochu Yuan
F. Iandola
Kurt Keutzer
3DH
85
137
0
14 Nov 2016
Learning to Navigate in Complex Environments
Piotr Wojciech Mirowski
Razvan Pascanu
Fabio Viola
Hubert Soyer
Andy Ballard
...
Ross Goroshin
Laurent Sifre
Koray Kavukcuoglu
D. Kumaran
R. Hadsell
118
880
0
11 Nov 2016
RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
114
1,029
0
09 Nov 2016
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Sergey Levine
OffRL
BDL
106
345
0
07 Nov 2016
Playing SNES in the Retro Learning Environment
Nadav Bhonker
Shai Rozenberg
Itay Hubara
63
19
0
07 Nov 2016
Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning
Oron Anschel
Nir Baram
N. Shimkin
104
318
0
07 Nov 2016