1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Sequential Dynamic Decision Making with Deep Neural Nets on a Test-Time Budget
Henghui Zhu
Feng Nan
I. Paschalidis
Venkatesh Saligrama
11
2
0
31 May 2017
Experience Replay Using Transition Sequences
Thommen George Karimpanal
Roland Bouffanais
OffRL
39
14
0
30 May 2017
End-to-end Active Object Tracking via Reinforcement Learning
Wenhan Luo
Peng Sun
Fangwei Zhong
Wei Liu
Yadong Mu
Yizhou Wang
95
86
0
30 May 2017
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
193
1,339
0
30 May 2017
Learning End-to-end Multimodal Sensor Policies for Autonomous Navigation
Guan-Horng Liu
Avinash Siravuru
Sai P. Selvaraj
Manuela Veloso
George Kantor
89
69
0
30 May 2017
Convergent Tree Backup and Retrace with Function Approximation
Ahmed Touati
Pierre-Luc Bacon
Doina Precup
Pascal Vincent
106
40
0
25 May 2017
The Marginal Value of Adaptive Gradient Methods in Machine Learning
Ashia Wilson
Rebecca Roelofs
Mitchell Stern
Nathan Srebro
Benjamin Recht
ODL
125
1,035
0
23 May 2017
Enhanced Experience Replay Generation for Efficient Reinforcement Learning
Vincent Huang
Tobias Ley
Martha Vlachou-Konchylaki
Wenfeng Hu
OnRL
GAN
SyDa
41
10
0
23 May 2017
Visual Semantic Planning using Deep Successor Representations
Yuke Zhu
Daniel Gordon
Eric Kolve
Dieter Fox
Li Fei-Fei
Abhinav Gupta
Roozbeh Mottaghi
Ali Farhadi
112
142
0
23 May 2017
Neural Network Memory Architectures for Autonomous Robot Navigation
Steven W. Chen
Nikolay Atanasov
Arbaaz Khan
Konstantinos Karydis
Daniel D. Lee
Vijay Kumar
51
7
0
23 May 2017
Pairwise Confusion for Fine-Grained Visual Classification
Abhimanyu Dubey
O. Gupta
Pei Guo
Ramesh Raskar
Ryan Farrell
Nikhil Naik
54
10
0
22 May 2017
A unified view of entropy-regularized Markov decision processes
Gergely Neu
Anders Jonsson
Vicenç Gómez
121
264
0
22 May 2017
Guide Actor-Critic for Continuous Control
Voot Tangkaratt
A. Abdolmaleki
Masashi Sugiyama
67
17
0
22 May 2017
Shallow Updates for Deep Reinforcement Learning
Nir Levine
Tom Zahavy
D. Mankowitz
Aviv Tamar
Shie Mannor
OffRL
72
48
0
21 May 2017
Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning
Sahil Sharma
Girish Raguvir J
S. Ramesh
Balaraman Ravindran
38
6
0
21 May 2017
Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning
Sahil Sharma
A. Suresh
Rahul Ramesh
Balaraman Ravindran
OffRL
56
36
0
20 May 2017
Relaxed Wasserstein with Applications to GANs
Xin Guo
Johnny Hong
Tianyi Lin
Nan Yang
GAN
114
35
0
19 May 2017
Atari games and Intel processors
R. Adamski
T. Grel
Maciek Klimek
Henryk Michalewski
34
5
0
19 May 2017
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
Nat Dilokthanakul
Christos Kaplanis
Nick Pawlowski
Murray Shanahan
87
92
0
18 May 2017
Delving into adversarial attacks on deep policies
Jernej Kos
Basel Alomair
AAML
72
228
0
18 May 2017
Probabilistically Safe Policy Transfer
David Held
Zoe McCarthy
Michael Zhang
Fred Shentu
Pieter Abbeel
86
19
0
15 May 2017
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
183
2,456
0
15 May 2017
Discrete Sequential Prediction of Continuous Actions for Deep RL
Luke Metz
Julian Ibarz
Navdeep Jaitly
James Davidson
BDL
OffRL
92
121
0
14 May 2017
Efficient Parallel Methods for Deep Reinforcement Learning
Alfredo V. Clemente
Humberto Nicolás Castejón Martínez
A. Chandra
85
115
0
13 May 2017
Metacontrol for Adaptive Imagination-Based Optimization
Jessica B. Hamrick
A. J. Ballard
Razvan Pascanu
Oriol Vinyals
N. Heess
Peter W. Battaglia
76
69
0
07 May 2017
Toward Low-Flying Autonomous MAV Trail Navigation using Deep Neural Networks for Environmental Awareness
Nikolai Smolyanskiy
A. Kamenev
Jeffrey Smith
Stan Birchfield
144
223
0
07 May 2017
Traffic Light Control Using Deep Policy-Gradient and Value-Function Based Reinforcement Learning
Seyed Sajad Mousavi
Michael Schukat
Enda Howley
95
309
0
28 Apr 2017
Mapping Instructions and Visual Observations to Actions with Reinforcement Learning
Dipendra Kumar Misra
John Langford
Yoav Artzi
86
247
0
28 Apr 2017
General Video Game AI: Learning from Screen Capture
Kamolwan Kunanusont
Simon Lucas
Diego Perez-Liebana
60
20
0
23 Apr 2017
Equivalence Between Policy Gradients and Soft Q-Learning
John Schulman
Xi Chen
Pieter Abbeel
OffRL
132
349
0
21 Apr 2017
Beating Atari with Natural Language Guided Reinforcement Learning
Russell Kaplan
Chris Sauer
A. Sosa
LM&Ro
86
69
0
18 Apr 2017
Investigating Recurrence and Eligibility Traces in Deep Q-Networks
J. Harb
Doina Precup
54
21
0
18 Apr 2017
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
A. Gruslys
Will Dabney
M. G. Azar
Bilal Piot
Marc G. Bellemare
Rémi Munos
76
58
0
15 Apr 2017
Virtual to Real Reinforcement Learning for Autonomous Driving
Xinlei Pan
Yurong You
Ziyan Wang
Cewu Lu
OffRL
121
338
0
13 Apr 2017
Deep Reinforcement Learning-based Image Captioning with Embedding Reward
Zhou Ren
Xiaoyu Wang
Ning Zhang
Xutao Lv
Li Li
65
324
0
12 Apr 2017
Deep Q-learning from Demonstrations
Todd Hester
Matej Vecerík
Olivier Pietquin
Marc Lanctot
Tom Schaul
...
Gabriel Dulac-Arnold
Ian Osband
J. Agapiou
Joel Z Leibo
A. Gruslys
OffRL
94
157
0
12 Apr 2017
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation
I. Popov
N. Heess
Timothy Lillicrap
Roland Hafner
Gabriel Barth-Maron
Matej Vecerík
Thomas Lampe
Yuval Tassa
Tom Erez
Martin Riedmiller
OffRL
99
265
0
10 Apr 2017
Stein Variational Policy Gradient
Yang Liu
Prajit Ramachandran
Qiang Liu
Jian-wei Peng
80
141
0
07 Apr 2017
Recurrent Environment Simulators
Silvia Chiappa
S. Racanière
Daan Wierstra
S. Mohamed
85
211
0
07 Apr 2017
Learned Watershed: End-to-End Learning of Seeded Segmentation
Steffen Wolf
Lukas Schott
Ullrich Köthe
Fred Hamprecht
56
35
0
07 Apr 2017
Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games
Peng Peng
Ying Wen
Yaodong Yang
Quan Yuan
Zhenkun Tang
Haitao Long
Jun Wang
110
336
0
29 Mar 2017
Socially Aware Motion Planning with Deep Reinforcement Learning
Yu Fan Chen
Michael Everett
Miao Liu
Jonathan P. How
121
683
0
26 Mar 2017
Combining Neural Networks and Tree Search for Task and Motion Planning in Challenging Environments
Chris Paxton
Vasumathi Raman
Gregory Hager
Marin Kobilarov
78
123
0
22 Mar 2017
An End-to-End Approach to Natural Language Object Retrieval via Context-Aware Deep Reinforcement Learning
Fan Wu
Zhongwen Xu
Yi Yang
ObjD
60
11
0
22 Mar 2017
Learning to Navigate Cloth using Haptics
Alexander Clegg
Wenhao Yu
Zackory M. Erickson
Jie Tan
Chenxi Liu
Greg Turk
86
23
0
20 Mar 2017
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Tim Salimans
Jonathan Ho
Xi Chen
Szymon Sidor
Ilya Sutskever
174
1,545
0
10 Mar 2017
Tactics of Adversarial Attack on Deep Reinforcement Learning Agents
Yen-Chen Lin
Zhang-Wei Hong
Yuan-Hong Liao
Meng-Li Shih
Ming-Yu Liu
Min Sun
AAML
141
419
0
08 Mar 2017
Combining Self-Supervised Learning and Imitation for Vision-Based Rope Manipulation
Ashvin Nair
Dian Chen
Pulkit Agrawal
Phillip Isola
Pieter Abbeel
Jitendra Malik
Sergey Levine
SSL
83
312
0
06 Mar 2017
Neural Episodic Control
Alexander Pritzel
Benigno Uria
Sriram Srinivasan
A. Badia
Oriol Vinyals
Demis Hassabis
Daan Wierstra
Charles Blundell
OffRL
BDL
113
346
0
06 Mar 2017
Context-Based Concurrent Experience Sharing in Multiagent Systems
Dan Garant
Bruno Castro da Silva
V. Lesser
Chongjie Zhang
22
4
0
06 Mar 2017