Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue Policy Learning
Baolin Peng
Xiujun Li
Jianfeng Gao
Jingjing Liu
Yun-Nung Chen
Kam-Fai Wong
91
65
0
31 Oct 2017
Action-depedent Control Variates for Policy Optimization via Stein's Identity
Hao Liu
Yihao Feng
Yi Mao
Dengyong Zhou
Jian-wei Peng
Qiang Liu
94
4
0
30 Oct 2017
Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach
Yuhang Song
Mai Xu
Jianyi Wang
Minglang Qiao
Liangyu Huo
Zulin Wang
105
207
0
30 Oct 2017
Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning
Sergio Valcarcel Macua
Aleksi Tukiainen
D. Hernández
David Baldazo
Enrique Munoz de Cote
S. Zazo
114
29
0
28 Oct 2017
Generalization Tower Network: A Novel Deep Neural Network Architecture for Multi-Task Learning
Yuhang Song
Mai Xu
Songyang Zhang
Liangyu Huo
57
3
0
27 Oct 2017
Understanding Early Word Learning in Situated Artificial Agents
Felix Hill
S. Clark
Karl Moritz Hermann
Phil Blunsom
LM&Ro
93
32
0
26 Oct 2017
DoShiCo Challenge: Domain Shift in Control Prediction
Klaas Kelchtermans
Tinne Tuytelaars
22
0
0
26 Oct 2017
Meta Learning Shared Hierarchies
Kevin Frans
Jonathan Ho
Xi Chen
Pieter Abbeel
John Schulman
84
355
0
26 Oct 2017
Consequentialist conditional cooperation in social dilemmas with imperfect information
A. Peysakhovich
Adam Lerer
89
65
0
19 Oct 2017
Map-based Multi-Policy Reinforcement Learning: Enhancing Adaptability of Robots by Deep Reinforcement Learning
A. Kume
Eiichi Matsumoto
K. Takahashi
W. Ko
Jethro Tan
69
11
0
17 Oct 2017
Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation
Tianhao Zhang
Zoe McCarthy
Owen Jow
Dennis Lee
Xi Chen
Ken Goldberg
Pieter Abbeel
SSL
156
663
0
12 Oct 2017
Arguing Machines: Human Supervision of Black Box AI Systems That Make Life-Critical Decisions
Alex Fridman
Li Ding
Benedikt Jenik
B. Reimer
51
14
0
12 Oct 2017
AMBER: Adaptive Multi-Batch Experience Replay for Continuous Action Control
Seungyul Han
Y. Sung
OffRL
33
8
0
12 Oct 2017
Emergent Complexity via Multi-Agent Competition
Trapit Bansal
J. Pachocki
Szymon Sidor
Ilya Sutskever
Igor Mordatch
86
392
0
10 Oct 2017
MSC: A Dataset for Macro-Management in StarCraft II
Huikai Wu
Yanqi Zong
Junge Zhang
Kaiqi Huang
59
16
0
09 Oct 2017
Recurrent Deterministic Policy Gradient Method for Bipedal Locomotion on Rough Terrain Challenge
Doo Re Song
Chuanyu Yang
C. McGreavy
Zhibin Li
167
30
0
08 Oct 2017
Socially Compliant Navigation through Raw Depth Inputs with Generative Adversarial Imitation Learning
L. Tai
Jingwei Zhang
Ming-Yuan Liu
Wolfram Burgard
GAN
67
180
0
06 Oct 2017
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
112
2,283
0
06 Oct 2017
Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight
Yen-Chen Lin
Ming-Yuan Liu
Min Sun
Jia-Bin Huang
AAML
104
49
0
02 Oct 2017
Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning
Xiangxiang Chu
Hangjun Ye
74
56
0
01 Oct 2017
Vision-based deep execution monitoring
Francesco Puja
S. Grazioso
A. Tammaro
Valsamis Ntouskos
Marta Sanzari
F. Pirri
34
1
0
29 Sep 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Garrett A. Warnell
Nicholas R. Waytowich
Vernon J. Lawhern
Peter Stone
86
273
0
28 Sep 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
OffRL
188
791
0
28 Sep 2017
Towards continuous control of flippers for a multi-terrain robot using deep reinforcement learning
Giuseppe Paolo
L. Tai
Ming-Yuan Liu
17
7
0
25 Sep 2017
Learning Unmanned Aerial Vehicle Control for Autonomous Target Following
Siyi Li
Tianbo Liu
Fangqiu Yi
Dit-Yan Yeung
Shaojie Shen
53
38
0
24 Sep 2017
Expanding Motor Skills through Relay Neural Networks
Visak C. V. Kumar
Sehoon Ha
Chenxi Liu
27
2
0
22 Sep 2017
Avoidance of Manual Labeling in Robotic Autonomous Navigation Through Multi-Sensory Semi-Supervised Learning
Junhong Xu
Shangyue Zhu
Hanqing Guo
Shaoen Wu
SSL
31
3
0
22 Sep 2017
Learning Human Behaviors for Robot-Assisted Dressing
Alexander Clegg
Wenhao Yu
Jie Tan
Charles C. Kemp
Greg Turk
Chenxi Liu
38
3
0
20 Sep 2017
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson
Wei-Di Chang
Pierre-Luc Bacon
David Meger
Joelle Pineau
Doina Precup
GAN
77
73
0
20 Sep 2017
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
155
1,970
0
19 Sep 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
114
558
0
18 Sep 2017
Memory Augmented Control Networks
Arbaaz Khan
Clark Zhang
Nikolay Atanasov
Konstantinos Karydis
Vijay Kumar
Daniel D. Lee
82
77
0
17 Sep 2017
The Uncertainty Bellman Equation and Exploration
Brendan O'Donoghue
Ian Osband
Rémi Munos
Volodymyr Mnih
90
193
0
15 Sep 2017
Transforming Cooling Optimization for Green Data Center via Deep Reinforcement Learning
Yuanlong Li
Yonggang Wen
K. Guan
Dacheng Tao
AI4CE
88
180
0
15 Sep 2017
Shared Learning : Enhancing Reinforcement in
Q
Q
Q
-Ensembles
Rakesh R Menon
Balaraman Ravindran
33
0
0
14 Sep 2017
A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping
Debang Li
Huikai Wu
Junge Zhang
Kaiqi Huang
OffRL
56
9
0
14 Sep 2017
When Waiting is not an Option : Learning Options with a Deliberation Cost
J. Harb
Pierre-Luc Bacon
Martin Klissarov
Doina Precup
71
150
0
14 Sep 2017
A Study of AI Population Dynamics with Million-agent Reinforcement Learning
Yaodong Yang
Lantao Yu
Yiwei Bai
Jun Wang
Weinan Zhang
Ying Wen
Yong Yu
64
7
0
13 Sep 2017
Pre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
3DH
OffRL
84
58
0
12 Sep 2017
Deep Reinforcement Learning with Surrogate Agent-Environment Interface
Songli Wang
Yutao Jing
27
1
0
12 Sep 2017
TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow
Danijar Hafner
James Davidson
Vincent Vanhoucke
OffRL
59
49
0
08 Sep 2017
Prosocial learning agents solve generalized Stag Hunts better than selfish ones
A. Peysakhovich
Adam Lerer
114
109
0
08 Sep 2017
BOOK: Storing Algorithm-Invariant Episodes for Deep Reinforcement Learning
Simyung Chang
Y. Yoo
Jaeseok Choi
Nojun Kwak
OffRL
15
1
0
05 Sep 2017
Mean Actor Critic
Cameron Allen
Kavosh Asadi
Melrose Roderick
Abdel-rahman Mohamed
George Konidaris
Michael Littman
92
45
0
01 Sep 2017
Deep Learning for Video Game Playing
Niels Justesen
Philip Bontrager
Julian Togelius
S. Risi
VLM
101
208
0
25 Aug 2017
Learning the Enigma with Recurrent Neural Networks
S. Greydanus
81
39
0
24 Aug 2017
Reinforcement Learning in POMDPs with Memoryless Options and Option-Observation Initiation Sets
Denis Steckelmacher
D. Roijers
Anna Harutyunyan
Peter Vrancx
Hélène Plisnier
A. Nowé
113
20
0
22 Aug 2017
Teaching UAVs to Race: End-to-End Regression of Agile Controls in Simulation
Matthias Mueller
Vincent Casser
Neil G. Smith
D. L. Michels
Guohao Li
84
10
0
19 Aug 2017
Sim4CV: A Photo-Realistic Simulator for Computer Vision Applications
Matthias Muller
Vincent Casser
Jean Lahoud
Neil G. Smith
Guohao Li
VGen
74
181
0
19 Aug 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
173
2,830
0
19 Aug 2017
Previous
1
2
3
...
67
68
69
70
71
72
Next