Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
A Review of Reinforcement Learning for Autonomous Building Energy Management
Karl Mason
S. Grijalva
AI4CE
74
224
0
12 Mar 2019
On the Pitfalls of Measuring Emergent Communication
Ryan J. Lowe
Jakob N. Foerster
Y-Lan Boureau
Joelle Pineau
Yann N. Dauphin
146
135
0
12 Mar 2019
Multi-Agent Deep Reinforcement Learning for Large-scale Traffic Signal Control
Tianshu Chu
Jie Wang
Lara Codecà
Zhaojian Li
60
675
0
11 Mar 2019
Learning to Paint With Model-based Deep Reinforcement Learning
Zhewei Huang
Wen Heng
Shuchang Zhou
GAN
117
156
0
11 Mar 2019
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
67
17
0
11 Mar 2019
Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks
Kuan Fang
Alexander Toshev
Li Fei-Fei
Silvio Savarese
OffRL
142
202
0
09 Mar 2019
Learning Hierarchical Teaching Policies for Cooperative Agents
Dong-Ki Kim
Miao Liu
Shayegan Omidshafiei
Sebastian Lopez-Cot
Matthew D Riemer
Golnaz Habibi
Gerald Tesauro
Sami Mourad
Murray Campbell
Jonathan P. How
65
7
0
07 Mar 2019
Concurrent Meta Reinforcement Learning
Emilio Parisotto
Soham Ghosh
S. Yalamanchi
Varsha Chinnaobireddy
Yuhuai Wu
Ruslan Salakhutdinov
LRM
66
17
0
07 Mar 2019
Training in Task Space to Speed Up and Guide Reinforcement Learning
Guillaume Bellegarda
Katie Byl
54
19
0
06 Mar 2019
Viewpoint Optimization for Autonomous Strawberry Harvesting with Deep Reinforcement Learning
Jonathon Sather
Xiaozheng Jane Zhang
OffRL
26
3
0
05 Mar 2019
Deep Active Localization
S. Gottipati
K. Seo
Dhaivat Bhatt
Vincent Mai
Krishna Murthy Jatavallabhula
Liam Paull
94
38
0
05 Mar 2019
The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation
Chih-Yao Ma
Zuxuan Wu
G. Al-Regib
Caiming Xiong
Z. Kira
LM&Ro
89
175
0
05 Mar 2019
Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future
Nan Rosemary Ke
Amanpreet Singh
Ahmed Touati
Anirudh Goyal
Yoshua Bengio
Devi Parikh
Dhruv Batra
78
48
0
05 Mar 2019
Joint Perception and Control as Inference with an Object-based Implementation
Minne Li
Zheng Tian
Pranav Nashikkar
Ian Davies
Ying Wen
Jun Wang
36
2
0
04 Mar 2019
Microscopic Traffic Simulation by Cooperative Multi-agent Deep Reinforcement Learning
Giulio Bacchiani
Daniele Molinari
Marco Patander
68
22
0
04 Mar 2019
Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space
Zhou Fan
Ruilong Su
Weinan Zhang
Yong Yu
123
134
0
04 Mar 2019
The StreetLearn Environment and Dataset
Piotr Wojciech Mirowski
Andras Banki-Horvath
Keith Anderson
Denis Teplyashin
Karl Moritz Hermann
...
Matthew Koichi Grimes
Karen Simonyan
Koray Kavukcuoglu
Andrew Zisserman
R. Hadsell
3DV
77
66
0
04 Mar 2019
Reinforcement Learning on Variable Impedance Controller for High-Precision Robotic Assembly
Jianlan Luo
Eugen Solowjow
Chengtao Wen
J. A. Ojea
A. Agogino
Aviv Tamar
Pieter Abbeel
87
177
0
04 Mar 2019
A Strongly Asymptotically Optimal Agent in General Environments
Michael K. Cohen
Elliot Catt
Marcus Hutter
70
12
0
04 Mar 2019
Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments
Zhizheng Zhang
Jiale Chen
Zhibo Chen
Weiping Li
OffRL
93
61
0
03 Mar 2019
Model-Based Reinforcement Learning for Atari
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
...
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
OffRL
228
871
0
01 Mar 2019
Deep learning in bioinformatics: introduction, application, and perspective in big data era
Yu Li
Chao Huang
Lizhong Ding
Zhongxiao Li
Yijie Pan
Xin Gao
AI4CE
99
302
0
28 Feb 2019
Neural Packet Classification
Eric Liang
Hang Zhu
Xin Jin
Ion Stoica
OffRL
78
122
0
27 Feb 2019
The Termination Critic
Anna Harutyunyan
Will Dabney
Diana Borsa
N. Heess
Rémi Munos
Doina Precup
OffRL
55
48
0
26 Feb 2019
Design of intentional backdoors in sequential models
Zhaoyuan Yang
N. Iyer
Johan Reimann
Nurali Virani
SILM
AAML
66
40
0
26 Feb 2019
Cooperative Learning of Disjoint Syntax and Semantics
Serhii Havrylov
Germán Kruszewski
Armand Joulin
77
48
0
25 Feb 2019
Adversarial Reinforcement Learning under Partial Observability in Autonomous Computer Network Defence
Yi Han
David Hubczenko
Paul Montague
O. Vel
Tamas Abraham
Benjamin I. P. Rubinstein
C. Leckie
T. Alpcan
S. Erfani
AAML
70
6
0
25 Feb 2019
Where Do Human Heuristics Come From?
Marcel Binz
Dominik M. Endres
23
0
0
20 Feb 2019
Analysing Factorizations of Action-Value Networks for Cooperative Multi-Agent Reinforcement Learning
Jacopo Castellini
F. Oliehoek
Rahul Savani
Shimon Whiteson
52
3
0
20 Feb 2019
DIViS: Domain Invariant Visual Servoing for Collision-Free Goal Reaching
Fereshteh Sadeghi
86
28
0
18 Feb 2019
Fast Efficient Hyperparameter Tuning for Policy Gradients
Supratik Paul
Vitaly Kurin
Shimon Whiteson
72
32
0
18 Feb 2019
A new Potential-Based Reward Shaping for Reinforcement Learning Agent
Babak Badnava
Mona Esmaeili
N. Mozayani
Payman Zarkesh-Ha
37
24
0
17 Feb 2019
Leveraging Communication Topologies Between Learning Agents in Deep Reinforcement Learning
D. Adjodah
D. Calacci
Abhimanyu Dubey
Anirudh Goyal
P. Krafft
Esteban Moro Egido
Alex Pentland
AI4CE
75
8
0
16 Feb 2019
Network Offloading Policies for Cloud Robotics: a Learning-based Approach
Sandeep P. Chinchali
Apoorva Sharma
James Harrison
Amine Elhafsi
Daniel Kang
Evgenya Pergament
Eyal Cidon
Sachin Katti
Marco Pavone
OffRL
66
107
0
15 Feb 2019
Learn a Prior for RHEA for Better Online Planning
Xinyao Tong
W. Liu
Bin Li
OffRL
107
0
0
14 Feb 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
36
9
0
14 Feb 2019
Off-Policy Actor-Critic in an Ensemble: Achieving Maximum General Entropy and Effective Environment Exploration in Deep Reinforcement Learning
Gang Chen
Yiming Peng
40
8
0
14 Feb 2019
ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning
Harris Chan
Yuhuai Wu
J. Kiros
Sanja Fidler
Jimmy Ba
102
34
0
12 Feb 2019
Deep Reinforcement Learning from Policy-Dependent Human Feedback
Dilip Arumugam
Jun Ki Lee
S. Saskin
Michael L. Littman
76
100
0
12 Feb 2019
Performance Dynamics and Termination Errors in Reinforcement Learning: A Unifying Perspective
Nikki Lijing Kuang
C. Leung
18
6
0
11 Feb 2019
Policy Learning for Fairness in Ranking
Ashudeep Singh
Thorsten Joachims
OffRL
98
219
0
11 Feb 2019
Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning
Frederik Benzing
M. Gauy
Asier Mujika
A. Martinsson
Angelika Steger
126
23
0
11 Feb 2019
Latent Space Reinforcement Learning for Steering Angle Prediction
Qadeer Ahmad Khan
Torsten Schön
Patrick Wenzel
SSL
LLMSV
57
8
0
11 Feb 2019
A Bandit Framework for Optimal Selection of Reinforcement Learning Agents
A. Merentitis
Kashif Rasul
Roland Vollgraf
Abdul-Saboor Sheikh
Urs M. Bergmann
40
2
0
10 Feb 2019
Size Independent Neural Transfer for RDDL Planning
Sankalp Garg
Aniket Bajpai
Mausam
OffRL
40
41
0
08 Feb 2019
Reinforcement Learning from Hierarchical Critics
Zehong Cao
Chin-Teng Lin
52
12
0
08 Feb 2019
Visual search and recognition for robot task execution and monitoring
L. Mauro
Francesco Puja
S. Grazioso
Valsamis Ntouskos
Marta Sanzari
Edoardo Alati
F. Pirri
56
9
0
07 Feb 2019
Compatible Natural Gradient Policy Search
Joni Pajarinen
Hong Linh Thai
R. Akrour
Jan Peters
Gerhard Neumann
69
22
0
07 Feb 2019
Metaoptimization on a Distributed System for Deep Reinforcement Learning
Greg Heinrich
I. Frosio
OffRL
28
2
0
07 Feb 2019
Real-time malware process detection and automated process killing
Matilda Rhode
Pete Burnap
Adam Wedgbury
16
11
0
07 Feb 2019
Previous
1
2
3
...
56
57
58
...
70
71
72
Next