Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 1,517 papers shown
Title
CARLA: An Open Urban Driving Simulator
Alexey Dosovitskiy
G. Ros
Felipe Codevilla
Antonio M. López
V. Koltun
VLM
98
5,075
0
10 Nov 2017
Reinforcement Learning of Speech Recognition System Based on Policy Gradient and Hypothesis Selection
Taku Kato
T. Shinozaki
17
21
0
10 Nov 2017
A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Marc Lanctot
V. Zambaldi
A. Gruslys
Angeliki Lazaridou
K. Tuyls
Julien Perolat
David Silver
T. Graepel
50
628
0
02 Nov 2017
Visualizing and Understanding Atari Agents
S. Greydanus
Anurag Koul
Jonathan Dodge
Alan Fern
FAtt
37
342
0
31 Oct 2017
TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning
Gregory Farquhar
Tim Rocktaschel
Maximilian Igl
Shimon Whiteson
OffRL
25
71
0
31 Oct 2017
Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach
Yuhang Song
Mai Xu
Jianyi Wang
Minglang Qiao
Liangyu Huo
Zulin Wang
37
205
0
30 Oct 2017
Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning
Sergio Valcarcel Macua
Aleksi Tukiainen
D. Hernández
David Baldazo
Enrique Munoz de Cote
S. Zazo
32
29
0
28 Oct 2017
Understanding Early Word Learning in Situated Artificial Agents
Felix Hill
S. Clark
Karl Moritz Hermann
Phil Blunsom
LM&Ro
19
32
0
26 Oct 2017
Consequentialist conditional cooperation in social dilemmas with imperfect information
A. Peysakhovich
Adam Lerer
38
65
0
19 Oct 2017
Socially Compliant Navigation through Raw Depth Inputs with Generative Adversarial Imitation Learning
L. Tai
Jingwei Zhang
Ming Liu
Wolfram Burgard
GAN
22
176
0
06 Oct 2017
Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight
Yen-Chen Lin
Ming Liu
Min Sun
Jia-Bin Huang
AAML
29
48
0
02 Oct 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Garrett A. Warnell
Nicholas R. Waytowich
Vernon J. Lawhern
Peter Stone
13
266
0
28 Sep 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
35
544
0
18 Sep 2017
The Uncertainty Bellman Equation and Exploration
Brendan O'Donoghue
Ian Osband
Rémi Munos
Volodymyr Mnih
27
186
0
15 Sep 2017
Deep Reinforcement Learning with Surrogate Agent-Environment Interface
Songli Wang
Yutao Jing
20
1
0
12 Sep 2017
TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow
Danijar Hafner
James Davidson
Vincent Vanhoucke
OffRL
14
49
0
08 Sep 2017
Mean Actor Critic
Cameron Allen
Kavosh Asadi
Melrose Roderick
Abdel-rahman Mohamed
George Konidaris
Michael Littman
22
44
0
01 Sep 2017
Learning the Enigma with Recurrent Neural Networks
S. Greydanus
32
39
0
24 Aug 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
62
2,776
0
19 Aug 2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Shun Liao
Roger C. Grosse
Jimmy Ba
OffRL
22
622
0
17 Aug 2017
An Information-Theoretic Optimality Principle for Deep Reinforcement Learning
Felix Leibfried
Jordi Grau-Moya
Haitham Bou-Ammar
38
24
0
06 Aug 2017
The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task
Amr Sharaf
Shi Feng
Khanh Nguyen
Kianté Brantley
Hal Daumé
16
4
0
03 Aug 2017
Grounding Language for Transfer in Deep Reinforcement Learning
Karthik Narasimhan
Regina Barzilay
Tommi Jaakkola
LM&Ro
OffRL
27
25
0
01 Aug 2017
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
I. Higgins
Arka Pal
Andrei A. Rusu
Loic Matthey
Christopher P. Burgess
Alexander Pritzel
M. Botvinick
Charles Blundell
Alexander Lerchner
DRL
43
410
0
26 Jul 2017
Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback
Khanh Nguyen
Hal Daumé
Jordan L. Boyd-Graber
30
137
0
24 Jul 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
72
18,334
0
20 Jul 2017
Imagination-Augmented Agents for Deep Reinforcement Learning
T. Weber
S. Racanière
David P. Reichert
Lars Buesing
A. Guez
...
Razvan Pascanu
Peter W. Battaglia
Demis Hassabis
David Silver
Daan Wierstra
LM&Ro
54
551
0
19 Jul 2017
On-line Building Energy Optimization using Deep Reinforcement Learning
Elena Mocanu
Decebal Constantin Mocanu
Phuong H. Nguyen
A. Liotta
M. Webber
M. Gibescu
J. Slootweg
OffRL
32
464
0
18 Jul 2017
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
24
229
0
17 Jul 2017
Distral: Robust Multitask Reinforcement Learning
Yee Whye Teh
V. Bapst
Wojciech M. Czarnecki
John Quan
J. Kirkpatrick
R. Hadsell
N. Heess
Razvan Pascanu
44
544
0
13 Jul 2017
Learning Macromanagement in StarCraft from Replays using Deep Learning
Niels Justesen
S. Risi
44
68
0
12 Jul 2017
SCAN: Learning Hierarchical Compositional Visual Concepts
I. Higgins
Nicolas Sonnerat
Loic Matthey
Arka Pal
Christopher P. Burgess
Matko Bosnjak
Murray Shanahan
M. Botvinick
Demis Hassabis
Alexander Lerchner
OCL
DRL
CoGe
24
51
0
11 Jul 2017
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
143
928
0
07 Jul 2017
ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games
Yuandong Tian
Qucheng Gong
Wenling Shang
Yuxin Wu
C. L. Zitnick
OffRL
27
126
0
04 Jul 2017
A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem
Zhengyao Jiang
Dixing Xu
Jinjun Liang
OOD
13
344
0
30 Jun 2017
Neural Sequence Model Training via
α
α
α
-divergence Minimization
Sotetsu Koyamada
Yuta Kikuchi
Atsunori Kanemura
S. Maeda
S. Ishii
65
0
0
30 Jun 2017
Neural SLAM: Learning to Explore with External Memory
Jingwei Zhang
L. Tai
Ming Liu
Joschka Boedecker
Wolfram Burgard
22
71
0
29 Jun 2017
Count-Based Exploration in Feature Space for Reinforcement Learning
Jarryd Martin
S. N. Sasikumar
Tom Everitt
Marcus Hutter
24
122
0
25 Jun 2017
Observational Learning by Reinforcement Learning
Diana Borsa
Bilal Piot
Rémi Munos
Olivier Pietquin
OffRL
25
45
0
20 Jun 2017
Grounded Language Learning in a Simulated 3D World
Karl Moritz Hermann
Felix Hill
Simon Green
Fumin Wang
Ryan Faulkner
...
Denis Teplyashin
Marcus Wainwright
C. Apps
Demis Hassabis
Phil Blunsom
LM&Ro
11
305
0
20 Jun 2017
Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning
Nick Erickson
Qi Zhao
CLL
OffRL
210
2
0
19 Jun 2017
Value-Decomposition Networks For Cooperative Multi-Agent Learning
P. Sunehag
Guy Lever
A. Gruslys
Wojciech M. Czarnecki
V. Zambaldi
...
Marc Lanctot
Nicolas Sonnerat
Joel Z Leibo
K. Tuyls
T. Graepel
36
985
0
16 Jun 2017
Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics
Ken Kansky
Tom Silver
David A. Mély
Mohamed Eldawy
Miguel Lazaro-Gredilla
Xinghua Lou
N. Dorfman
Szymon Sidor
Scott Phoenix
Dileep George
AI4CE
37
230
0
14 Jun 2017
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
18
3,118
0
12 Jun 2017
Generalized Value Iteration Networks: Life Beyond Lattices
Sufeng Niu
Siheng Chen
Hanyu Guo
Colin Targonski
M. C. Smith
J. Kovacevic
GNN
27
53
0
08 Jun 2017
The Atari Grand Challenge Dataset
Vitaly Kurin
Sebastian Nowozin
Katja Hofmann
Lucas Beyer
Bastian Leibe
OffRL
9
43
0
31 May 2017
End-to-end Active Object Tracking via Reinforcement Learning
Wenhan Luo
Peng Sun
Fangwei Zhong
Wei Liu
Yadong Mu
Yizhou Wang
36
82
0
30 May 2017
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
11
1,302
0
30 May 2017
Learning End-to-end Multimodal Sensor Policies for Autonomous Navigation
Guan-Horng Liu
Avinash Siravuru
Sai P. Selvaraj
Manuela Veloso
George Kantor
10
69
0
30 May 2017
The Marginal Value of Adaptive Gradient Methods in Machine Learning
Ashia Wilson
Rebecca Roelofs
Mitchell Stern
Nathan Srebro
Benjamin Recht
ODL
20
1,013
0
23 May 2017
Previous
1
2
3
...
28
29
30
31
Next