Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Robust Dual View Deep Agent
Ibrahim Sobh
N. Darwish
66
2
0
13 Apr 2018
Emergence of Linguistic Communication from Referential Games with Symbolic and Pixel Input
Angeliki Lazaridou
Karl Moritz Hermann
K. Tuyls
S. Clark
LLMAG
87
217
0
11 Apr 2018
Universal Successor Representations for Transfer Reinforcement Learning
Chen Ma
Junfeng Wen
Yoshua Bengio
OffRL
42
33
0
11 Apr 2018
Gotta Learn Fast: A New Benchmark for Generalization in RL
Alex Nichol
Vicki Pfau
Christopher Hesse
Oleg Klimov
John Schulman
VLM
OffRL
74
177
0
10 Apr 2018
Understanding disentangling in
β
β
β
-VAE
Christopher P. Burgess
I. Higgins
Arka Pal
Loic Matthey
Nicholas Watters
Guillaume Desjardins
Alexander Lerchner
CoGe
DRL
73
832
0
10 Apr 2018
Policy Gradient With Value Function Approximation For Collective Multiagent Planning
D. Nguyen
Akshat Kumar
H. Lau
96
43
0
09 Apr 2018
Latent Space Policies for Hierarchical Reinforcement Learning
Tuomas Haarnoja
Kristian Hartikainen
Pieter Abbeel
Sergey Levine
BDL
87
193
0
09 Apr 2018
Differentiable plasticity: training plastic neural networks with backpropagation
Thomas Miconi
Jeff Clune
Kenneth O. Stanley
AI4CE
90
154
0
06 Apr 2018
A Human Mixed Strategy Approach to Deep Reinforcement Learning
Ngoc Duy Nguyen
S. Nahavandi
Thanh Nguyen
76
12
0
05 Apr 2018
Information Maximizing Exploration with a Latent Dynamics Model
Trevor Barron
Oliver Obst
H. B. Amor
65
3
0
04 Apr 2018
Synthesizing Programs for Images using Reinforced Adversarial Learning
Yaroslav Ganin
Tejas D. Kulkarni
Igor Babuschkin
A. Eslami
Oriol Vinyals
GAN
89
230
0
03 Apr 2018
StarCraft Micromanagement with Reinforcement Learning and Curriculum Transfer Learning
Kun Shao
Yuanheng Zhu
Dongbin Zhao
145
171
0
03 Apr 2018
Curiosity-driven Exploration for Mapless Navigation with Deep Reinforcement Learning
Oleksii Zhelo
Jingwei Zhang
L. Tai
Ming-Yuan Liu
Wolfram Burgard
68
105
0
02 Apr 2018
Recall Traces: Backtracking Models for Efficient Reinforcement Learning
Anirudh Goyal
Philemon Brakel
W. Fedus
Soumye Singhal
Timothy Lillicrap
Sergey Levine
Hugo Larochelle
Yoshua Bengio
OffRL
100
68
0
02 Apr 2018
Learning to Navigate in Cities Without a Map
Piotr Wojciech Mirowski
Matthew Koichi Grimes
Mateusz Malinowski
Karl Moritz Hermann
Keith Anderson
Denis Teplyashin
Karen Simonyan
Koray Kavukcuoglu
Andrew Zisserman
R. Hadsell
SSL
HAI
105
320
0
31 Mar 2018
Entropy based Independent Learning in Anonymous Multi-Agent Settings
Tanvi Verma
Pradeep Varakantham
H. Lau
63
7
0
27 Mar 2018
Inequity aversion improves cooperation in intertemporal social dilemmas
Edward Hughes
Joel Z Leibo
Matthew Phillips
K. Tuyls
Edgar A. Duénez-Guzmán
...
Tina Zhu
Kevin R. McKee
Raphael Köster
H. Roff
T. Graepel
82
211
0
23 Mar 2018
Deep Reinforcement Learning with Model Learning and Monte Carlo Tree Search in Minecraft
Stephan Alaniz
53
16
0
22 Mar 2018
DOP: Deep Optimistic Planning with Approximate Value Function Evaluation
Francesco Riccio
Roberto Capobianco
Daniele Nardi
23
3
0
22 Mar 2018
Optimizing Sponsored Search Ranking Strategy by Deep Reinforcement Learning
Li He
Liang Wang
Kaipeng Liu
Bo Wu
Weinan Zhang
41
7
0
20 Mar 2018
Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines
Cathy Wu
Aravind Rajeswaran
Yan Duan
Vikash Kumar
Alexandre M. Bayen
Sham Kakade
Igor Mordatch
Pieter Abbeel
OffRL
89
153
0
20 Mar 2018
Automated Curriculum Learning by Rewarding Temporally Rare Events
Niels Justesen
S. Risi
OffRL
69
20
0
19 Mar 2018
Simple random search provides a competitive approach to reinforcement learning
Horia Mania
Aurelia Guy
Benjamin Recht
75
317
0
19 Mar 2018
TBD: Benchmarking and Analyzing Deep Neural Network Training
Hongyu Zhu
Mohamed Akrout
Bojian Zheng
Andrew Pelegris
Amar Phanishayee
Bianca Schroeder
Gennady Pekhimenko
90
81
0
16 Mar 2018
Imitation Learning with Concurrent Actions in 3D Games
Jack Harmer
Linus Gisslén
Jorge del Val
Henrik Holst
Joakim Bergdahl
Tom Olsson
K. Sjöö
Magnus Nordin
75
46
0
14 Mar 2018
Learning to Play General Video-Games via an Object Embedding Network
William Woof
Ke Chen
61
13
0
14 Mar 2018
Learning to Explore with Meta-Policy Gradient
Tianbing Xu
Qiang Liu
Liang Zhao
Jian Peng
74
54
0
13 Mar 2018
Policy Search in Continuous Action Domains: an Overview
Olivier Sigaud
F. Stulp
128
73
0
13 Mar 2018
Deep Learning in Mobile and Wireless Networking: A Survey
Chaoyun Zhang
P. Patras
Hamed Haddadi
134
1,320
0
12 Mar 2018
Two-Stage Convolutional Neural Network for Breast Cancer Histology Image Classification
Kamyar Nazeri
Azad Aminpour
Mehran Ebrahimi
83
267
0
11 Mar 2018
Kickstarting Deep Reinforcement Learning
Simon Schmitt
Jonathan J. Hudson
Augustin Žídek
Simon Osindero
Carl Doersch
...
Joel Z Leibo
Heinrich Küttler
Andrew Zisserman
Karen Simonyan
S. M. Ali Eslami
OnRL
80
135
0
10 Mar 2018
The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities
Joel Lehman
Jeff Clune
D. Misevic
C. Adami
L. Altenberg
...
Danesh Tarapore
S. Thibault
Westley Weimer
R. Watson
Jason Yosinksi
177
282
0
09 Mar 2018
A Brandom-ian view of Reinforcement Learning towards strong-AI
Atrisha Sarkar
19
2
0
07 Mar 2018
Accelerated Methods for Deep Reinforcement Learning
Adam Stooke
Pieter Abbeel
OffRL
OnRL
73
136
0
07 Mar 2018
Kinematic Morphing Networks for Manipulation Skill Transfer
Péter Englert
Marc Toussaint
3DPC
26
5
0
05 Mar 2018
Learning Sample-Efficient Target Reaching for Mobile Robots
Arbaaz Khan
Vijay Kumar
Alejandro Ribeiro
SSL
38
7
0
05 Mar 2018
Learning to Sequence Robot Behaviors for Visual Navigation
Hadi Salman
Puneet Singhal
Tanmay Shankar
Peng Yin
A. Salman
William Paivine
Guillaume Sartoretti
Matthew Travers
Howie Choset
40
8
0
05 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
133
883
0
03 Mar 2018
OIL: Observational Imitation Learning
Ge Li
Matthias Muller
Vincent Casser
Neil G. Smith
D. L. Michels
Guohao Li
119
41
0
03 Mar 2018
Some Considerations on Learning to Explore via Meta-Reinforcement Learning
Bradly C. Stadie
Ge Yang
Rein Houthooft
Xi Chen
Yan Duan
Yuhuai Wu
Pieter Abbeel
Ilya Sutskever
LRM
100
116
0
03 Mar 2018
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
165
742
0
02 Mar 2018
Semi-parametric Topological Memory for Navigation
Nikolay Savinov
Alexey Dosovitskiy
V. Koltun
105
383
0
01 Mar 2018
Hierarchical Imitation and Reinforcement Learning
Hoang Minh Le
Nan Jiang
Alekh Agarwal
Miroslav Dudík
Yisong Yue
Hal Daumé
79
194
0
01 Mar 2018
Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods
Deirdre Quillen
Eric Jang
Ofir Nachum
Chelsea Finn
Julian Ibarz
Sergey Levine
OOD
OffRL
102
204
0
28 Feb 2018
Investigating Human Priors for Playing Video Games
Rachit Dubey
Pulkit Agrawal
Deepak Pathak
Thomas Griffiths
Alexei A. Efros
OffRL
134
146
0
28 Feb 2018
Latent-space Physics: Towards Learning the Temporal Evolution of Fluid Flow
S. Wiewel
M. Becher
N. Thürey
AI4CE
124
276
0
27 Feb 2018
The Mirage of Action-Dependent Baselines in Reinforcement Learning
George Tucker
Surya Bhupatiraju
S. Gu
Richard Turner
Zoubin Ghahramani
Sergey Levine
OffRL
112
127
0
27 Feb 2018
Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising
Junqi Jin
Cheng-Ning Song
Han Li
Kun Gai
Jun Wang
Weinan Zhang
64
180
0
27 Feb 2018
Modeling Others using Oneself in Multi-Agent Reinforcement Learning
Roberta Raileanu
Emily L. Denton
Arthur Szlam
Rob Fergus
97
202
0
26 Feb 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
380
5,252
0
26 Feb 2018
Previous
1
2
3
...
64
65
66
...
70
71
72
Next