Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.01561
Cited By
v1
v2
v3 (latest)
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 1,000 papers shown
Title
Generalized Off-Policy Actor-Critic
Shangtong Zhang
Wendelin Bohmer
Shimon Whiteson
OffRL
CML
151
43
0
27 Mar 2019
Optimization Methods for Interpretable Differentiable Decision Trees in Reinforcement Learning
I. D. Rodriguez
Taylor W. Killian
Ivan Dario Jimenez Rodriguez
Sung-Hyun Son
Matthew C. Gombolay
OffRL
85
12
0
22 Mar 2019
Learning Reciprocity in Complex Sequential Social Dilemmas
Tom Eccles
Edward Hughes
János Kramár
S. Wheelwright
Joel Z Leibo
52
50
0
19 Mar 2019
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Dhruva Tirumala
Hyeonwoo Noh
Alexandre Galashov
Leonard Hasenclever
Arun Ahuja
Greg Wayne
Razvan Pascanu
Yee Whye Teh
N. Heess
OffRL
72
44
0
18 Mar 2019
Scheduled Intrinsic Drive: A Hierarchical Take on Intrinsically Motivated Exploration
Jingwei Zhang
Niklas Wetzel
Nicolai Dorka
Joschka Boedecker
Wolfram Burgard
67
26
0
18 Mar 2019
A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning
Wesley A Suttle
Zhuoran Yang
Kai Zhang
Zhaoran Wang
Tamer Basar
Ji Liu
OffRL
84
64
0
15 Mar 2019
The StreetLearn Environment and Dataset
Piotr Wojciech Mirowski
Andras Banki-Horvath
Keith Anderson
Denis Teplyashin
Karl Moritz Hermann
...
Matthew Koichi Grimes
Karen Simonyan
Koray Kavukcuoglu
Andrew Zisserman
R. Hadsell
3DV
75
66
0
04 Mar 2019
Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments
Zhizheng Zhang
Jiale Chen
Zhibo Chen
Weiping Li
OffRL
93
61
0
03 Mar 2019
Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research
Joel Z Leibo
Edward Hughes
Marc Lanctot
T. Graepel
94
110
0
02 Mar 2019
Learning To Follow Directions in Street View
Karl Moritz Hermann
Mateusz Malinowski
Piotr Wojciech Mirowski
Andras Banki-Horvath
Keith Anderson
R. Hadsell
SSL
79
69
0
01 Mar 2019
Model-Based Reinforcement Learning for Atari
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
...
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
OffRL
225
871
0
01 Mar 2019
Neural Packet Classification
Eric Liang
Hang Zhu
Xin Jin
Ion Stoica
OffRL
78
122
0
27 Feb 2019
Leveraging Communication Topologies Between Learning Agents in Deep Reinforcement Learning
D. Adjodah
D. Calacci
Abhimanyu Dubey
Anirudh Goyal
P. Krafft
Esteban Moro Egido
Alex Pentland
AI4CE
75
8
0
16 Feb 2019
Neural-encoding Human Experts' Domain Knowledge to Warm Start Reinforcement Learning
Andrew Silva
Matthew C. Gombolay
OffRL
74
20
0
15 Feb 2019
Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
Devin Schwab
Tobias Springenberg
M. Martins
Thomas Lampe
Michael Neunert
A. Abdolmaleki
Tim Hertweck
Roland Hafner
F. Nori
Martin Riedmiller
76
22
0
13 Feb 2019
Contextual Recurrent Neural Networks
Sam Wenke
J. Fleming
36
6
0
09 Feb 2019
Metaoptimization on a Distributed System for Deep Reinforcement Learning
Greg Heinrich
I. Frosio
OffRL
28
2
0
07 Feb 2019
Distilling Policy Distillation
Wojciech M. Czarnecki
Razvan Pascanu
Simon Osindero
Siddhant M. Jayakumar
G. Swirszcz
Max Jaderberg
85
134
0
06 Feb 2019
The Hanabi Challenge: A New Frontier for AI Research
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
...
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
126
355
0
01 Feb 2019
TF-Replicator: Distributed Machine Learning for Researchers
P. Buchlovsky
David Budden
Dominik Grewe
Chris Jones
John Aslanides
...
Aidan Clark
Sergio Gomez Colmenarejo
Aedan Pope
Fabio Viola
Dan Belov
GNN
OffRL
AI4CE
81
20
0
01 Feb 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
130
370
0
30 Jan 2019
Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
André Barreto
Diana Borsa
John Quan
Tom Schaul
David Silver
Matteo Hessel
D. Mankowitz
Augustin Žídek
Rémi Munos
OffRL
117
165
0
30 Jan 2019
Benchmarking Classic and Learned Navigation in Complex 3D Environments
Dmytro Mishkin
Alexey Dosovitskiy
V. Koltun
137
75
0
30 Jan 2019
Ablation Studies in Artificial Neural Networks
Richard Meyes
Melanie Lu
Constantin Waubert de Puiseau
Tobias Meisen
69
218
0
24 Jan 2019
Causal Reasoning from Meta-reinforcement Learning
Ishita Dasgupta
Jane X. Wang
Silvia Chiappa
Jovana Mitrović
Pedro A. Ortega
David Raposo
Edward Hughes
Peter W. Battaglia
M. Botvinick
Z. Kurth-Nelson
CML
LRM
79
122
0
23 Jan 2019
Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target
J. F. Hernandez-Garcia
R. Sutton
77
63
0
22 Jan 2019
An investigation of model-free planning
A. Guez
M. Berk Mirza
Karol Gregor
Rishabh Kabra
S. Racanière
...
Laurent Orseau
Tom Eccles
Greg Wayne
David Silver
Timothy Lillicrap
OffRL
104
117
0
11 Jan 2019
Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions
Rui Wang
Joel Lehman
Jeff Clune
Kenneth O. Stanley
125
250
0
07 Jan 2019
Universal Successor Features Approximators
Diana Borsa
André Barreto
John Quan
D. Mankowitz
Rémi Munos
H. V. Hasselt
David Silver
Tom Schaul
91
117
0
18 Dec 2018
An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents
F. Such
Vashisht Madhavan
Rosanne Liu
Rui Wang
Pablo Samuel Castro
...
Jiale Zhi
Ludwig Schubert
Marc G. Bellemare
Jeff Clune
Joel Lehman
OffRL
83
54
0
17 Dec 2018
Malthusian Reinforcement Learning
Joel Z Leibo
Julien Perolat
Edward Hughes
S. Wheelwright
Adam H. Marblestone
Edgar A. Duénez-Guzmán
P. Sunehag
Iain Dunning
T. Graepel
AI4CE
103
38
0
17 Dec 2018
Dopamine: A Research Framework for Deep Reinforcement Learning
Pablo Samuel Castro
Subhodeep Moitra
Carles Gelada
Saurabh Kumar
Marc G. Bellemare
OffRL
86
279
0
14 Dec 2018
Scaling shared model governance via model splitting
Miljan Martic
Jan Leike
Andrew Trask
Matteo Hessel
Shane Legg
Pushmeet Kohli
FedML
64
2
0
14 Dec 2018
Learning Montezuma's Revenge from a Single Demonstration
Tim Salimans
Richard J. Chen
129
139
0
08 Dec 2018
Quantifying Generalization in Reinforcement Learning
K. Cobbe
Oleg Klimov
Christopher Hesse
Taehoon Kim
John Schulman
OffRL
173
680
0
06 Dec 2018
Towards a Definition of Disentangled Representations
I. Higgins
David Amos
David Pfau
S. Racanière
Loic Matthey
Danilo Jimenez Rezende
Alexander Lerchner
OCL
DRL
148
481
0
05 Dec 2018
Adapting Auxiliary Losses Using Gradient Similarity
Yunshu Du
Wojciech M. Czarnecki
Siddhant M. Jayakumar
Mehrdad Farajtabar
Razvan Pascanu
Balaji Lakshminarayanan
132
159
0
05 Dec 2018
Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures
Junhui Yin
Jiayan Qiu
Csaba Szepesvári
Siqing Zhang
Avraham Ruderman
Jiyang Xie
Krishnamurthy Dvijotham
Zhanyu Ma
N. Heess
Pushmeet Kohli
AAML
107
82
0
04 Dec 2018
CompILE: Compositional Imitation Learning and Execution
Thomas Kipf
Yujia Li
H. Dai
V. Zambaldi
Alvaro Sanchez-Gonzalez
Edward Grefenstette
Pushmeet Kohli
Peter W. Battaglia
VLM
93
14
0
04 Dec 2018
Using Monte Carlo Tree Search as a Demonstrator within Asynchronous Deep RL
Bilal Kartal
Pablo Hernandez-Leal
Matthew E. Taylor
OffRL
122
9
0
30 Nov 2018
Experience Replay for Continual Learning
David Rolnick
Arun Ahuja
Jonathan Richard Schwarz
Timothy Lillicrap
Greg Wayne
CLL
153
1,179
0
28 Nov 2018
Unsupervised Control Through Non-Parametric Discriminative Rewards
David Warde-Farley
T. Wiele
Tejas D. Kulkarni
Catalin Ionescu
Steven Hansen
Volodymyr Mnih
DRL
OffRL
SSL
101
178
0
28 Nov 2018
PNS: Population-Guided Novelty Search for Reinforcement Learning in Hard Exploration Environments
Qihao Liu
Yujia Wang
Xiao-Fei Liu
84
8
0
26 Nov 2018
Hierarchical visuomotor control of humanoids
J. Merel
Arun Ahuja
Vu Pham
S. Tunyasuvunakool
Siqi Liu
Dhruva Tirumala
N. Heess
Greg Wayne
117
97
0
23 Nov 2018
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
124
421
0
19 Nov 2018
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Lars Buesing
T. Weber
Yori Zwols
S. Racanière
A. Guez
Jean-Baptiste Lespiau
N. Heess
CML
121
138
0
15 Nov 2018
Evolving intrinsic motivations for altruistic behavior
Jane X. Wang
Edward Hughes
Chrisantha Fernando
Wojciech M. Czarnecki
Edgar A. Duénez-Guzmán
Joel Z Leibo
90
78
0
14 Nov 2018
Importance Weighted Evolution Strategies
Victor Campos
Xavier Giró-i-Nieto
Jordi Torres
46
1
0
12 Nov 2018
Online Off-policy Prediction
Sina Ghiassian
D. Paul
M. Fasoulakis
R. Sutton
Adam White
OffRL
148
23
0
06 Nov 2018
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
H. F. Song
Edward Hughes
Neil Burch
Iain Dunning
Shimon Whiteson
M. Botvinick
Michael Bowling
94
149
0
04 Nov 2018
Previous
1
2
3
...
18
19
20
Next