arXiv:1802.01561
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Papers citing
"IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 981 papers shown
TF-Replicator: Distributed Machine Learning for Researchers
P. Buchlovsky
David Budden
Dominik Grewe
Chris Jones
John Aslanides
...
Aidan Clark
Sergio Gomez Colmenarejo
Aedan Pope
Fabio Viola
Dan Belov
GNN
OffRL
AI4CE
37
20
0
01 Feb 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
24
362
0
30 Jan 2019
Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement
André Barreto
Diana Borsa
John Quan
Tom Schaul
David Silver
Matteo Hessel
D. Mankowitz
Augustin Žídek
Rémi Munos
OffRL
41
161
0
30 Jan 2019
Benchmarking Classic and Learned Navigation in Complex 3D Environments
Dmytro Mishkin
Alexey Dosovitskiy
V. Koltun
34
75
0
30 Jan 2019
Ablation Studies in Artificial Neural Networks
Richard Meyes
Melanie Lu
Constantin Waubert de Puiseau
Tobias Meisen
13
210
0
24 Jan 2019
Causal Reasoning from Meta-reinforcement Learning
Ishita Dasgupta
Jane X. Wang
Silvia Chiappa
Jovana Mitrović
Pedro A. Ortega
David Raposo
Edward Hughes
Peter W. Battaglia
M. Botvinick
Z. Kurth-Nelson
CML
LRM
20
120
0
23 Jan 2019
Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target
J. F. Hernandez-Garcia
R. Sutton
19
61
0
22 Jan 2019
An investigation of model-free planning
A. Guez
M. Berk Mirza
Karol Gregor
Rishabh Kabra
S. Racanière
...
Laurent Orseau
Tom Eccles
Greg Wayne
David Silver
Timothy Lillicrap
OffRL
30
111
0
11 Jan 2019
Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions
Rui Wang
Joel Lehman
Jeff Clune
Kenneth O. Stanley
47
240
0
07 Jan 2019
Universal Successor Features Approximators
Diana Borsa
André Barreto
John Quan
D. Mankowitz
Rémi Munos
H. V. Hasselt
David Silver
Tom Schaul
28
114
0
18 Dec 2018
An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents
F. Such
Vashisht Madhavan
Rosanne Liu
Rui Wang
Pablo Samuel Castro
...
Jiale Zhi
Ludwig Schubert
Marc G. Bellemare
Jeff Clune
Joel Lehman
OffRL
27
54
0
17 Dec 2018
Malthusian Reinforcement Learning
Joel Z Leibo
Julien Perolat
Edward Hughes
S. Wheelwright
Adam H. Marblestone
Edgar A. Duénez-Guzmán
P. Sunehag
Iain Dunning
T. Graepel
AI4CE
33
37
0
17 Dec 2018
Dopamine: A Research Framework for Deep Reinforcement Learning
Pablo Samuel Castro
Subhodeep Moitra
Carles Gelada
Saurabh Kumar
Marc G. Bellemare
OffRL
19
276
0
14 Dec 2018
Scaling shared model governance via model splitting
Miljan Martic
Jan Leike
Andrew Trask
Matteo Hessel
Shane Legg
Pushmeet Kohli
FedML
20
2
0
14 Dec 2018
Learning Montezuma's Revenge from a Single Demonstration
Tim Salimans
Richard J. Chen
42
136
0
08 Dec 2018
Quantifying Generalization in Reinforcement Learning
K. Cobbe
Oleg Klimov
Christopher Hesse
Taehoon Kim
John Schulman
OffRL
54
659
0
06 Dec 2018
Towards a Definition of Disentangled Representations
I. Higgins
David Amos
David Pfau
S. Racanière
Loic Matthey
Danilo Jimenez Rezende
Alexander Lerchner
OCL
DRL
41
471
0
05 Dec 2018
Adapting Auxiliary Losses Using Gradient Similarity
Yunshu Du
Wojciech M. Czarnecki
Siddhant M. Jayakumar
Mehrdad Farajtabar
Razvan Pascanu
Balaji Lakshminarayanan
35
155
0
05 Dec 2018
Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures
Junhui Yin
Jiayan Qiu
Csaba Szepesvári
Siqing Zhang
Avraham Ruderman
Jiyang Xie
Krishnamurthy Dvijotham
Zhanyu Ma
N. Heess
Pushmeet Kohli
AAML
15
80
0
04 Dec 2018
CompILE: Compositional Imitation Learning and Execution
Thomas Kipf
Yujia Li
H. Dai
V. Zambaldi
Alvaro Sanchez-Gonzalez
Edward Grefenstette
Pushmeet Kohli
Peter W. Battaglia
VLM
22
13
0
04 Dec 2018
Using Monte Carlo Tree Search as a Demonstrator within Asynchronous Deep RL
Bilal Kartal
Pablo Hernandez-Leal
Matthew E. Taylor
OffRL
33
9
0
30 Nov 2018
Experience Replay for Continual Learning
David Rolnick
Arun Ahuja
Jonathan Richard Schwarz
Timothy Lillicrap
Greg Wayne
CLL
19
1,114
0
28 Nov 2018
Unsupervised Control Through Non-Parametric Discriminative Rewards
David Warde-Farley
T. Wiele
Tejas D. Kulkarni
Catalin Ionescu
Steven Hansen
Volodymyr Mnih
DRL
OffRL
SSL
41
173
0
28 Nov 2018
PNS: Population-Guided Novelty Search for Reinforcement Learning in Hard Exploration Environments
Qihao Liu
Yujia Wang
Xiao-Fei Liu
22
8
0
26 Nov 2018
Hierarchical visuomotor control of humanoids
J. Merel
Arun Ahuja
Vu Pham
S. Tunyasuvunakool
Siqi Liu
Dhruva Tirumala
N. Heess
Greg Wayne
32
97
0
23 Nov 2018
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
34
397
0
19 Nov 2018
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Lars Buesing
T. Weber
Yori Zwols
S. Racanière
A. Guez
Jean-Baptiste Lespiau
N. Heess
CML
37
135
0
15 Nov 2018
Evolving intrinsic motivations for altruistic behavior
Jane X. Wang
Edward Hughes
Chrisantha Fernando
Wojciech M. Czarnecki
Edgar A. Duénez-Guzmán
Joel Z Leibo
24
75
0
14 Nov 2018
Importance Weighted Evolution Strategies
Victor Campos
Xavier Giró-i-Nieto
Jordi Torres
19
1
0
12 Nov 2018
Online Off-policy Prediction
Sina Ghiassian
D. Paul
M. Fasoulakis
R. Sutton
Adam White
OffRL
8
23
0
06 Nov 2018
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
H. F. Song
Edward Hughes
Neil Burch
Iain Dunning
Shimon Whiteson
M. Botvinick
Michael Bowling
16
148
0
04 Nov 2018
Exploration by Random Network Distillation
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
60
1,297
0
30 Oct 2018
RLgraph: Modular Computation Graphs for Deep Reinforcement Learning
Michael Schaarschmidt
Sven Mika
Kai Fricke
Eiko Yoneki
OffRL
23
5
0
21 Oct 2018
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
S. Srinivasan
Marc Lanctot
V. Zambaldi
Julien Perolat
K. Tuyls
Rémi Munos
Michael Bowling
8
148
0
21 Oct 2018
BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning
Maxime Chevalier-Boisvert
Dzmitry Bahdanau
Salem Lahlou
Lucas Willems
Chitwan Saharia
Thien Huu Nguyen
Yoshua Bengio
ELM
33
232
0
18 Oct 2018
Fast deep reinforcement learning using online adjustments from the past
Steven Hansen
Pablo Sprechmann
Alexander Pritzel
André Barreto
Charles Blundell
TTA
OffRL
OnRL
18
42
0
18 Oct 2018
At Human Speed: Deep Reinforcement Learning with Action Delay
Vlad Firoiu
Tina Ju
J. Tenenbaum
13
36
0
16 Oct 2018
Deep Reinforcement Learning
Yuxi Li
VLM
OffRL
28
144
0
15 Oct 2018
CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning
Cédric Colas
Pierre Fournier
Olivier Sigaud
Mohamed Chetouani
Pierre-Yves Oudeyer
33
39
0
15 Oct 2018
GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning
Jacky Liang
Viktor Makoviychuk
Ankur Handa
N. Chentanez
Miles Macklin
Dieter Fox
AI4CE
27
182
0
12 Oct 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
43
551
0
12 Oct 2018
One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
T. Paine
Sergio Gomez Colmenarejo
Ziyun Wang
Scott E. Reed
Y. Aytar
...
Matthew W. Hoffman
Gabriel Barth-Maron
Serkan Cabi
David Budden
Nando de Freitas
OffRL
19
26
0
11 Oct 2018
Episodic Curiosity through Reachability
Nikolay Savinov
Anton Raichuk
Raphaël Marinier
Damien Vincent
Marc Pollefeys
Timothy Lillicrap
Sylvain Gelly
14
266
0
04 Oct 2018
Efficient Dialog Policy Learning via Positive Memory Retention
Rui Zhao
Volker Tresp
14
10
0
02 Oct 2018
Generalization and Regularization in DQN
Jesse Farebrother
Marlos C. Machado
Michael Bowling
30
203
0
29 Sep 2018
Relational Forward Models for Multi-Agent Learning
Andrea Tacchetti
H. F. Song
P. Mediano
V. Zambaldi
Neil C. Rabinowitz
T. Graepel
M. Botvinick
Peter W. Battaglia
AI4CE
19
77
0
28 Sep 2018
Sim-to-Real Transfer of Robot Learning with Variable Length Inputs
Vibhavari Dasagi
Robert Lee
Serena Mou
Jake Bruce
Niko Sünderhauf
Jurgen Leitner
OffRL
25
3
0
20 Sep 2018
Multi-task Deep Reinforcement Learning with PopArt
Matteo Hessel
Hubert Soyer
L. Espeholt
Wojciech M. Czarnecki
Simon Schmitt
H. V. Hasselt
11
314
0
12 Sep 2018
Unity: A General Platform for Intelligent Agents
Arthur Juliani
Vincent-Pierre Berges
Esh Vckay
Andrew Cohen
Jonathan Harper
...
Chris Goy
Yuan Gao
Hunter Henry
Marwan Mattar
Danny Lange
21
808
0
07 Sep 2018
LIFT: Reinforcement Learning in Computer Systems by Learning From Demonstrations
Michael Schaarschmidt
A. Kuhnle
Ben Ellis
Kai Fricke
Felix Gessert
Eiko Yoneki
OffRL
32
41
0
23 Aug 2018