1802.01561
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 1,000 papers shown
Title
The Animal-AI Environment: Training and Testing Animal-Like Artificial Cognition
Benjamin Beyret
José Hernández-Orallo
Lucy G. Cheke
Marta Halina
Murray Shanahan
Matthew Crosby
91
35
0
12 Sep 2019
Discovery of Useful Questions as Auxiliary Tasks
Vivek Veeriah
Matteo Hessel
Zhongwen Xu
Richard L. Lewis
Janarthanan Rajendran
Junhyuk Oh
H. V. Hasselt
David Silver
Satinder Singh
LLMAG
81
85
0
10 Sep 2019
Logic and the 2-Simplicial Transformer
James Clift
D. Doryn
Daniel Murfet
James Wallbridge
NAI
48
3
0
02 Sep 2019
An Open-Source Framework for Adaptive Traffic Signal Control
Wade Genders
S. Razavi
59
29
0
01 Sep 2019
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence
Lingxiao Wang
Qi Cai
Zhuoran Yang
Zhaoran Wang
113
242
0
29 Aug 2019
Dynamics-aware Embeddings
William F. Whitney
Rajat Agarwal
Kyunghyun Cho
Abhinav Gupta
SSL
95
53
0
25 Aug 2019
Feature Partitioning for Efficient Multi-Task Architectures
Alejandro Newell
Lu Jiang
Chong-Jun Wang
Li Li
Jia Deng
89
17
0
12 Aug 2019
Free-Lunch Saliency via Attention in Atari Agents
Dmitry Nikulin
A. Ianina
Vladimir Aliev
Sergey I. Nikolenko
FAtt
73
24
0
07 Aug 2019
Benchmarking Bonus-Based Exploration Methods on the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
87
41
0
06 Aug 2019
Google Research Football: A Novel Reinforcement Learning Environment
Karol Kurach
Anton Raichuk
Piotr Stańczyk
Michal Zajac
Olivier Bachem
...
C. Riquelme
Damien Vincent
Marcin Michalski
Olivier Bousquet
Sylvain Gelly
225
410
0
25 Jul 2019
Variance Reduction in Actor Critic Methods (ACM)
Eric Benhamou
OffRL
57
4
0
23 Jul 2019
Accelerating Reinforcement Learning through GPU Atari Emulation
Steven Dalton
I. Frosio
M. Garland
ELM
58
9
0
19 Jul 2019
Proximal Policy Optimization with Mixed Distributed Training
Zhenyu Zhang
Xiangfeng Luo
Tong Liu
Shaorong Xie
Jianshu Wang
Wei Wang
Yongbin Li
Yan Peng
OffRL
41
21
0
15 Jul 2019
Learning Safe Unlabeled Multi-Robot Planning with Motion Constraints
Arbaaz Khan
Chi Zhang
Shuo Li
J. Wu
Brent Schlotfeldt
Sarah Tang
Alejandro Ribeiro
Osbert Bastani
Vijay Kumar
66
29
0
11 Jul 2019
On Inductive Biases in Deep Reinforcement Learning
Matteo Hessel
H. V. Hasselt
Joseph Modayil
David Silver
AI4CE
83
41
0
05 Jul 2019
Attentive Multi-Task Deep Reinforcement Learning
Timo Bram
Gino Brunner
Oliver Richter
Roger Wattenhofer
CLL
146
18
0
05 Jul 2019
Generalizing from a few environments in safety-critical reinforcement learning
Zachary Kenton
Angelos Filos
Owain Evans
Y. Gal
87
16
0
02 Jul 2019
Growing Action Spaces
Gregory Farquhar
Laura Gustafson
Zeming Lin
Shimon Whiteson
Nicolas Usunier
Gabriel Synnaeve
77
38
0
28 Jun 2019
Learning Policies through Quantile Regression
Oliver Richter
Roger Wattenhofer
51
0
0
27 Jun 2019
Compositional Transfer in Hierarchical Reinforcement Learning
Markus Wulfmeier
A. Abdolmaleki
Roland Hafner
Jost Tobias Springenberg
Michael Neunert
Tim Hertweck
Thomas Lampe
Noah Y. Siegel
N. Heess
Martin Riedmiller
111
27
0
26 Jun 2019
Shaping Belief States with Generative Environment Models for RL
Karol Gregor
Danilo Jimenez Rezende
F. Besse
Yan Wu
Hamza Merzic
Aaron van den Oord
OffRL
AI4CE
131
119
0
21 Jun 2019
Cross-View Policy Learning for Street Navigation
Ang Li
Huiyi Hu
Piotr Wojciech Mirowski
Mehrdad Farajtabar
92
27
0
13 Jun 2019
Fast Task Inference with Variational Intrinsic Successor Features
Steven Hansen
Will Dabney
André Barreto
T. Wiele
David Warde-Farley
Volodymyr Mnih
BDL
100
152
0
12 Jun 2019
Reinforcement Learning for Integer Programming: Learning to Cut
Yunhao Tang
Shipra Agrawal
Yuri Faenza
AI4CE
111
174
0
11 Jun 2019
Importance Resampling for Off-policy Prediction
M. Schlegel
Wesley Chung
Daniel Graves
Jian Qian
Martha White
OffRL
57
41
0
11 Jun 2019
Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Mahmoud Assran
Joshua Romoff
Nicolas Ballas
Joelle Pineau
Michael G. Rabbat
64
33
0
09 Jun 2019
Empirical Likelihood for Contextual Bandits
Nikos Karampatziakis
John Langford
Paul Mineiro
OffRL
136
9
0
07 Jun 2019
Towards Interpretable Reinforcement Learning Using Attention Augmented Agents
Alex Mott
Daniel Zoran
Mike Chrzanowski
Daan Wierstra
Danilo Jimenez Rezende
74
192
0
06 Jun 2019
How to Initialize your Network? Robust Initialization for WeightNorm & ResNets
Devansh Arpit
Victor Campos
Yoshua Bengio
83
59
0
05 Jun 2019
Options as responses: Grounding behavioural hierarchies in multi-agent RL
A. Vezhnevets
Yuhuai Wu
Rémi Leblond
Joel Z Leibo
AI4CE
92
17
0
04 Jun 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
156
1,070
0
03 Jun 2019
Neural Replicator Dynamics
Daniel Hennes
Dustin Morrill
Shayegan Omidshafiei
Rémi Munos
Julien Perolat
...
A. Gruslys
Jean-Baptiste Lespiau
Paavo Parmas
Edgar A. Duéñez-Guzmán
K. Tuyls
74
16
0
01 Jun 2019
Interval timing in deep reinforcement learning agents
B. Deverett
Ryan Faulkner
Meire Fortunato
Greg Wayne
Joel Z Leibo
47
14
0
31 May 2019
Unsupervised Model Selection for Variational Disentangled Representation Learning
Sunny Duan
Loic Matthey
Andre Saraiva
Nicholas Watters
Christopher P. Burgess
Alexander Lerchner
I. Higgins
OOD
DRL
96
80
0
29 May 2019
An Explicitly Relational Neural Network Architecture
Murray Shanahan
Kyriacos Nikiforou
Antonia Creswell
Christos Kaplanis
David Barrett
M. Garnelo
NAI
3DV
GAN
80
69
0
24 May 2019
Combining Experience Replay with Exploration by Random Network Distillation
Francesco Sovrano
61
15
0
18 May 2019
Optimizing Sequential Medical Treatments with Auto-Encoding Heuristic Search in POMDPs
Luchen Li
Matthieu Komorowski
Aldo A. Faisal
OffRL
139
13
0
17 May 2019
Trajectory-Based Off-Policy Deep Reinforcement Learning
Andreas Doerr
Michael Volpp
Marc Toussaint
Sebastian Trimpe
Christian Daniel
OffRL
66
2
0
14 May 2019
Smoothing Policies and Safe Policy Gradients
Matteo Papini
Matteo Pirotta
Marcello Restelli
80
31
0
08 May 2019
Reinforced Genetic Algorithm Learning for Optimizing Computation Graphs
Aditya Sanjay Paliwal
Felix Gimeno
Vinod Nair
Yujia Li
Miles Lubin
Pushmeet Kohli
Oriol Vinyals
OffRL
GNN
99
67
0
07 May 2019
Dimension-Wise Importance Sampling Weight Clipping for Sample-Efficient Reinforcement Learning
Seungyul Han
Y. Sung
OffRL
65
20
0
07 May 2019
Information asymmetry in KL-regularized RL
Alexandre Galashov
Siddhant M. Jayakumar
Leonard Hasenclever
Dhruva Tirumala
Jonathan Richard Schwarz
Guillaume Desjardins
Wojciech M. Czarnecki
Yee Whye Teh
Razvan Pascanu
N. Heess
OffRL
67
104
0
03 May 2019
Challenges of Real-World Reinforcement Learning
Gabriel Dulac-Arnold
D. Mankowitz
Todd Hester
OffRL
125
553
0
29 Apr 2019
Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Tom Schaul
Diana Borsa
Joseph Modayil
Razvan Pascanu
79
63
0
25 Apr 2019
Towards Combining On-Off-Policy Methods for Real-World Applications
Kai-Chun Hu
Chen-Huan Pi
Ting Han Wei
I-Chen Wu
Stone Cheng
Yi-Wei Dai
Wei-Yuan Ye
OffRL
33
2
0
24 Apr 2019
Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning
Yuji Kanagawa
Tomoyuki Kaneko
73
14
0
17 Apr 2019
Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL
Yannis Flet-Berliac
Philippe Preux
47
2
0
08 Apr 2019
Creating Pro-Level AI for a Real-Time Fighting Game Using Deep Reinforcement Learning
In-Suk Oh
Seungeun Rho
Sangbin Moon
Seongho Son
Hyoil Lee
Jinyun Chung
98
54
0
08 Apr 2019
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer
E. Beeching
Christian Wolf
J. Dibangoye
Olivier Simonin
OffRL
LRM
76
25
0
03 Apr 2019
Meta-Learning surrogate models for sequential decision making
Alexandre Galashov
Jonathan Richard Schwarz
Hyunjik Kim
M. Garnelo
D. Saxton
Pushmeet Kohli
S. M. Ali Eslami
Yee Whye Teh
BDL
OffRL
95
25
0
28 Mar 2019