1802.01561
Cited By
v1
v2
v3 (latest)
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Papers citing
"IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 1,000 papers shown
Title
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems
Vinicius G. Goecks
117
11
0
30 Aug 2020
Deep Reinforcement Learning for Field Development Optimization
Y. Nasir
31
1
0
05 Aug 2020
WordCraft: An Environment for Benchmarking Commonsense Agents
Minqi Jiang
Jelena Luketina
Nantas Nardelli
Pasquale Minervini
Philip Torr
Shimon Whiteson
Tim Rocktaschel
LLMAG
OffRL
49
23
0
17 Jul 2020
Discovering Reinforcement Learning Algorithms
Junhyuk Oh
Matteo Hessel
Wojciech M. Czarnecki
Zhongwen Xu
H. V. Hasselt
Satinder Singh
David Silver
91
129
0
17 Jul 2020
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
92
78
0
16 Jul 2020
Distributed Reinforcement Learning of Targeted Grasping with Active Vision for Mobile Manipulators
Yasuhiro Fujita
Kota Uenishi
Avinash Ummadisingu
P. Nagarajan
Shimpei Masuda
M. Castro
84
18
0
16 Jul 2020
Lifelong Learning using Eigentasks: Task Separation, Skill Acquisition, and Selective Transfer
Aswin Raghavan
Jesse Hostetler
Indranil Sur
Abrar Rahman
Ajay Divakaran
CLL
45
7
0
14 Jul 2020
Relational-Grid-World: A Novel Relational Reasoning Environment and An Agent Model for Relational Information Extraction
Faruk Küçüksubasi
Elif Surer
34
2
0
12 Jul 2020
Learning Retrospective Knowledge with Reverse Reinforcement Learning
Shangtong Zhang
Vivek Veeriah
Shimon Whiteson
OffRL
AI4TS
76
13
0
09 Jul 2020
Tracking-by-Trackers with a Distilled and Reinforced Model
Matteo Dunnhofer
N. Martinel
C. Micheloni
VOT
OffRL
64
4
0
08 Jul 2020
Guided Exploration with Proximal Policy Optimization using a Single Demonstration
Gabriele Libardi
Gianni De Fabritiis
58
24
0
07 Jul 2020
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Joshua Romoff
Peter Henderson
David Kanaa
Emmanuel Bengio
Ahmed Touati
Pierre-Luc Bacon
Joelle Pineau
66
3
0
06 Jul 2020
Scaling Imitation Learning in Minecraft
Artemij Amiranashvili
Nicolai Dorka
Wolfram Burgard
V. Koltun
Thomas Brox
MLAU
55
15
0
06 Jul 2020
Integrating Distributed Architectures in Highly Modular RL Libraries
Albert Bou
Sebastian Dittert
Gianni De Fabritiis
76
0
0
06 Jul 2020
Verifiably Safe Exploration for End-to-End Reinforcement Learning
Nathan Hunt
Nathan Fulton
Sara Magliacane
Nghia Hoang
Subhro Das
Armando Solar-Lezama
OffRL
85
52
0
02 Jul 2020
Gradient Temporal-Difference Learning with Regularized Corrections
Sina Ghiassian
Andrew Patterson
Shivam Garg
Dhawal Gupta
Adam White
Martha White
177
42
0
01 Jul 2020
Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
Adam Stooke
Valentin Dalibard
Siddhant M. Jayakumar
Wojciech M. Czarnecki
Max Jaderberg
61
1
0
26 Jun 2020
The NetHack Learning Environment
Heinrich Küttler
Nantas Nardelli
Alexander H. Miller
Roberta Raileanu
Marco Selvatici
Edward Grefenstette
Tim Rocktaschel
128
181
0
24 Jun 2020
Local Stochastic Approximation: A Unified View of Federated Learning and Distributed Multi-Task Reinforcement Learning Algorithms
Thinh T. Doan
FedML
64
10
0
24 Jun 2020
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
105
58
0
23 Jun 2020
Automatic Data Augmentation for Generalization in Deep Reinforcement Learning
Roberta Raileanu
M. Goldstein
Denis Yarats
Ilya Kostrikov
Rob Fergus
OffRL
65
110
0
23 Jun 2020
Learning with AMIGo: Adversarially Motivated Intrinsic Goals
Andres Campero
Roberta Raileanu
Heinrich Küttler
J. Tenenbaum
Tim Rocktaschel
Edward Grefenstette
114
127
0
22 Jun 2020
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning
Aleksei Petrenko
Zhehui Huang
T. Kumar
Gaurav Sukhatme
V. Koltun
113
105
0
21 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
81
19
0
14 Jun 2020
Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Yunhao Tang
K. Choromanski
OffRL
41
14
0
13 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
121
24
0
12 Jun 2020
A Brief Look at Generalization in Visual Meta-Reinforcement Learning
Safa Alver
Doina Precup
OffRL
49
8
0
12 Jun 2020
A Practical Sparse Approximation for Real Time Recurrent Learning
Jacob Menick
Erich Elsen
Utku Evci
Simon Osindero
Karen Simonyan
Alex Graves
94
32
0
12 Jun 2020
Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
150
170
0
12 Jun 2020
Systematic Generalisation through Task Temporal Logic and Deep Reinforcement Learning
Borja G. Leon
Murray Shanahan
Francesco Belardinelli
NAI
AI4CE
95
29
0
12 Jun 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
Matthieu Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
95
226
0
10 Jun 2020
Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning
Maximilian Igl
Gregory Farquhar
Jelena Luketina
Wendelin Boehmer
Shimon Whiteson
139
88
0
10 Jun 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Thomas W. Anthony
Tom Eccles
Andrea Tacchetti
János Kramár
I. Gemp
...
Richard Everett
Roman Werpachowski
Satinder Singh
T. Graepel
Yoram Bachrach
106
43
0
08 Jun 2020
A Decentralized Policy Gradient Approach to Multi-task Reinforcement Learning
Sihan Zeng
Aqeel Anwar
Thinh T. Doan
A. Raychowdhury
Justin Romberg
88
40
0
08 Jun 2020
Rapid Task-Solving in Novel Environments
Samuel Ritter
Ryan Faulkner
Laurent Sartran
Adam Santoro
M. Botvinick
David Raposo
74
29
0
05 Jun 2020
Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization
T. Matsushima
Hiroki Furuta
Y. Matsuo
Ofir Nachum
S. Gu
OffRL
126
150
0
05 Jun 2020
Learning Neural Light Transport
Paul Sanzenbacher
L. Mescheder
Andreas Geiger
34
7
0
05 Jun 2020
Visual Transfer for Reinforcement Learning via Wasserstein Domain Confusion
Josh Roy
George Konidaris
78
16
0
04 Jun 2020
Probing Emergent Semantics in Predictive Agents via Question Answering
Abhishek Das
Federico Carnevale
Hamza Merzic
Laura Rimell
R. Schneider
...
Alden Hung
Arun Ahuja
S. Clark
Greg Wayne
Felix Hill
91
18
0
01 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
143
226
0
01 Jun 2020
Korali: Efficient and Scalable Software Framework for Bayesian Uncertainty Quantification and Stochastic Optimization
Sergio M. Martin
Daniel Wälchli
G. Arampatzis
Athena Economides
Petr Karnakov
Petros Koumoutsakos
31
15
0
27 May 2020
Policy Entropy for Out-of-Distribution Classification
Andreas Sedlmeier
Robert Muller
Steffen Illium
Claudia Linnhoff-Popien
OODD
OffRL
59
14
0
25 May 2020
Evaluating Generalisation in General Video Game Playing
Martin Balla
Simon Lucas
Diego Perez-Liebana
42
2
0
22 May 2020
Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text
Felix Hill
Soňa Mokrá
Nathaniel Wong
Tim Harley
LM&Ro
101
82
0
19 May 2020
Playing Minecraft with Behavioural Cloning
Anssi Kanervisto
Janne Karttunen
Ville Hautamaki
72
12
0
07 May 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
586
2,052
0
04 May 2020
Reinforcement Learning with Augmented Data
Michael Laskin
Kimin Lee
Adam Stooke
Lerrel Pinto
Pieter Abbeel
A. Srinivas
OffRL
168
661
0
30 Apr 2020
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Z. Guo
Bernardo Avila-Pires
Bilal Piot
Jean-Bastien Grill
Florent Altché
Rémi Munos
M. G. Azar
BDL
DRL
SSL
192
143
0
30 Apr 2020
Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels
Ilya Kostrikov
Denis Yarats
Rob Fergus
OffRL
206
794
0
28 Apr 2020
Multi-Task Learning for Dense Prediction Tasks: A Survey
Simon Vandenhende
Stamatios Georgoulis
Wouter Van Gansbeke
Marc Proesmans
Dengxin Dai
Luc Van Gool
CVBM
71
73
0
28 Apr 2020