Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.01561
Cited By
v1
v2
v3 (latest)
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 1,000 papers shown
Title
Divergence-Regularized Multi-Agent Actor-Critic
Kefan Su
Zongqing Lu
157
27
0
01 Oct 2021
Genealogical Population-Based Training for Hyperparameter Optimization
Antoine Scardigli
P. Fournier
Matteo Vilucchio
D. Naccache
GP
42
0
0
30 Sep 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
138
60
0
28 Sep 2021
Faster Improvement Rate Population Based Training
Valentin Dalibard
Max Jaderberg
69
13
0
28 Sep 2021
Learning to Superoptimize Real-world Programs
Alex Shypula
Pengcheng Yin
Jeremy Lacomis
Claire Le Goues
Edward N. Schwartz
Graham Neubig
NAI
148
10
0
28 Sep 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
319
91
0
27 Sep 2021
Applying supervised and reinforcement learning methods to create neural-network-based agents for playing StarCraft II
Michal Opanowicz
34
0
0
26 Sep 2021
Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients
Baturay Saglam
Furkan B. Mutlu
Dogan C. Cicek
Suleyman S. Kozat
OffRL
53
3
0
24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
63
34
0
24 Sep 2021
On Bonus-Based Exploration Methods in the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
59
61
0
22 Sep 2021
Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods
Baturay Saglam
Enes Duran
Dogan C. Cicek
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
73
12
0
22 Sep 2021
Autonomous Blimp Control using Deep Reinforcement Learning
Y. Liu
Eric Price
Pascal Goldschmid
Michael J. Black
Aamir Ahmad
AI4CE
56
3
0
22 Sep 2021
Learning Natural Language Generation from Scratch
Alice Martin Donati
Guillaume Quispe
Charles Ollion
Sylvain Le Corff
Florian Strub
Olivier Pietquin
LRM
58
4
0
20 Sep 2021
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Chris Cummins
Bram Wasti
Jiadong Guo
Brandon Cui
Jason Ansel
...
Jia-Wei Liu
O. Teytaud
Benoit Steiner
Yuandong Tian
Hugh Leather
80
76
0
17 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
91
77
0
16 Sep 2021
Direct Advantage Estimation
Hsiao-Ru Pan
Nico Gürtler
Alexander Neitz
Bernhard Schölkopf
OffRL
CML
62
13
0
13 Sep 2021
An Empirical Comparison of Off-policy Prediction Learning Algorithms in the Four Rooms Environment
Sina Ghiassian
R. Sutton
AAML
OffRL
102
6
0
10 Sep 2021
Bootstrapped Meta-Learning
Sebastian Flennerhag
Yannick Schroecker
Tom Zahavy
Hado van Hasselt
David Silver
Satinder Singh
95
58
0
09 Sep 2021
An Exploration of Deep Learning Methods in Hungry Geese
Nikzad Khani
Matthew Kluska
27
0
0
05 Sep 2021
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms
Ruizhi Chen
Xiaoyu Wu
Yansong Pan
Kaizhao Yuan
Ling Li
...
Shaohui Peng
Xishan Zhang
Zidong Du
Qi Guo
Yunji Chen
OffRL
61
3
0
04 Sep 2021
Boosting Search Engines with Interactive Agents
Leonard Adolphs
Benjamin Boerschinger
Christian Buck
Michelle Chen Huebscher
Massimiliano Ciaramita
...
Thomas Hofmann
Yannic Kilcher
Sascha Rothe
Pier Giuseppe Sessa
Lierni Sestorain Saralegui
LLMAG
147
24
0
01 Sep 2021
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation
Tiantian Zhang
Xueqian Wang
Bin Liang
Bo Yuan
OffRL
80
18
0
01 Sep 2021
WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU
Tian-Shing Lan
Sunil Srinivasa
Huan Wang
Stephan Zheng
AI4CE
70
13
0
31 Aug 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
218
680
0
30 Aug 2021
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges
Jiaju Qi
Qihao Zhou
Lei Lei
Kan Zheng
FedML
113
161
0
26 Aug 2021
Truncated Emphatic Temporal Difference Methods for Prediction and Control
Shangtong Zhang
Shimon Whiteson
OffRL
79
12
0
11 Aug 2021
Rethinking of AlphaStar
Ruoxi Liu
66
2
0
07 Aug 2021
Learning more skills through optimistic exploration
D. Strouse
Kate Baumli
David Warde-Farley
Vlad Mnih
Steven Hansen
SSL
105
46
0
29 Jul 2021
Lyapunov-based uncertainty-aware safe reinforcement learning
Ashkan B. Jeddi
Nariman L. Dehghani
A. Shafieezadeh
69
7
0
29 Jul 2021
Accelerating Quadratic Optimization with Reinforcement Learning
Jeffrey Ichnowski
Paras Jain
Bartolomeo Stellato
G. Banjac
Michael Luo
Francesco Borrelli
Joseph E. Gonzalez
Ion Stoica
Ken Goldberg
OffRL
85
36
0
22 Jul 2021
Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration
Lukas Schafer
Filippos Christianos
Josiah P. Hanna
Stefano V. Albrecht
92
23
0
19 Jul 2021
Megaverse: Simulating Embodied Agents at One Million Experiences per Second
Aleksei Petrenko
Erik Wijmans
Brennan Shacklett
V. Koltun
LM&Ro
VGen
88
24
0
17 Jul 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Joel Z Leibo
Edgar A. Duénez-Guzmán
A. Vezhnevets
J. Agapiou
P. Sunehag
Raphael Köster
Jayd Matyas
Charlie Beattie
Igor Mordatch
T. Graepel
OffRL
95
111
0
14 Jul 2021
Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Sungryull Sohn
Sungtae Lee
Jongwook Choi
H. V. Seijen
Mehdi Fatemi
Honglak Lee
321
5
0
13 Jul 2021
Learning Expected Emphatic Traces for Deep RL
Ray Jiang
Shangtong Zhang
Veronica Chelu
Adam White
Hado van Hasselt
OffRL
69
12
0
12 Jul 2021
Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers
Ruihan Yang
Minghao Zhang
Nicklas Hansen
Huazhe Xu
Xiaolong Wang
OffRL
99
108
0
08 Jul 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
152
67
0
08 Jul 2021
RRL: Resnet as representation for Reinforcement Learning
Rutav Shah
Vikash Kumar
OffRL
109
115
0
07 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
77
6
0
07 Jul 2021
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning
Erdun Gao
Fan Feng
Chaochao Lu
Sara Magliacane
Kun Zhang
109
69
0
06 Jul 2021
Agents that Listen: High-Throughput Reinforcement Learning with Multiple Sensory Systems
Shashank Hegde
Anssi Kanervisto
Aleksei Petrenko
VLM
70
9
0
05 Jul 2021
MixStyle Neural Networks for Domain Generalization and Adaptation
Kaiyang Zhou
Yongxin Yang
Yu Qiao
Tao Xiang
OOD
TTA
79
87
0
05 Jul 2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Assaf Hallak
Gal Dalal
Steven Dalton
I. Frosio
Shie Mannor
Gal Chechik
OffRL
OnRL
103
10
0
04 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
125
137
0
01 Jul 2021
Tuning Mixed Input Hyperparameters on the Fly for Efficient Population Based AutoRL
Jack Parker-Holder
Vu Nguyen
Shaan Desai
Stephen J. Roberts
78
18
0
30 Jun 2021
The Values Encoded in Machine Learning Research
Abeba Birhane
Pratyusha Kalluri
Dallas Card
William Agnew
Ravit Dotan
Michelle Bao
93
295
0
29 Jun 2021
Generalization of Reinforcement Learning with Policy-Aware Adversarial Data Augmentation
Hanping Zhang
Yuhong Guo
88
23
0
29 Jun 2021
Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
I. Kanitscheider
Joost Huizinga
David Farhi
William H. Guss
Brandon Houghton
...
Bowen Baker
Adrien Ecoffet
Jie Tang
Oleg Klimov
Jeff Clune
85
22
0
28 Jun 2021
Habitat 2.0: Training Home Assistants to Rearrange their Habitat
Andrew Szot
Alexander Clegg
Eric Undersander
Erik Wijmans
Yili Zhao
...
Z. Kira
V. Koltun
Jitendra Malik
Manolis Savva
Dhruv Batra
LM&Ro
123
528
0
28 Jun 2021
Graph Convolutional Memory using Topological Priors
Steven D. Morad
Stephan Liwicki
Ryan Kortvelesy
R. Mecca
Amanda Prorok
36
0
0
27 Jun 2021
Previous
1
2
3
...
10
11
12
...
18
19
20
Next