1802.01561
Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Papers citing
"IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 982 papers shown
An Exploration of Deep Learning Methods in Hungry Geese
Nikzad Khani
Matthew Kluska
17
0
0
05 Sep 2021
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms
Ruizhi Chen
Xiaoyu Wu
Yansong Pan
Kaizhao Yuan
Ling Li
...
Shaohui Peng
Xishan Zhang
Zidong Du
Qi Guo
Yunji Chen
OffRL
39
3
0
04 Sep 2021
Boosting Search Engines with Interactive Agents
Leonard Adolphs
Benjamin Boerschinger
Christian Buck
Michelle Chen Huebscher
Massimiliano Ciaramita
...
Thomas Hofmann
Yannic Kilcher
Sascha Rothe
Pier Giuseppe Sessa
Lierni Sestorain Saralegui
LLMAG
31
24
0
01 Sep 2021
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation
Tiantian Zhang
Xueqian Wang
Bin Liang
Bo Yuan
OffRL
40
18
0
01 Sep 2021
WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU
Tian-Shing Lan
Sunil Srinivasa
Huan Wang
Stephan Zheng
AI4CE
24
13
0
31 Aug 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
61
639
0
30 Aug 2021
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges
Jiaju Qi
Qihao Zhou
Lei Lei
Kan Zheng
FedML
31
146
0
26 Aug 2021
Truncated Emphatic Temporal Difference Methods for Prediction and Control
Shangtong Zhang
Shimon Whiteson
OffRL
28
11
0
11 Aug 2021
Rethinking of AlphaStar
Ruoxi Liu
29
2
0
07 Aug 2021
Learning more skills through optimistic exploration
D. Strouse
Kate Baumli
David Warde-Farley
Vlad Mnih
Steven Hansen
SSL
13
45
0
29 Jul 2021
Lyapunov-based uncertainty-aware safe reinforcement learning
Ashkan B. Jeddi
Nariman L. Dehghani
A. Shafieezadeh
22
7
0
29 Jul 2021
Accelerating Quadratic Optimization with Reinforcement Learning
Jeffrey Ichnowski
Paras Jain
Bartolomeo Stellato
G. Banjac
Michael Luo
Francesco Borrelli
Joseph E. Gonzalez
Ion Stoica
Ken Goldberg
OffRL
21
36
0
22 Jul 2021
Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration
Lukas Schafer
Filippos Christianos
Josiah P. Hanna
Stefano V. Albrecht
50
22
0
19 Jul 2021
Megaverse: Simulating Embodied Agents at One Million Experiences per Second
Aleksei Petrenko
Erik Wijmans
Brennan Shacklett
V. Koltun
LM&Ro
VGen
31
22
0
17 Jul 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Joel Z. Leibo
Edgar A. Duénez-Guzmán
A. Vezhnevets
J. Agapiou
P. Sunehag
Raphael Köster
Jayd Matyas
Charlie Beattie
Igor Mordatch
T. Graepel
OffRL
58
104
0
14 Jul 2021
Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Sungryull Sohn
Sungtae Lee
Jongwook Choi
H. V. Seijen
Mehdi Fatemi
Honglak Lee
182
3
0
13 Jul 2021
Learning Expected Emphatic Traces for Deep RL
Ray Jiang
Shangtong Zhang
Veronica Chelu
Adam White
Hado van Hasselt
OffRL
35
12
0
12 Jul 2021
Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers
Ruihan Yang
Minghao Zhang
Nicklas Hansen
Huazhe Xu
Xiaolong Wang
OffRL
23
102
0
08 Jul 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
39
66
0
08 Jul 2021
RRL: Resnet as representation for Reinforcement Learning
Rutav Shah
Vikash Kumar
OffRL
36
112
0
07 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
27
6
0
07 Jul 2021
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning
Erdun Gao
Fan Feng
Chaochao Lu
Sara Magliacane
Kun Zhang
44
66
0
06 Jul 2021
Agents that Listen: High-Throughput Reinforcement Learning with Multiple Sensory Systems
Shashank Hegde
Anssi Kanervisto
Aleksei Petrenko
VLM
13
9
0
05 Jul 2021
MixStyle Neural Networks for Domain Generalization and Adaptation
Kaiyang Zhou
Yongxin Yang
Yu Qiao
Tao Xiang
OOD
TTA
31
76
0
05 Jul 2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Assaf Hallak
Gal Dalal
Steven Dalton
I. Frosio
Shie Mannor
Gal Chechik
OffRL
OnRL
35
9
0
04 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
44
135
0
01 Jul 2021
Tuning Mixed Input Hyperparameters on the Fly for Efficient Population Based AutoRL
Jack Parker-Holder
Vu Nguyen
Shaan Desai
Stephen J. Roberts
43
16
0
30 Jun 2021
The Values Encoded in Machine Learning Research
Abeba Birhane
Pratyusha Kalluri
Dallas Card
William Agnew
Ravit Dotan
Michelle Bao
41
275
0
29 Jun 2021
Generalization of Reinforcement Learning with Policy-Aware Adversarial Data Augmentation
Hanping Zhang
Yuhong Guo
30
23
0
29 Jun 2021
Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
I. Kanitscheider
Joost Huizinga
David Farhi
William H. Guss
Brandon Houghton
...
Bowen Baker
Adrien Ecoffet
Jie Tang
Oleg Klimov
Jeff Clune
29
21
0
28 Jun 2021
Habitat 2.0: Training Home Assistants to Rearrange their Habitat
Andrew Szot
Alexander Clegg
Eric Undersander
Erik Wijmans
Yili Zhao
...
Z. Kira
V. Koltun
Jitendra Malik
Manolis Savva
Dhruv Batra
LM&Ro
39
502
0
28 Jun 2021
Graph Convolutional Memory using Topological Priors
Steven D. Morad
Stephan Liwicki
Ryan Kortvelesy
R. Mecca
Amanda Prorok
23
0
0
27 Jun 2021
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation
C. Freeman
Erik Frey
Anton Raichuk
Sertan Girgin
Igor Mordatch
Olivier Bachem
48
355
0
24 Jun 2021
Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation
Yunhao Tang
Tadashi Kozuno
Mark Rowland
Rémi Munos
Michal Valko
OffRL
27
9
0
24 Jun 2021
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators
Zaiwei Chen
S. T. Maguluri
Sanjay Shakkottai
Karthikeyan Shanmugam
OffRL
39
11
0
24 Jun 2021
Emphatic Algorithms for Deep Reinforcement Learning
Ray Jiang
Tom Zahavy
Zhongwen Xu
Adam White
Matteo Hessel
Charles Blundell
Hado van Hasselt
OffRL
41
19
0
21 Jun 2021
Scalable Safety-Critical Policy Evaluation with Accelerated Rare Event Sampling
Mengdi Xu
Peide Huang
Fengpei Li
Jiacheng Zhu
Xuewei Qi
K. Oguchi
Zhiyuan Huang
Henry Lam
Ding Zhao
16
4
0
19 Jun 2021
Proper Value Equivalence
Christopher Grimm
André Barreto
Gregory Farquhar
David Silver
Satinder Singh
OffRL
29
33
0
18 Jun 2021
Multi-Task Learning for User Engagement and Adoption in Live Video Streaming Events
Stefanos Antaris
Dimitrios Rafailidis
Romina Arriaza
OffRL
17
0
0
18 Jun 2021
MADE: Exploration via Maximizing Deviation from Explored Regions
Tianjun Zhang
Paria Rashidinejad
Jiantao Jiao
Yuandong Tian
Joseph E. Gonzalez
Stuart J. Russell
OffRL
36
42
0
18 Jun 2021
A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings
Eugene Vinitsky
Raphael Köster
J. Agapiou
Edgar A. Duénez-Guzmán
A. Vezhnevets
Joel Z. Leibo
32
37
0
16 Jun 2021
Towards Automatic Actor-Critic Solutions to Continuous Control
J. E. Grigsby
Jinsu Yoo
Yanjun Qi
OffRL
25
6
0
16 Jun 2021
Deep Reinforcement Learning for Conservation Decisions
Marcus Lapeyrolerie
Melissa S. Chapman
Kari E. A. Norman
C. Boettiger
OffRL
25
16
0
15 Jun 2021
Going Beyond Linear Transformers with Recurrent Fast Weight Programmers
Kazuki Irie
Imanol Schlag
Róbert Csordás
Jürgen Schmidhuber
33
57
0
11 Jun 2021
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning
Jiajun Fan
Changnan Xiao
Yue Huang
OffRL
21
10
0
11 Jun 2021
Taylor Expansion of Discount Factors
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
34
5
0
11 Jun 2021
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Xia Hu
Ji Liu
25
117
0
11 Jun 2021
Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
Xiangyu Liu
Hangtian Jia
Ying Wen
Yaodong Yang
Yujing Hu
Yingfeng Chen
Changjie Fan
Zhipeng Hu
28
18
0
09 Jun 2021
Pretraining Representations for Data-Efficient Reinforcement Learning
Max Schwarzer
Nitarshan Rajkumar
Michael Noukhovitch
Ankesh Anand
Laurent Charlin
Devon Hjelm
Philip Bachman
Aaron Courville
OffRL
47
114
0
09 Jun 2021
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Nathan Grinsztajn
Johan Ferret
Olivier Pietquin
Philippe Preux
M. Geist
SSL
42
14
0
08 Jun 2021