ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.01561
  4. Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
ArXivPDFHTML

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 982 papers shown
Title
A Framework for Understanding and Visualizing Strategies of RL Agents
A Framework for Understanding and Visualizing Strategies of RL Agents
Pedro Sequeira
Daniel Elenius
Jesse Hostetler
Melinda Gervasio
6
2
0
17 Aug 2022
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free
  Reinforcement Learning
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
26
103
0
16 Aug 2022
On the Limitations of Continual Learning for Malware Classification
On the Limitations of Continual Learning for Malware Classification
Mohammad Saidur Rahman
Scott E. Coull
M. Wright
34
13
0
13 Aug 2022
AutoShard: Automated Embedding Table Sharding for Recommender Systems
AutoShard: Automated Embedding Table Sharding for Recommender Systems
Daochen Zha
Louis Feng
Bhargav Bhushanam
Dhruv Choudhary
Jade Nie
Yuandong Tian
Jay Chae
Yi-An Ma
A. Kejariwal
Xia Hu
40
30
0
12 Aug 2022
Model-Free Generative Replay for Lifelong Reinforcement Learning:
  Application to Starcraft-2
Model-Free Generative Replay for Lifelong Reinforcement Learning: Application to Starcraft-2
Z. Daniels
Aswin Raghavan
Jesse Hostetler
Abrar Rahman
Indranil Sur
M. Piacentino
Ajay Divakaran
CLL
OffRL
36
12
0
09 Aug 2022
An Approximate Policy Iteration Viewpoint of Actor-Critic Algorithms
An Approximate Policy Iteration Viewpoint of Actor-Critic Algorithms
Zaiwei Chen
S. T. Maguluri
34
0
0
05 Aug 2022
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step
  Q-learning: A Novel Correction Approach
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach
Baturay Saglam
Dogan C. Cicek
Furkan B. Mutlu
Suleyman Serdar Kozat
OffRL
OnRL
31
1
0
01 Aug 2022
Latent Properties of Lifelong Learning Systems
Latent Properties of Lifelong Learning Systems
Corban G. Rivera
C. Ashcraft
Alexander New
J. Schmidt
Gautam K. Vallabha
CLL
17
0
0
28 Jul 2022
Explain My Surprise: Learning Efficient Long-Term Memory by Predicting
  Uncertain Outcomes
Explain My Surprise: Learning Efficient Long-Term Memory by Predicting Uncertain Outcomes
A. Sorokin
N. Buzun
Leonid Pugachev
Andrey Kravchenko
33
8
0
27 Jul 2022
Optimizing Empty Container Repositioning and Fleet Deployment via
  Configurable Semi-POMDPs
Optimizing Empty Container Repositioning and Fleet Deployment via Configurable Semi-POMDPs
Riccardo Poiani
Ciprian Stirbu
Alberto Maria Metelli
Marcello Restelli
14
1
0
25 Jul 2022
Annealed Training for Combinatorial Optimization on Graphs
Annealed Training for Combinatorial Optimization on Graphs
Haoran Sun
E. Guha
H. Dai
40
18
0
23 Jul 2022
Bootstrap State Representation using Style Transfer for Better
  Generalization in Deep Reinforcement Learning
Bootstrap State Representation using Style Transfer for Better Generalization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
OffRL
34
4
0
15 Jul 2022
Outcome-Guided Counterfactuals for Reinforcement Learning Agents from a
  Jointly Trained Generative Latent Space
Outcome-Guided Counterfactuals for Reinforcement Learning Agents from a Jointly Trained Generative Latent Space
Eric Yeh
Pedro Sequeira
Jesse Hostetler
Melinda Gervasio
OOD
CML
BDL
OffRL
25
2
0
15 Jul 2022
GriddlyJS: A Web IDE for Reinforcement Learning
GriddlyJS: A Web IDE for Reinforcement Learning
C. Bamford
Minqi Jiang
Mikayel Samvelyan
Tim Rocktaschel
OnRL
45
4
0
13 Jul 2022
Temporal Disentanglement of Representations for Improved Generalisation
  in Reinforcement Learning
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
Mhairi Dunion
Trevor A. McInroe
K. Luck
Josiah P. Hanna
Stefano V. Albrecht
OOD
DRL
28
18
0
12 Jul 2022
Grounding Aleatoric Uncertainty for Unsupervised Environment Design
Grounding Aleatoric Uncertainty for Unsupervised Environment Design
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Andrei Lupu
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
Jakob N. Foerster
48
13
0
11 Jul 2022
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic
  Reinforcement Learning
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning
Homer Walke
Jonathan Yang
Albert Yu
Aviral Kumar
Jedrzej Orbik
Avi Singh
Sergey Levine
OffRL
OnRL
31
32
0
11 Jul 2022
Generalized Policy Improvement Algorithms with Theoretically Supported
  Sample Reuse
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
32
2
0
28 Jun 2022
Improving Policy Optimization with Generalist-Specialist Learning
Improving Policy Optimization with Generalist-Specialist Learning
Zhiwei Jia
Xuanlin Li
Z. Ling
Shuang Liu
Yiran Wu
H. Su
OffRL
34
24
0
26 Jun 2022
POGEMA: Partially Observable Grid Environment for Multiple Agents
POGEMA: Partially Observable Grid Environment for Multiple Agents
Alexey Skrynnik
Anton Andreychuk
Konstantin Yakovlev
Aleksandr I. Panov
12
6
0
22 Jun 2022
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution
  Engine
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine
Jiayi Weng
Min Lin
Shengyi Huang
Bo Liu
Denys Makoviichuk
...
Yufan Song
Ting Luo
Yukun Jiang
Zhongwen Xu
Shuicheng Yan
MoE
19
61
0
21 Jun 2022
Safe and Psychologically Pleasant Traffic Signal Control with
  Reinforcement Learning using Action Masking
Safe and Psychologically Pleasant Traffic Signal Control with Reinforcement Learning using Action Masking
Arthur Muller
M. Sabatelli
30
8
0
21 Jun 2022
DNA: Proximal Policy Optimization with a Dual Network Architecture
DNA: Proximal Policy Optimization with a Dual Network Architecture
Mathew H. Aitchison
Penny Sweetser
OffRL
33
4
0
20 Jun 2022
Fast Population-Based Reinforcement Learning on a Single Machine
Fast Population-Based Reinforcement Learning on a Single Machine
Arthur Flajolet
Claire Bizon Monroc
Karim Beguir
Thomas Pierrot
OffRL
35
10
0
17 Jun 2022
SMPL: Simulated Industrial Manufacturing and Process Control Learning
  Environments
SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments
Mohan Zhang
Xiaozhou Wang
Benjamin Decardi-Nelson
Bo Song
A. Zhang
...
Jiayi Cheng
Xiaohong Liu
DengDeng Yu
Matthew Poon
Animesh Garg
26
4
0
17 Jun 2022
The State of Sparse Training in Deep Reinforcement Learning
The State of Sparse Training in Deep Reinforcement Learning
L. Graesser
Utku Evci
Erich Elsen
Pablo Samuel Castro
OffRL
20
34
0
17 Jun 2022
A Parametric Class of Approximate Gradient Updates for Policy
  Optimization
A Parametric Class of Approximate Gradient Updates for Policy Optimization
Ramki Gummadi
Saurabh Kumar
Junfeng Wen
Dale Schuurmans
28
0
0
17 Jun 2022
Finite-Time Analysis of Fully Decentralized Single-Timescale
  Actor-Critic
Finite-Time Analysis of Fully Decentralized Single-Timescale Actor-Critic
Qijun Luo
Xiao Li
43
1
0
12 Jun 2022
Social Network Structure Shapes Innovation: Experience-sharing in RL
  with SAPIENS
Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS
Eleni Nisioti
Matéo Mahaut
Pierre-Yves Oudeyer
Ida Momennejad
Clément Moulin-Frier
27
9
0
10 Jun 2022
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
Mandi Zhao
Pieter Abbeel
Stephen James
OffRL
33
33
0
07 Jun 2022
Generalized Data Distribution Iteration
Generalized Data Distribution Iteration
Jiajun Fan
Changnan Xiao
OffRL
19
12
0
07 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to
  Accelerate Progress
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRL
OnRL
37
63
0
03 Jun 2022
The Phenomenon of Policy Churn
The Phenomenon of Policy Churn
Tom Schaul
André Barreto
John Quan
Georg Ostrovski
44
26
0
01 Jun 2022
Efficient Scheduling of Data Augmentation for Deep Reinforcement
  Learning
Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning
Byungchan Ko
Jungseul Ok
OnRL
27
5
0
01 Jun 2022
Byzantine-Robust Online and Offline Distributed Reinforcement Learning
Byzantine-Robust Online and Offline Distributed Reinforcement Learning
Yiding Chen
Xuezhou Zhang
Kai Zhang
Mengdi Wang
Xiaojin Zhu
OffRL
31
16
0
01 Jun 2022
BRExIt: On Opponent Modelling in Expert Iteration
BRExIt: On Opponent Modelling in Expert Iteration
Daniel Hernández
Hendrik Baier
Michael Kaisers
8
2
0
31 May 2022
Reinforcement Learning with a Terminator
Reinforcement Learning with a Terminator
Guy Tennenholtz
Nadav Merlis
Lior Shani
Shie Mannor
Uri Shalit
Gal Chechik
Assaf Hallak
Gal Dalal
25
5
0
30 May 2022
Off-Beat Multi-Agent Reinforcement Learning
Off-Beat Multi-Agent Reinforcement Learning
Wei Qiu
Weixun Wang
R. Wang
Bo An
Yujing Hu
S. Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
OffRL
33
2
0
27 May 2022
History Compression via Language Models in Reinforcement Learning
History Compression via Language Models in Reinforcement Learning
Fabian Paischer
Thomas Adler
Vihang Patil
Angela Bitto-Nemling
Markus Holzleitner
Sebastian Lehner
Hamid Eghbalzadeh
Sepp Hochreiter
OffRL
AI4TS
33
42
0
24 May 2022
An Evaluation Study of Intrinsic Motivation Techniques applied to
  Reinforcement Learning over Hard Exploration Environments
An Evaluation Study of Intrinsic Motivation Techniques applied to Reinforcement Learning over Hard Exploration Environments
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
29
9
0
23 May 2022
Learning Task-relevant Representations for Generalization via
  Characteristic Functions of Reward Sequence Distributions
Learning Task-relevant Representations for Generalization via Characteristic Functions of Reward Sequence Distributions
Rui Yang
Jie Wang
Zijie Geng
Mingxuan Ye
Shuiwang Ji
Bin Li
Fengli Wu
OOD
36
20
0
20 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still
  Insufficient according to an Off-Policy Measure
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
43
8
0
20 May 2022
A Generalist Agent
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
124
793
0
12 May 2022
Efficient Distributed Framework for Collaborative Multi-Agent
  Reinforcement Learning
Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Xiaohan Hou
Jia-jia Zhang
Xinyu Wang
Jing Xiao
24
0
0
11 May 2022
Interactive Grounded Language Understanding in a Collaborative
  Environment: IGLU 2021
Interactive Grounded Language Understanding in a Collaborative Environment: IGLU 2021
Julia Kiseleva
Ziming Li
Mohammad Aliannejadi
Shrestha Mohanty
Maartje ter Hoeve
...
I. Churin
Putra Manggala
Kata Naszádi
Michiel van der Meer
Taewoon Kim
LLMAG
35
30
0
05 May 2022
Collaborative Target Search with a Visual Drone Swarm: An Adaptive
  Curriculum Embedded Multistage Reinforcement Learning Approach
Collaborative Target Search with a Visual Drone Swarm: An Adaptive Curriculum Embedded Multistage Reinforcement Learning Approach
Jiaping Xiao
Phumrapee Pisutsin
Mir Feroskhan
29
16
0
26 Apr 2022
Graph Neural Network based Agent in Google Research Football
Graph Neural Network based Agent in Google Research Football
Yizhan Niu
Jinglong Liu
Yuhao Shi
Jiren Zhu
GNN
27
2
0
23 Apr 2022
Local Feature Swapping for Generalization in Reinforcement Learning
Local Feature Swapping for Generalization in Reinforcement Learning
David Bertoin
Emmanuel Rachelson
OOD
29
14
0
13 Apr 2022
Dynamic Dialogue Policy for Continual Reinforcement Learning
Dynamic Dialogue Policy for Continual Reinforcement Learning
Christian Geishauser
Carel van Niekerk
Nurul Lubis
Michael Heck
Hsien-chin Lin
Shutong Feng
Milica Gavsić
CLL
OffRL
48
14
0
12 Apr 2022
When Should We Prefer Offline Reinforcement Learning Over Behavioral
  Cloning?
When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?
Aviral Kumar
Joey Hong
Anika Singh
Sergey Levine
OffRL
50
77
0
12 Apr 2022
Previous
123...789...181920
Next