ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.01561
  4. Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures
v1v2v3 (latest)

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 1,000 papers shown
Title
VER: Scaling On-Policy RL Leads to the Emergence of Navigation in
  Embodied Rearrangement
VER: Scaling On-Policy RL Leads to the Emergence of Navigation in Embodied Rearrangement
Erik Wijmans
Irfan Essa
Dhruv Batra
OffRL
120
14
0
11 Oct 2022
Using Both Demonstrations and Language Instructions to Efficiently Learn
  Robotic Tasks
Using Both Demonstrations and Language Instructions to Efficiently Learn Robotic Tasks
Albert Yu
Raymond J. Mooney
LM&Ro
74
20
0
10 Oct 2022
Scaling up Stochastic Gradient Descent for Non-convex Optimisation
Scaling up Stochastic Gradient Descent for Non-convex Optimisation
S. Mohamad
H. Alamri
A. Bouchachia
85
3
0
06 Oct 2022
Hyperbolic Deep Reinforcement Learning
Hyperbolic Deep Reinforcement Learning
Edoardo Cetin
B. Chamberlain
Michael M. Bronstein
Jonathan J. Hunt
93
22
0
04 Oct 2022
MSRL: Distributed Reinforcement Learning with Dataflow Fragments
MSRL: Distributed Reinforcement Learning with Dataflow Fragments
Huanzhou Zhu
Bo Zhao
Gang Chen
Weifeng Chen
Yijie Chen
Liang Shi
Yaodong Yang
Peter R. Pietzuch
Lei Chen
OffRLMoE
76
7
0
03 Oct 2022
Improving Policy Learning via Language Dynamics Distillation
Improving Policy Learning via Language Dynamics Distillation
Victor Zhong
Jesse Mu
Luke Zettlemoyer
Edward Grefenstette
Tim Rocktaschel
OffRL
88
15
0
30 Sep 2022
Reinforcement Learning Algorithms: An Overview and Classification
Reinforcement Learning Algorithms: An Overview and Classification
Fadi AlMahamid
Katarina Grolinger
39
45
0
29 Sep 2022
Opportunities and Challenges from Using Animal Videos in Reinforcement
  Learning for Navigation
Opportunities and Challenges from Using Animal Videos in Reinforcement Learning for Navigation
Vittorio Giammarino
James Queeney
Lucas C. Carstensen
Michael Hasselmo
I. Paschalidis
OffRL
84
5
0
25 Sep 2022
On Efficient Reinforcement Learning for Full-length Game of StarCraft II
On Efficient Reinforcement Learning for Full-length Game of StarCraft II
Ruo-Ze Liu
Zhen-Jia Pang
Zhou-Yu Meng
Wenhai Wang
Yang Yu
Tong Lu
OffRL
63
19
0
23 Sep 2022
Parallel Reinforcement Learning Simulation for Visual Quadrotor
  Navigation
Parallel Reinforcement Learning Simulation for Visual Quadrotor Navigation
Jack D. Saunders
Sajad Saeedi
Wenbin Li
49
3
0
22 Sep 2022
Lamarckian Platform: Pushing the Boundaries of Evolutionary
  Reinforcement Learning towards Asynchronous Commercial Games
Lamarckian Platform: Pushing the Boundaries of Evolutionary Reinforcement Learning towards Asynchronous Commercial Games
Hui Bai
R. Shen
Yue Lin
Bo Xu
Ran Cheng
VLM
85
5
0
21 Sep 2022
Human-level Atari 200x faster
Human-level Atari 200x faster
Steven Kapturowski
Victor Campos
Ray Jiang
Nemanja Rakićević
Hado van Hasselt
Charles Blundell
Adria Puigdomenech Badia
OffRL
94
30
0
15 Sep 2022
Obtaining Robust Control and Navigation Policies for Multi-Robot
  Navigation via Deep Reinforcement Learning
Obtaining Robust Control and Navigation Policies for Multi-Robot Navigation via Deep Reinforcement Learning
Christian Jestel
H. Surmann
Jonas Stenzel
Oliver Urbann
Marius Brehler
48
9
0
07 Sep 2022
Project proposal: A modular reinforcement learning based automated
  theorem prover
Project proposal: A modular reinforcement learning based automated theorem prover
Boris Shminke
64
1
0
06 Sep 2022
Style-Agnostic Reinforcement Learning
Style-Agnostic Reinforcement Learning
Juyong Lee
Seokjun Ahn
Jaesik Park
42
4
0
31 Aug 2022
Unsupervised Representation Learning in Deep Reinforcement Learning: A
  Review
Unsupervised Representation Learning in Deep Reinforcement Learning: A Review
N. Botteghi
M. Poel
C. Brune
SSLOffRL
101
13
0
27 Aug 2022
Towards Automated Imbalanced Learning with Deep Hierarchical
  Reinforcement Learning
Towards Automated Imbalanced Learning with Deep Hierarchical Reinforcement Learning
Daochen Zha
Kwei-Herng Lai
Qiaoyu Tan
Sirui Ding
Na Zou
Helen Zhou
AI4TS
68
18
0
26 Aug 2022
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement
  Learning: A Systematic Review
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement Learning: A Systematic Review
Fadi AlMahamid
Katarina Grolinger
63
76
0
25 Aug 2022
A Framework for Understanding and Visualizing Strategies of RL Agents
A Framework for Understanding and Visualizing Strategies of RL Agents
Pedro Sequeira
Daniel Elenius
Jesse Hostetler
Melinda Gervasio
25
2
0
17 Aug 2022
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free
  Reinforcement Learning
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
73
105
0
16 Aug 2022
On the Limitations of Continual Learning for Malware Classification
On the Limitations of Continual Learning for Malware Classification
Mohammad Saidur Rahman
Scott E. Coull
M. Wright
52
17
0
13 Aug 2022
AutoShard: Automated Embedding Table Sharding for Recommender Systems
AutoShard: Automated Embedding Table Sharding for Recommender Systems
Daochen Zha
Louis Feng
Bhargav Bhushanam
Dhruv Choudhary
Jade Nie
Yuandong Tian
Jay Chae
Yi-An Ma
A. Kejariwal
Helen Zhou
85
32
0
12 Aug 2022
Model-Free Generative Replay for Lifelong Reinforcement Learning:
  Application to Starcraft-2
Model-Free Generative Replay for Lifelong Reinforcement Learning: Application to Starcraft-2
Z. Daniels
Aswin Raghavan
Jesse Hostetler
Abrar Rahman
Indranil Sur
M. Piacentino
Ajay Divakaran
CLLOffRL
93
13
0
09 Aug 2022
An Approximate Policy Iteration Viewpoint of Actor-Critic Algorithms
An Approximate Policy Iteration Viewpoint of Actor-Critic Algorithms
Zaiwei Chen
S. T. Maguluri
80
1
0
05 Aug 2022
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step
  Q-learning: A Novel Correction Approach
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach
Baturay Saglam
Dogan C. Cicek
Furkan B. Mutlu
Suleyman S. Kozat
OffRLOnRL
87
1
0
01 Aug 2022
Latent Properties of Lifelong Learning Systems
Latent Properties of Lifelong Learning Systems
Corban G. Rivera
C. Ashcraft
Alexander New
J. Schmidt
Gautam K. Vallabha
CLL
42
0
0
28 Jul 2022
Explain My Surprise: Learning Efficient Long-Term Memory by Predicting
  Uncertain Outcomes
Explain My Surprise: Learning Efficient Long-Term Memory by Predicting Uncertain Outcomes
A. Sorokin
N. Buzun
Leonid Pugachev
Andrey Kravchenko
167
8
0
27 Jul 2022
Optimizing Empty Container Repositioning and Fleet Deployment via
  Configurable Semi-POMDPs
Optimizing Empty Container Repositioning and Fleet Deployment via Configurable Semi-POMDPs
Riccardo Poiani
Ciprian Stirbu
Alberto Maria Metelli
Marcello Restelli
25
1
0
25 Jul 2022
Annealed Training for Combinatorial Optimization on Graphs
Annealed Training for Combinatorial Optimization on Graphs
Haoran Sun
E. Guha
H. Dai
96
21
0
23 Jul 2022
Bootstrap State Representation using Style Transfer for Better
  Generalization in Deep Reinforcement Learning
Bootstrap State Representation using Style Transfer for Better Generalization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
OffRL
71
4
0
15 Jul 2022
Outcome-Guided Counterfactuals for Reinforcement Learning Agents from a
  Jointly Trained Generative Latent Space
Outcome-Guided Counterfactuals for Reinforcement Learning Agents from a Jointly Trained Generative Latent Space
Eric Yeh
Pedro Sequeira
Jesse Hostetler
Melinda Gervasio
OODCMLBDLOffRL
54
2
0
15 Jul 2022
GriddlyJS: A Web IDE for Reinforcement Learning
GriddlyJS: A Web IDE for Reinforcement Learning
C. Bamford
Minqi Jiang
Mikayel Samvelyan
Tim Rocktaschel
OnRL
103
5
0
13 Jul 2022
Temporal Disentanglement of Representations for Improved Generalisation
  in Reinforcement Learning
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
Mhairi Dunion
Trevor A. McInroe
K. Luck
Josiah P. Hanna
Stefano V. Albrecht
OODDRL
81
18
0
12 Jul 2022
Grounding Aleatoric Uncertainty for Unsupervised Environment Design
Grounding Aleatoric Uncertainty for Unsupervised Environment Design
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Andrei Lupu
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
Jakob N. Foerster
100
15
0
11 Jul 2022
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic
  Reinforcement Learning
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning
Homer Walke
Jonathan Yang
Albert Yu
Aviral Kumar
Jedrzej Orbik
Avi Singh
Sergey Levine
OffRLOnRL
92
32
0
11 Jul 2022
Generalized Policy Improvement Algorithms with Theoretically Supported
  Sample Reuse
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
68
2
0
28 Jun 2022
Improving Policy Optimization with Generalist-Specialist Learning
Improving Policy Optimization with Generalist-Specialist Learning
Zhiwei Jia
Xuanlin Li
Z. Ling
Shuang Liu
Yiran Wu
H. Su
OffRL
83
25
0
26 Jun 2022
POGEMA: Partially Observable Grid Environment for Multiple Agents
POGEMA: Partially Observable Grid Environment for Multiple Agents
Alexey Skrynnik
Anton Andreychuk
Konstantin Yakovlev
Aleksandr I. Panov
26
6
0
22 Jun 2022
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution
  Engine
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine
Jiayi Weng
Min Lin
Shengyi Huang
Bo Liu
Denys Makoviichuk
...
Yufan Song
Ting Luo
Yukun Jiang
Zhongwen Xu
Shuicheng Yan
MoE
110
63
0
21 Jun 2022
Safe and Psychologically Pleasant Traffic Signal Control with
  Reinforcement Learning using Action Masking
Safe and Psychologically Pleasant Traffic Signal Control with Reinforcement Learning using Action Masking
Arthur Muller
M. Sabatelli
32
8
0
21 Jun 2022
DNA: Proximal Policy Optimization with a Dual Network Architecture
DNA: Proximal Policy Optimization with a Dual Network Architecture
Mathew H. Aitchison
Penny Sweetser
OffRL
58
4
0
20 Jun 2022
Fast Population-Based Reinforcement Learning on a Single Machine
Fast Population-Based Reinforcement Learning on a Single Machine
Arthur Flajolet
Claire Bizon Monroc
Karim Beguir
Thomas Pierrot
OffRL
74
10
0
17 Jun 2022
SMPL: Simulated Industrial Manufacturing and Process Control Learning
  Environments
SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments
Mohan Zhang
Xiaozhou Wang
Benjamin Decardi-Nelson
Bo Song
A. Zhang
...
Jiayi Cheng
Xiaohong Liu
DengDeng Yu
Matthew Poon
Animesh Garg
69
4
0
17 Jun 2022
The State of Sparse Training in Deep Reinforcement Learning
The State of Sparse Training in Deep Reinforcement Learning
L. Graesser
Utku Evci
Erich Elsen
Pablo Samuel Castro
OffRL
77
40
0
17 Jun 2022
A Parametric Class of Approximate Gradient Updates for Policy
  Optimization
A Parametric Class of Approximate Gradient Updates for Policy Optimization
Ramki Gummadi
Saurabh Kumar
Junfeng Wen
Dale Schuurmans
54
0
0
17 Jun 2022
Finite-Time Analysis of Fully Decentralized Single-Timescale
  Actor-Critic
Finite-Time Analysis of Fully Decentralized Single-Timescale Actor-Critic
Qijun Luo
Xiao Li
102
1
0
12 Jun 2022
Social Network Structure Shapes Innovation: Experience-sharing in RL
  with SAPIENS
Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS
Eleni Nisioti
Matéo Mahaut
Pierre-Yves Oudeyer
Ida Momennejad
Clément Moulin-Frier
90
10
0
10 Jun 2022
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
Mandi Zhao
Pieter Abbeel
Stephen James
OffRL
151
34
0
07 Jun 2022
Generalized Data Distribution Iteration
Generalized Data Distribution Iteration
Jiajun Fan
Changnan Xiao
OffRL
49
13
0
07 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to
  Accelerate Progress
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Rameswar Panda
Marc G. Bellemare
OffRLOnRL
126
66
0
03 Jun 2022
Previous
123...789...181920
Next