ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.01561
  4. Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
ArXivPDFHTML

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 982 papers shown
Title
Melting Pot 2.0
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duénez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
45
32
0
24 Nov 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
43
0
0
23 Nov 2022
Improving Multimodal Interactive Agents with Reinforcement Learning from
  Human Feedback
Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Josh Abramson
Arun Ahuja
Federico Carnevale
Petko Georgiev
Alex Goldin
...
Tamara von Glehn
Greg Wayne
Nathaniel Wong
Chen Yan
Rui Zhu
41
27
0
21 Nov 2022
Exploring through Random Curiosity with General Value Functions
Exploring through Random Curiosity with General Value Functions
Aditya A. Ramesh
Louis Kirsch
Sjoerd van Steenkiste
Jürgen Schmidhuber
40
9
0
18 Nov 2022
Explainability Via Causal Self-Talk
Explainability Via Causal Self-Talk
Nicholas A. Roy
Junkyung Kim
Neil C. Rabinowitz
CML
29
7
0
17 Nov 2022
Dynamic Collaborative Multi-Agent Reinforcement Learning Communication
  for Autonomous Drone Reforestation
Dynamic Collaborative Multi-Agent Reinforcement Learning Communication for Autonomous Drone Reforestation
P. D. Siedler
AI4CE
38
4
0
14 Nov 2022
Global and Local Analysis of Interestingness for Competency-Aware Deep
  Reinforcement Learning
Global and Local Analysis of Interestingness for Competency-Aware Deep Reinforcement Learning
Pedro Sequeira
Jesse Hostetler
Melinda Gervasio
18
0
0
11 Nov 2022
Efficient Deep Reinforcement Learning with Predictive Processing
  Proximal Policy Optimization
Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization
Burcu Küçükoglu
Walraaf Borkent
Bodo Rueckauer
Nasir Ahmad
Umut Güçlü
Marcel van Gerven
47
2
0
11 Nov 2022
Foundation Models for Semantic Novelty in Reinforcement Learning
Foundation Models for Semantic Novelty in Reinforcement Learning
Tarun Gupta
Peter Karkus
Tong Che
Danfei Xu
Marco Pavone
VLM
OffRL
LRM
45
7
0
09 Nov 2022
Progress and summary of reinforcement learning on energy management of
  MPS-EV
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
28
13
0
08 Nov 2022
Curriculum-based Asymmetric Multi-task Reinforcement Learning
Curriculum-based Asymmetric Multi-task Reinforcement Learning
H. Huang
Deheng Ye
Li Shen
Wen Liu
32
12
0
07 Nov 2022
On learning history based policies for controlling Markov decision
  processes
On learning history based policies for controlling Markov decision processes
Gandharv Patil
Aditya Mahajan
Doina Precup
OffRL
21
5
0
06 Nov 2022
Contrastive Value Learning: Implicit Models for Simple Offline RL
Contrastive Value Learning: Implicit Models for Simple Offline RL
Bogdan Mazoure
Benjamin Eysenbach
Ofir Nachum
Jonathan Tompson
SSL
OffRL
48
8
0
03 Nov 2022
Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural
  Language Instructions
Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions
Alexey Skrynnik
Zoya Volovikova
Marc-Alexandre Côté
Anton Voronov
Artem Zholus
...
Milagro Teruel
Ahmed Hassan Awadallah
Aleksandr I. Panov
Andrey Kravchenko
Julia Kiseleva
LM&Ro
64
11
0
01 Nov 2022
Learning to Navigate Wikipedia by Taking Random Walks
Learning to Navigate Wikipedia by Taking Random Walks
Manzil Zaheer
Kenneth Marino
Will Grathwohl
John Schultz
Wendy Shang
Sheila Babayan
Arun Ahuja
Ishita Dasgupta
Christine Kaeser-Chen
Rob Fergus
13
5
0
31 Oct 2022
Towards Versatile Embodied Navigation
Towards Versatile Embodied Navigation
Hongru Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
63
20
0
30 Oct 2022
Entity Divider with Language Grounding in Multi-Agent Reinforcement
  Learning
Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning
Ziluo Ding
Wanpeng Zhang
Junpeng Yue
Xiangjun Wang
Tiejun Huang
Zongqing Lu
LLMAG
AI4CE
28
4
0
25 Oct 2022
Avalon: A Benchmark for RL Generalization Using Procedurally Generated
  Worlds
Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds
Joshua Albrecht
Abraham J. Fetterman
Bryden Fogelman
Ellie Kitanidis
Bartosz Wróblewski
...
Michael Rosenthal
Maksis Knutins
Zachary Polizzi
James B. Simon
Kanjun Qiu
OffRL
29
23
0
24 Oct 2022
Evaluating Long-Term Memory in 3D Mazes
Evaluating Long-Term Memory in 3D Mazes
J. Pašukonis
Timothy Lillicrap
Danijar Hafner
3DV
23
21
0
24 Oct 2022
Rethinking Value Function Learning for Generalization in Reinforcement
  Learning
Rethinking Value Function Learning for Generalization in Reinforcement Learning
Seungyong Moon
JunYeong Lee
Hyun Oh Song
OOD
OffRL
26
16
0
18 Oct 2022
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
27
1
0
18 Oct 2022
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
Xi Chen
Tianyuan Shi
Qing Zhao
Yuchen Sun
Yunfei Gao
Xiangjun Wang
33
2
0
14 Oct 2022
A Scalable Finite Difference Method for Deep Reinforcement Learning
A Scalable Finite Difference Method for Deep Reinforcement Learning
Matthew Allen
John C. Raisbeck
Hakho Lee
16
0
0
14 Oct 2022
Bootstrap Advantage Estimation for Policy Optimization in Reinforcement
  Learning
Bootstrap Advantage Estimation for Policy Optimization in Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
OffRL
11
0
0
13 Oct 2022
Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform
  for the Customized Control Tasks of Fighter Aircrafts
Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts
Muhammed Murat Özbek
S. Yildirim
Muhammet Aksoy
Eric Kernin
E. Koyuncu
27
4
0
13 Oct 2022
Object-Category Aware Reinforcement Learning
Object-Category Aware Reinforcement Learning
Qi Yi
Rui Zhang
Shaohui Peng
Jiaming Guo
Xingui Hu
Zidong Du
Xishan Zhang
Qi Guo
Yunji Chen
CML
LRM
25
6
0
13 Oct 2022
Reinforcement Learning with Automated Auxiliary Loss Search
Reinforcement Learning with Automated Auxiliary Loss Search
Tairan He
Yuge Zhang
Kan Ren
Minghuan Liu
Che Wang
Weinan Zhang
Yuqing Yang
Dongsheng Li
43
16
0
12 Oct 2022
Contrastive Retrospection: honing in on critical steps for rapid
  learning and generalization in RL
Contrastive Retrospection: honing in on critical steps for rapid learning and generalization in RL
Chen Sun
Wannan Yang
Thomas Jiralerspong
Dane Malenfant
Benjamin Alsbury-Nealy
Yoshua Bengio
Blake A. Richards
OffRL
24
2
0
12 Oct 2022
Exploration via Elliptical Episodic Bonuses
Exploration via Elliptical Episodic Bonuses
Mikael Henaff
Roberta Raileanu
Minqi Jiang
Tim Rocktaschel
OffRL
37
40
0
11 Oct 2022
Discovered Policy Optimisation
Discovered Policy Optimisation
Chris Xiaoxuan Lu
J. Kuba
Alistair Letcher
Luke Metz
Christian Schroeder de Witt
Jakob N. Foerster
OffRL
49
76
0
11 Oct 2022
LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
DaeJin Jo
Sungwoong Kim
D. W. Nam
Taehwan Kwon
Seungeun Rho
Jongmin Kim
Donghoon Lee
OffRL
32
10
0
11 Oct 2022
Pre-Training for Robots: Offline RL Enables Learning New Tasks from a
  Handful of Trials
Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of Trials
Aviral Kumar
Anika Singh
F. Ebert
Mitsuhiko Nakamoto
Yanlai Yang
Chelsea Finn
Sergey Levine
OffRL
OnRL
131
66
0
11 Oct 2022
VER: Scaling On-Policy RL Leads to the Emergence of Navigation in
  Embodied Rearrangement
VER: Scaling On-Policy RL Leads to the Emergence of Navigation in Embodied Rearrangement
Erik Wijmans
Irfan Essa
Dhruv Batra
OffRL
52
13
0
11 Oct 2022
Using Both Demonstrations and Language Instructions to Efficiently Learn
  Robotic Tasks
Using Both Demonstrations and Language Instructions to Efficiently Learn Robotic Tasks
Albert Yu
Raymond J. Mooney
LM&Ro
32
19
0
10 Oct 2022
Scaling up Stochastic Gradient Descent for Non-convex Optimisation
Scaling up Stochastic Gradient Descent for Non-convex Optimisation
S. Mohamad
H. Alamri
A. Bouchachia
50
3
0
06 Oct 2022
Hyperbolic Deep Reinforcement Learning
Hyperbolic Deep Reinforcement Learning
Edoardo Cetin
B. Chamberlain
Michael M. Bronstein
Jonathan J. Hunt
50
21
0
04 Oct 2022
MSRL: Distributed Reinforcement Learning with Dataflow Fragments
MSRL: Distributed Reinforcement Learning with Dataflow Fragments
Huanzhou Zhu
Bo Zhao
Gang Chen
Weifeng Chen
Yijie Chen
Liang Shi
Yaodong Yang
Peter R. Pietzuch
Lei Chen
OffRL
MoE
31
7
0
03 Oct 2022
Improving Policy Learning via Language Dynamics Distillation
Improving Policy Learning via Language Dynamics Distillation
Victor Zhong
Jesse Mu
Luke Zettlemoyer
Edward Grefenstette
Tim Rocktaschel
OffRL
55
15
0
30 Sep 2022
Reinforcement Learning Algorithms: An Overview and Classification
Reinforcement Learning Algorithms: An Overview and Classification
Fadi AlMahamid
Katarina Grolinger
21
40
0
29 Sep 2022
Opportunities and Challenges from Using Animal Videos in Reinforcement
  Learning for Navigation
Opportunities and Challenges from Using Animal Videos in Reinforcement Learning for Navigation
Vittorio Giammarino
James Queeney
Lucas C. Carstensen
Michael Hasselmo
I. Paschalidis
OffRL
55
4
0
25 Sep 2022
On Efficient Reinforcement Learning for Full-length Game of StarCraft II
On Efficient Reinforcement Learning for Full-length Game of StarCraft II
Ruo-Ze Liu
Zhen-Jia Pang
Zhou-Yu Meng
Wenhai Wang
Yang Yu
Tong Lu
OffRL
36
18
0
23 Sep 2022
Parallel Reinforcement Learning Simulation for Visual Quadrotor
  Navigation
Parallel Reinforcement Learning Simulation for Visual Quadrotor Navigation
Jack D. Saunders
Sajad Saeedi
Wenbin Li
25
3
0
22 Sep 2022
Lamarckian Platform: Pushing the Boundaries of Evolutionary
  Reinforcement Learning towards Asynchronous Commercial Games
Lamarckian Platform: Pushing the Boundaries of Evolutionary Reinforcement Learning towards Asynchronous Commercial Games
Hui Bai
R. Shen
Yue Lin
Bo Xu
Ran Cheng
VLM
39
5
0
21 Sep 2022
Human-level Atari 200x faster
Human-level Atari 200x faster
Steven Kapturowski
Victor Campos
Ray Jiang
Nemanja Rakićević
Hado van Hasselt
Charles Blundell
Adria Puigdomenech Badia
OffRL
52
28
0
15 Sep 2022
Obtaining Robust Control and Navigation Policies for Multi-Robot
  Navigation via Deep Reinforcement Learning
Obtaining Robust Control and Navigation Policies for Multi-Robot Navigation via Deep Reinforcement Learning
Christian Jestel
H. Surmann
Jonas Stenzel
Oliver Urbann
Marius Brehler
11
9
0
07 Sep 2022
Project proposal: A modular reinforcement learning based automated
  theorem prover
Project proposal: A modular reinforcement learning based automated theorem prover
Boris Shminke
28
1
0
06 Sep 2022
Style-Agnostic Reinforcement Learning
Style-Agnostic Reinforcement Learning
Juyong Lee
Seokjun Ahn
Jaesik Park
33
4
0
31 Aug 2022
Unsupervised Representation Learning in Deep Reinforcement Learning: A
  Review
Unsupervised Representation Learning in Deep Reinforcement Learning: A Review
N. Botteghi
M. Poel
C. Brune
SSL
OffRL
49
11
0
27 Aug 2022
Towards Automated Imbalanced Learning with Deep Hierarchical
  Reinforcement Learning
Towards Automated Imbalanced Learning with Deep Hierarchical Reinforcement Learning
Daochen Zha
Kwei-Herng Lai
Qiaoyu Tan
Sirui Ding
Na Zou
Xia Hu
AI4TS
26
18
0
26 Aug 2022
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement
  Learning: A Systematic Review
Autonomous Unmanned Aerial Vehicle Navigation using Reinforcement Learning: A Systematic Review
Fadi AlMahamid
Katarina Grolinger
30
73
0
25 Aug 2022
Previous
123...678...181920
Next