ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.01561
  4. Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures
v1v2v3 (latest)

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 1,000 papers shown
Title
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
116
30
0
19 Dec 2023
Learning to Act without Actions
Learning to Act without Actions
Dominik Schmidt
Minqi Jiang
OffRL
137
38
0
17 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
82
10
0
15 Dec 2023
Auto MC-Reward: Automated Dense Reward Design with Large Language Models
  for Minecraft
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
Hao Li
Xue Yang
Zhaokai Wang
Xizhou Zhu
Jie Zhou
Yu Qiao
Xiaogang Wang
Hongsheng Li
Lewei Lu
Jifeng Dai
97
36
0
14 Dec 2023
The Effective Horizon Explains Deep RL Performance in Stochastic
  Environments
The Effective Horizon Explains Deep RL Performance in Stochastic Environments
Cassidy Laidlaw
Banghua Zhu
Stuart J. Russell
Anca Dragan
88
3
0
13 Dec 2023
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRLOOD
194
5
0
13 Dec 2023
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Jing Hou
Guang Chen
Ruiqi Zhang
Zhijun Li
Shangding Gu
Changjun Jiang
OffRL
80
2
0
11 Dec 2023
Bad Students Make Great Teachers: Active Learning Accelerates
  Large-Scale Visual Understanding
Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding
Talfan Evans
Shreya Pathak
Hamza Merzic
Jonathan Schwarz
Ryutaro Tanno
Olivier J. Hénaff
80
17
0
08 Dec 2023
Efficient Parallel Reinforcement Learning Framework using the Reactor
  Model
Efficient Parallel Reinforcement Learning Framework using the Reactor Model
Jacky Kwok
Marten Lohstroh
Edward A. Lee
64
0
0
07 Dec 2023
Colour versus Shape Goal Misgeneralization in Reinforcement Learning: A
  Case Study
Colour versus Shape Goal Misgeneralization in Reinforcement Learning: A Case Study
Karolis Ramanauskas
Özgür Simsek
64
0
0
05 Dec 2023
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games
Lukas Schäfer
Logan Jones
Anssi Kanervisto
Yuhan Cao
Tabish Rashid
Raluca Georgescu
David Bignell
Siddhartha Sen
Andrea Trevino Gavito
Sam Devlin
174
3
0
04 Dec 2023
Harnessing Discrete Representations For Continual Reinforcement Learning
Harnessing Discrete Representations For Continual Reinforcement Learning
Edan Meyer
Adam White
Marlos C. Machado
OffRL
72
5
0
02 Dec 2023
Handling Cost and Constraints with Off-Policy Deep Reinforcement
  Learning
Handling Cost and Constraints with Off-Policy Deep Reinforcement Learning
Jared Markowitz
Jesse Silverberg
Gary Collins
OffRL
38
0
0
30 Nov 2023
Replay across Experiments: A Natural Extension of Off-Policy RL
Replay across Experiments: A Natural Extension of Off-Policy RL
Dhruva Tirumala
Thomas Lampe
José Enrique Chen
Tuomas Haarnoja
Sandy Huang
...
Tim Hertweck
Leonard Hasenclever
Martin Riedmiller
N. Heess
Markus Wulfmeier
OffRL
105
8
0
27 Nov 2023
Agent as Cerebrum, Controller as Cerebellum: Implementing an Embodied
  LMM-based Agent on Drones
Agent as Cerebrum, Controller as Cerebellum: Implementing an Embodied LMM-based Agent on Drones
Haoran Zhao
Fengxing Pan
Huqiuyue Ping
Yaoming Zhou
AI4CE
86
12
0
25 Nov 2023
Probabilistic Inference in Reinforcement Learning Done Right
Probabilistic Inference in Reinforcement Learning Done Right
Jean Tarbouriech
Tor Lattimore
Brendan O'Donoghue
BDLOffRL
88
4
0
22 Nov 2023
minimax: Efficient Baselines for Autocurricula in JAX
minimax: Efficient Baselines for Autocurricula in JAX
Minqi Jiang
Michael Dennis
Edward Grefenstette
Tim Rocktaschel
76
9
0
21 Nov 2023
On-Policy Policy Gradient Reinforcement Learning Without On-Policy
  Sampling
On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling
Nicholas Corrado
Josiah P. Hanna
OffRL
62
2
0
14 Nov 2023
An introduction to reinforcement learning for neuroscience
An introduction to reinforcement learning for neuroscience
Kristopher T. Jensen
OODOffRLAI4CE
50
1
0
13 Nov 2023
Towards Continual Reinforcement Learning for Quadruped Robots
Towards Continual Reinforcement Learning for Quadruped Robots
G. Minelli
V. Vassiliades
CLL
78
1
0
12 Nov 2023
LLM Augmented Hierarchical Agents
LLM Augmented Hierarchical Agents
Bharat Prakash
Tim Oates
T. Mohsenin
51
4
0
09 Nov 2023
Real-Time Recurrent Reinforcement Learning
Real-Time Recurrent Reinforcement Learning
Julian Lemmel
Radu Grosu
132
2
0
08 Nov 2023
Handover Protocol Learning for LEO Satellite Networks: Access Delay and
  Collision Minimization
Handover Protocol Learning for LEO Satellite Networks: Access Delay and Collision Minimization
Ju-Hyung Lee
C. Park
Soohyun Park
A. Molisch
136
11
0
31 Oct 2023
Improving Intrinsic Exploration by Creating Stationary Objectives
Improving Intrinsic Exploration by Creating Stationary Objectives
Roger Creus Castanyer
Javier Civera
Taihú Pire
OffRL
115
4
0
27 Oct 2023
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
Dexter Neo
Tsuhan Chen
54
1
0
26 Oct 2023
Combining Behaviors with the Successor Features Keyboard
Combining Behaviors with the Successor Features Keyboard
Wilka Carvalho
Andre Saraiva
Angelos Filos
Andrew Kyle Lampinen
Loic Matthey
Richard L. Lewis
Honglak Lee
Satinder Singh
Danilo Jimenez Rezende
Daniel Zoran
84
4
0
24 Oct 2023
μ-DDRL: A QoS-Aware Distributed Deep Reinforcement Learning
  Technique for Service Offloading in Fog computing Environments
μ-DDRL: A QoS-Aware Distributed Deep Reinforcement Learning Technique for Service Offloading in Fog computing Environments
M. Goudarzi
M. A. Rodriguez
Majid Sarvi
Rajkumar Buyya
OffRL
79
3
0
13 Oct 2023
Cross-Episodic Curriculum for Transformer Agents
Cross-Episodic Curriculum for Transformer Agents
Lucy Xiaoyang Shi
Yunfan Jiang
Jake Grigsby
Linxi "Jim" Fan
Yuke Zhu
77
7
0
12 Oct 2023
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General
  Sequential Decision Scenarios
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Yazhe Niu
Yuan Pu
Zhenjie Yang
Xueyan Li
Tong Zhou
Jiyuan Ren
Shuai Hu
Hongsheng Li
Yu Liu
139
15
0
12 Oct 2023
GROOT: Learning to Follow Instructions by Watching Gameplay Videos
GROOT: Learning to Follow Instructions by Watching Gameplay Videos
Shaofei Cai
Bowei Zhang
Zihao Wang
Xiaojian Ma
Hoang Trung-Dung
Yitao Liang
161
27
0
12 Oct 2023
Understanding and Controlling a Maze-Solving Policy Network
Understanding and Controlling a Maze-Solving Policy Network
Ulisse Mini
Peli Grietzer
Mrinank Sharma
Austin Meek
M. MacDiarmid
Alexander Matt Turner
51
18
0
12 Oct 2023
RoboHive: A Unified Framework for Robot Learning
RoboHive: A Unified Framework for Robot Learning
Vikash Kumar
Rutav Shah
Gaoyue Zhou
Vincent Moens
Vittorio Caggiano
Jay Vakil
Abhishek Gupta
Aravind Rajeswaran
69
25
0
10 Oct 2023
Confronting Reward Model Overoptimization with Constrained RLHF
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
Tuomas Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
103
55
0
06 Oct 2023
Small batch deep reinforcement learning
Small batch deep reinforcement learning
J. Obando-Ceron
Marc G. Bellemare
Pablo Samuel Castro
VLM
104
19
0
05 Oct 2023
Neural architecture impact on identifying temporally extended
  Reinforcement Learning tasks
Neural architecture impact on identifying temporally extended Reinforcement Learning tasks
Victor Vadakechirayath George
OffRL
57
0
0
04 Oct 2023
Learning Diverse Skills for Local Navigation under Multi-constraint
  Optimality
Learning Diverse Skills for Local Navigation under Multi-constraint Optimality
Jin Cheng
Marin Vlastelica
Pavel Kolev
Chenhao Li
Georg Martius
71
6
0
03 Oct 2023
Algebras of actions in an agent's representations of the world
Algebras of actions in an agent's representations of the world
Alexander Dean
Eduardo Alonso
Esther Mondragón
67
0
0
02 Oct 2023
Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning
  Platform
Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform
Shengyi Huang
Jiayi Weng
Rujikorn Charakorn
Min Lin
Zhongwen Xu
Santiago Ontañón
92
3
0
29 Sep 2023
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of
  Agents
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents
Marco Pleines
Matthias Pallasch
Frank Zimmer
Mike Preuss
OffRL
62
1
0
29 Sep 2023
Controlling Continuous Relaxation for Combinatorial Optimization
Controlling Continuous Relaxation for Combinatorial Optimization
Yuma Ichikawa
83
6
0
29 Sep 2023
RLLTE: Long-Term Evolution Project of Reinforcement Learning
RLLTE: Long-Term Evolution Project of Reinforcement Learning
Tao Lv
Zequn Zhang
Yang Xu
Shihao Luo
Bo Li
Xin Jin
Wenjun Zeng
OffRL
77
1
0
28 Sep 2023
Distill Knowledge in Multi-task Reinforcement Learning with
  Optimal-Transport Regularization
Distill Knowledge in Multi-task Reinforcement Learning with Optimal-Transport Regularization
Bang Giang Le
Viet-Cuong Ta
OT
85
1
0
27 Sep 2023
Diagnosing and exploiting the computational demands of videos games for
  deep reinforcement learning
Diagnosing and exploiting the computational demands of videos games for deep reinforcement learning
L. Govindarajan
Rex G Liu
Drew Linsley
A. Ashok
Max Reuter
M. Frank
Thomas Serre
OffRL
58
0
0
22 Sep 2023
Boosting Studies of Multi-Agent Reinforcement Learning on Google
  Research Football Environment: the Past, Present, and Future
Boosting Studies of Multi-Agent Reinforcement Learning on Google Research Football Environment: the Past, Present, and Future
Yan Song
He Jiang
Haifeng Zhang
Zheng Tian
Weinan Zhang
Jun Wang
OffRL
60
8
0
22 Sep 2023
Hierarchical reinforcement learning with natural language subgoals
Hierarchical reinforcement learning with natural language subgoals
Arun Ahuja
Kavya Kopparapu
Rob Fergus
Ishita Dasgupta
49
1
0
20 Sep 2023
RoboAgent: Generalization and Efficiency in Robot Manipulation via
  Semantic Augmentations and Action Chunking
RoboAgent: Generalization and Efficiency in Robot Manipulation via Semantic Augmentations and Action Chunking
Homanga Bharadhwaj
Jay Vakil
Mohit Sharma
Abhi Gupta
Shubham Tulsiani
Vikash Kumar
LM&Ro
118
132
0
05 Sep 2023
LoopTune: Optimizing Tensor Computations with Reinforcement Learning
LoopTune: Optimizing Tensor Computations with Reinforcement Learning
Dejan Grubisic
Bram Wasti
Chris Cummins
John Mellor-Crummey
A. Zlateski
67
1
0
04 Sep 2023
Neurosymbolic Reinforcement Learning and Planning: A Survey
Neurosymbolic Reinforcement Learning and Planning: A Survey
Kamal Acharya
Waleed Raza
Carlos Dourado
Alvaro Velasquez
Houbing Song
NAIOffRL
90
17
0
02 Sep 2023
D-VAT: End-to-End Visual Active Tracking for Micro Aerial Vehicles
D-VAT: End-to-End Visual Active Tracking for Micro Aerial Vehicles
Alberto Dionigi
Simone Felicioni
Mirko Leomanni
G. Costante
69
10
0
31 Aug 2023
Cyclophobic Reinforcement Learning
Cyclophobic Reinforcement Learning
Stefan Sylvius Wagner
P. Arndt
Jan Robine
Stefan Harmeling
68
1
0
30 Aug 2023
Previous
12345...181920
Next