ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.01561
  4. Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
ArXivPDFHTML

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 981 papers shown
Title
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand
  Cores
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
Zhiyu Mei
Wei Fu
Jiaxuan Gao
Guang Wang
Huanchen Zhang
Yi Wu
OffRL
LRM
40
5
0
29 Jun 2023
Training Deep Surrogate Models with Large Scale Online Learning
Training Deep Surrogate Models with Large Scale Online Learning
Lucas Meyer
M. Schouler
R. Caulk
Alejandro Ribés
Bruno Raffin
3DGS
AI4CE
30
4
0
28 Jun 2023
Value-aware Importance Weighting for Off-policy Reinforcement Learning
Value-aware Importance Weighting for Off-policy Reinforcement Learning
Kristopher De Asis
Eric Graves
R. Sutton
OffRL
13
1
0
27 Jun 2023
Large Sequence Models for Sequential Decision-Making: A Survey
Large Sequence Models for Sequential Decision-Making: A Survey
Muning Wen
Runji Lin
Hanjing Wang
Yaodong Yang
Ying Wen
Luo Mai
Jun Wang
Haifeng Zhang
Weinan Zhang
LM&Ro
LRM
37
35
0
24 Jun 2023
Acceleration in Policy Optimization
Acceleration in Policy Optimization
Veronica Chelu
Tom Zahavy
A. Guez
Doina Precup
Sebastian Flennerhag
48
0
0
18 Jun 2023
Behavioral Cloning via Search in Embedded Demonstration Dataset
Behavioral Cloning via Search in Embedded Demonstration Dataset
Federico Malato
Florian Leopold
Ville Hautamaki
Andrew Melnik
OffRL
29
3
0
15 Jun 2023
OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments
OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments
Quentin Delfosse
Jannis Blüml
Bjarne Gregori
Sebastian Sztwiertnia
Kristian Kersting
46
17
0
14 Jun 2023
Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at
  100k Steps-Per-Second
Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second
Vincent-Pierre Berges
Andrew Szot
Devendra Singh Chaplot
Aaron Gokaslan
Roozbeh Mottaghi
Dhruv Batra
Eric Undersander
LRM
LM&Ro
40
5
0
13 Jun 2023
Diverse Projection Ensembles for Distributional Reinforcement Learning
Diverse Projection Ensembles for Distributional Reinforcement Learning
Moritz A. Zanger
Wendelin Bohmer
M. Spaan
33
4
0
12 Jun 2023
Design Principles for Model Generalization and Scalable AI Integration
  in Radio Access Networks
Design Principles for Model Generalization and Scalable AI Integration in Radio Access Networks
Pablo Soldati
E. Ghadimi
Burak Demirel
Yu Wang
Raimundas Gaigalas
Mathias Sintorn
11
3
0
09 Jun 2023
On the Importance of Exploration for Generalization in Reinforcement
  Learning
On the Importance of Exploration for Generalization in Reinforcement Learning
Yiding Jiang
J. Zico Kolter
Roberta Raileanu
UQCV
OffRL
34
20
0
08 Jun 2023
Rewarded soups: towards Pareto-optimal alignment by interpolating
  weights fine-tuned on diverse rewards
Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards
Alexandre Ramé
Guillaume Couairon
Mustafa Shukor
Corentin Dancette
Jean-Baptiste Gaya
Laure Soulier
Matthieu Cord
MoMe
35
136
0
07 Jun 2023
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from
  Offline Data
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data
Chongyi Zheng
Benjamin Eysenbach
Homer Walke
Patrick Yin
Kuan Fang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
42
4
0
06 Jun 2023
A Study of Global and Episodic Bonuses for Exploration in Contextual
  MDPs
A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs
Mikael Henaff
Minqi Jiang
Roberta Raileanu
46
13
0
05 Jun 2023
Explore to Generalize in Zero-Shot RL
Explore to Generalize in Zero-Shot RL
E. Zisselman
Itai Lavie
Daniel Soudry
Aviv Tamar
31
15
0
05 Jun 2023
Evaluating Continual Learning on a Home Robot
Evaluating Continual Learning on a Home Robot
Sam Powers
Abhi Gupta
Chris Paxton
CLL
36
3
0
04 Jun 2023
TorchRL: A data-driven decision-making library for PyTorch
TorchRL: A data-driven decision-making library for PyTorch
Albert Bou
Matteo Bettini
Sebastian Dittert
Vikash Kumar
Shagun Sodhani
Xiaomeng Yang
Gianni De Fabritiis
Vincent Moens
OffRL
AI4CE
30
37
0
01 Jun 2023
Accelerating Reinforcement Learning with Value-Conditional State Entropy
  Exploration
Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration
Dongyoung Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
32
19
0
31 May 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Rameswar Panda
Marc G. Bellemare
Rishabh Agarwal
Pablo Samuel Castro
OffRL
54
85
0
30 May 2023
Exploring the Promise and Limits of Real-Time Recurrent Learning
Exploring the Promise and Limits of Real-Time Recurrent Learning
Kazuki Irie
Anand Gopalakrishnan
Jürgen Schmidhuber
32
15
0
30 May 2023
Doing the right thing for the right reason: Evaluating artificial moral
  cognition by probing cost insensitivity
Doing the right thing for the right reason: Evaluating artificial moral cognition by probing cost insensitivity
Yiran Mao
Madeline G. Reinecke
M. Kunesch
Edgar A. Duénez-Guzmán
Ramona Comanescu
Julia Haas
Joel Z. Leibo
40
2
0
29 May 2023
RLAD: Reinforcement Learning from Pixels for Autonomous Driving in Urban
  Environments
RLAD: Reinforcement Learning from Pixels for Autonomous Driving in Urban Environments
Daniel Coelho
Miguel Oliveira
Vítor M. F. Santos
25
3
0
29 May 2023
DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
Yunhao Tang
Tadashi Kozuno
Mark Rowland
Anna Harutyunyan
Rémi Munos
Bernardo Avila-Pires
Michal Valko
11
0
0
29 May 2023
Coherent Soft Imitation Learning
Coherent Soft Imitation Learning
Joe Watson
Sandy H. Huang
Nicholas Heess
36
11
0
25 May 2023
Deep Reinforcement Learning with Plasticity Injection
Deep Reinforcement Learning with Plasticity Injection
Evgenii Nikishin
Junhyuk Oh
Georg Ostrovski
Clare Lyle
Razvan Pascanu
Will Dabney
André Barreto
OffRL
26
50
0
24 May 2023
Barkour: Benchmarking Animal-level Agility with Quadruped Robots
Barkour: Benchmarking Animal-level Agility with Quadruped Robots
Ken Caluwaerts
Atil Iscen
J. Kew
Wenhao Yu
Tingnan Zhang
...
J. Seto
Carolina Parada
Vikas Sindhwani
Vincent Vanhoucke
Jie Tan
27
59
0
24 May 2023
Co-Learning Empirical Games and World Models
Co-Learning Empirical Games and World Models
Max O. Smith
Michael P. Wellman
24
2
0
23 May 2023
Policy Representation via Diffusion Probability Model for Reinforcement
  Learning
Policy Representation via Diffusion Probability Model for Reinforcement Learning
Long Yang
Zhixiong Huang
Fenghao Lei
Yucun Zhong
Yiming Yang
Cong Fang
Shiting Wen
Binbin Zhou
Zhouchen Lin
DiffM
41
40
0
22 May 2023
Learning Diverse Risk Preferences in Population-based Self-play
Learning Diverse Risk Preferences in Population-based Self-play
Y. Jiang
Qihan Liu
Xiaoteng Ma
Chenghao Li
Yiqin Yang
Jun Yang
Bin Liang
Qianchuan Zhao
54
5
0
19 May 2023
Sharing Lifelong Reinforcement Learning Knowledge via Modulating Masks
Sharing Lifelong Reinforcement Learning Knowledge via Modulating Masks
Saptarshi Nath
Christos Peridis
Eseoghene Ben-Iwhiwhu
Xinran Liu
Shirin Dora
Cong Liu
Soheil Kolouri
Andrea Soltoggio
CLL
33
8
0
18 May 2023
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup
  and Beyond
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond
Jiin Woo
Gauri Joshi
Yuejie Chi
FedML
35
19
0
18 May 2023
An Empirical Study on Google Research Football Multi-agent Scenarios
An Empirical Study on Google Research Football Multi-agent Scenarios
Yan Song
He Jiang
Zheng Tian
Haifeng Zhang
Yingping Zhang
Jiangcheng Zhu
Zonghong Dai
Weinan Zhang
Jun Wang
47
5
0
16 May 2023
Cooperative Multi-Agent Reinforcement Learning: Asynchronous
  Communication and Linear Function Approximation
Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation
Yifei Min
Jiafan He
Tianhao Wang
Quanquan Gu
38
7
0
10 May 2023
Learnable Behavior Control: Breaking Atari Human World Records via
  Sample-Efficient Behavior Selection
Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection
Jiajun Fan
Yuzheng Zhuang
Yuecheng Liu
Jianye Hao
Bin Wang
Jiangcheng Zhu
Hao Wang
Shutao Xia
34
17
0
09 May 2023
Truncating Trajectories in Monte Carlo Reinforcement Learning
Truncating Trajectories in Monte Carlo Reinforcement Learning
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
26
2
0
07 May 2023
Train a Real-world Local Path Planner in One Hour via Partially
  Decoupled Reinforcement Learning and Vectorized Diversity
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity
Jinghao Xin
Jinwoo Kim
Zhiyu Li
Ning Li
OffRL
28
3
0
07 May 2023
Single Node Injection Label Specificity Attack on Graph Neural Networks
  via Reinforcement Learning
Single Node Injection Label Specificity Attack on Graph Neural Networks via Reinforcement Learning
Dayuan Chen
Jian Zhang
Yuqian Lv
Jinhuan Wang
Hongjie Ni
Shanqing Yu
Zhen Wang
Qi Xuan
AAML
33
3
0
04 May 2023
Unlocking the Power of Representations in Long-term Novelty-based
  Exploration
Unlocking the Power of Representations in Long-term Novelty-based Exploration
Alaa Saade
Steven Kapturowski
Daniele Calandriello
Charles Blundell
Pablo Sprechmann
Leopoldo Sarra
Oliver Groth
Michal Valko
Bilal Piot
OffRL
37
6
0
02 May 2023
Heterogeneous Social Value Orientation Leads to Meaningful Diversity in
  Sequential Social Dilemmas
Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas
Udari Madhushani
Kevin R. McKee
J. Agapiou
Joel Z. Leibo
Richard Everett
Thomas W. Anthony
Edward Hughes
K. Tuyls
Edgar A. Duénez-Guzmán
49
2
0
01 May 2023
Representations and Exploration for Deep Reinforcement Learning using
  Singular Value Decomposition
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Yash Chandak
S. Thakoor
Z. Guo
Yunhao Tang
Rémi Munos
Will Dabney
Diana Borsa
24
2
0
01 May 2023
Learning Achievement Structure for Structured Exploration in Domains
  with Sparse Reward
Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward
Zihan Zhou
Animesh Garg
OffRL
27
3
0
30 Apr 2023
Fundamental Tradeoffs in Learning with Prior Information
Fundamental Tradeoffs in Learning with Prior Information
Anirudha Majumdar
32
0
0
26 Apr 2023
Proto-Value Networks: Scaling Representation Learning with Auxiliary
  Tasks
Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
Jesse Farebrother
Joshua Greaves
Rishabh Agarwal
Charline Le Lan
Ross Goroshin
Pablo Samuel Castro
Marc G. Bellemare
54
25
0
25 Apr 2023
DEIR: Efficient and Robust Exploration through
  Discriminative-Model-Based Episodic Intrinsic Rewards
DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards
Shanchuan Wan
Yujin Tang
Yingtao Tian
Tomoyuki Kaneko
OffRL
22
4
0
21 Apr 2023
Using Offline Data to Speed-up Reinforcement Learning in Procedurally
  Generated Environments
Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments
Alain Andres
Lukas Schafer
Esther Villar-Rodriguez
Stefano V. Albrecht
Javier Del Ser
OffRL
OnRL
34
2
0
18 Apr 2023
Efficient Automation of Neural Network Design: A Survey on
  Differentiable Neural Architecture Search
Efficient Automation of Neural Network Design: A Survey on Differentiable Neural Architecture Search
Alexandre Heuillet
A. Nasser
Hichem Arioui
Hedi Tabia
AI4CE
35
12
0
11 Apr 2023
Reinforcement Learning from Passive Data via Latent Intentions
Reinforcement Learning from Passive Data via Latent Intentions
Dibya Ghosh
Chethan Bhateja
Sergey Levine
OffRL
33
44
0
10 Apr 2023
Planning with Sequence Models through Iterative Energy Minimization
Planning with Sequence Models through Iterative Energy Minimization
Hongyi Chen
Yilun Du
Yiye Chen
J. Tenenbaum
Patricio A. Vela
32
6
0
28 Mar 2023
marl-jax: Multi-Agent Reinforcement Leaning Framework
marl-jax: Multi-Agent Reinforcement Leaning Framework
K. Mehta
Anuj Mahajan
Kiran Ravish
34
3
0
24 Mar 2023
SVDE: Scalable Value-Decomposition Exploration for Cooperative
  Multi-Agent Reinforcement Learning
SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Qiang-qiang Wang
Jia-jia Zhang
Jing Xiao
Xihuai Wang
34
0
0
16 Mar 2023
Previous
123456...181920
Next