ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.01561
  4. Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures
v1v2v3 (latest)

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 1,000 papers shown
Title
Adversarial Style Transfer for Robust Policy Optimization in Deep
  Reinforcement Learning
Adversarial Style Transfer for Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
59
4
0
29 Aug 2023
Go Beyond Imagination: Maximizing Episodic Reachability with World
  Models
Go Beyond Imagination: Maximizing Episodic Reachability with World Models
Yao Fu
Run Peng
Honglak Lee
68
1
0
25 Aug 2023
Reinforcement Learning Informed Evolutionary Search for Autonomous
  Systems Testing
Reinforcement Learning Informed Evolutionary Search for Autonomous Systems Testing
D. Humeniuk
Foutse Khomh
G. Antoniol
59
4
0
24 Aug 2023
Reinforced Self-Training (ReST) for Language Modeling
Reinforced Self-Training (ReST) for Language Modeling
Çağlar Gülçehre
T. Paine
S. Srinivasan
Ksenia Konyushkova
L. Weerts
...
Chenjie Gu
Wolfgang Macherey
Arnaud Doucet
Orhan Firat
Nando de Freitas
OffRL
129
309
0
17 Aug 2023
Scope Loss for Imbalanced Classification and RL Exploration
Scope Loss for Imbalanced Classification and RL Exploration
Hasham Burhani
Xiaolong Shi
Jonathan Jaegerman
Daniel Balicki
68
0
0
08 Aug 2023
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Michaël Mathieu
Sherjil Ozair
Srivatsan Srinivasan
Çağlar Gülçehre
Shangtong Zhang
...
Sergio Gomez Colmenarejo
Aaron van den Oord
Wojciech M. Czarnecki
Nando de Freitas
Oriol Vinyals
OffRL
52
10
0
07 Aug 2023
Bag of Policies for Distributional Deep Exploration
Bag of Policies for Distributional Deep Exploration
Asen Nachkov
Luchen Li
Giulia Luise
Filippo Valdettaro
Aldo A. Faisal
OffRL
84
0
0
03 Aug 2023
Learning to Model the World with Language
Learning to Model the World with Language
Jessy Lin
Yuqing Du
Olivia Watkins
Danijar Hafner
Pieter Abbeel
Dan Klein
Anca Dragan
LM&RoSyDa
130
55
0
31 Jul 2023
Robust Multi-Agent Reinforcement Learning with State Uncertainty
Robust Multi-Agent Reinforcement Learning with State Uncertainty
Sihong He
Songyang Han
Sanbao Su
Shuo Han
Shaofeng Zou
Fei Miao
OOD
77
47
0
30 Jul 2023
A new Gradient TD Algorithm with only One Step-size: Convergence Rate
  Analysis using $L$-$λ$ Smoothness
A new Gradient TD Algorithm with only One Step-size: Convergence Rate Analysis using LLL-λλλ Smoothness
Hengshuai Yao
62
2
0
29 Jul 2023
Thinker: Learning to Plan and Act
Thinker: Learning to Plan and Act
Stephen Chung
Ivan Anokhin
David M. Krueger
LLMAGOffRLLRM
54
9
0
27 Jul 2023
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
Seohong Park
Dibya Ghosh
Benjamin Eysenbach
Sergey Levine
OffRL
128
61
0
22 Jul 2023
Scaling Laws for Imitation Learning in Single-Agent Games
Scaling Laws for Imitation Learning in Single-Agent Games
Jens Tuyls
Dhruv Madeka
Kari Torkkola
Dean Phillips Foster
Karthik Narasimhan
Sham Kakade
48
5
0
18 Jul 2023
IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on
  Analyses of Interestingness
IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on Analyses of Interestingness
Pedro Sequeira
Melinda Gervasio
53
2
0
18 Jul 2023
LuckyMera: a Modular AI Framework for Building Hybrid NetHack Agents
LuckyMera: a Modular AI Framework for Building Hybrid NetHack Agents
Luigi Quarantiello
Simone Marzeddu
Antonio Guzzi
Vincenzo Lomonaco
46
0
0
17 Jul 2023
`It is currently hodgepodge'': Examining AI/ML Practitioners' Challenges
  during Co-production of Responsible AI Values
`It is currently hodgepodge'': Examining AI/ML Practitioners' Challenges during Co-production of Responsible AI Values
R. Varanasi
Nitesh Goyal
78
48
0
14 Jul 2023
A Survey From Distributed Machine Learning to Distributed Deep Learning
A Survey From Distributed Machine Learning to Distributed Deep Learning
Mohammad Dehghani
Zahra Yazdanparast
118
0
0
11 Jul 2023
SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained
  Networks
SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks
Xingyu Lin
John So
Sashwat Mahalingam
Fangchen Liu
Pieter Abbeel
SSL
92
26
0
07 Jul 2023
Discovering Hierarchical Achievements in Reinforcement Learning via
  Contrastive Learning
Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning
Seungyong Moon
Junyoung Yeom
Bumsoo Park
Hyun Oh Song
OffRL
95
5
0
07 Jul 2023
Eigensubspace of Temporal-Difference Dynamics and How It Improves Value
  Approximation in Reinforcement Learning
Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning
Qiang He
Dinesh Manocha
Meng Fang
S. Maghsudi
76
5
0
29 Jun 2023
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand
  Cores
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
Zhiyu Mei
Wei Fu
Jiaxuan Gao
Guang Wang
Huanchen Zhang
Yi Wu
OffRLLRM
68
6
0
29 Jun 2023
Training Deep Surrogate Models with Large Scale Online Learning
Training Deep Surrogate Models with Large Scale Online Learning
Lucas Meyer
M. Schouler
R. Caulk
Alejandro Ribés
Bruno Raffin
3DGSAI4CE
89
5
0
28 Jun 2023
Value-aware Importance Weighting for Off-policy Reinforcement Learning
Value-aware Importance Weighting for Off-policy Reinforcement Learning
Kristopher De Asis
Eric Graves
R. Sutton
OffRL
60
1
0
27 Jun 2023
Large Sequence Models for Sequential Decision-Making: A Survey
Large Sequence Models for Sequential Decision-Making: A Survey
Muning Wen
Runji Lin
Hanjing Wang
Yaodong Yang
Ying Wen
Kai Zou
Jun Wang
Haifeng Zhang
Weinan Zhang
LM&RoLRM
102
36
0
24 Jun 2023
Acceleration in Policy Optimization
Acceleration in Policy Optimization
Veronica Chelu
Tom Zahavy
A. Guez
Doina Precup
Sebastian Flennerhag
95
0
0
18 Jun 2023
Behavioral Cloning via Search in Embedded Demonstration Dataset
Behavioral Cloning via Search in Embedded Demonstration Dataset
Federico Malato
Florian Leopold
Ville Hautamaki
Andrew Melnik
OffRL
65
3
0
15 Jun 2023
OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments
OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments
Quentin Delfosse
Johannes Czech
Bjarne Gregori
Sebastian Sztwiertnia
Kristian Kersting
108
18
0
14 Jun 2023
Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at
  100k Steps-Per-Second
Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second
Vincent-Pierre Berges
Andrew Szot
Devendra Singh Chaplot
Aaron Gokaslan
Roozbeh Mottaghi
Dhruv Batra
Eric Undersander
LRMLM&Ro
96
5
0
13 Jun 2023
Diverse Projection Ensembles for Distributional Reinforcement Learning
Diverse Projection Ensembles for Distributional Reinforcement Learning
Moritz A. Zanger
Wendelin Bohmer
M. Spaan
57
6
0
12 Jun 2023
Design Principles for Model Generalization and Scalable AI Integration
  in Radio Access Networks
Design Principles for Model Generalization and Scalable AI Integration in Radio Access Networks
Pablo Soldati
E. Ghadimi
Burak Demirel
Yu Wang
Raimundas Gaigalas
Mathias Sintorn
41
3
0
09 Jun 2023
On the Importance of Exploration for Generalization in Reinforcement
  Learning
On the Importance of Exploration for Generalization in Reinforcement Learning
Yiding Jiang
J. Zico Kolter
Roberta Raileanu
UQCVOffRL
80
24
0
08 Jun 2023
Rewarded soups: towards Pareto-optimal alignment by interpolating
  weights fine-tuned on diverse rewards
Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards
Alexandre Ramé
Guillaume Couairon
Mustafa Shukor
Corentin Dancette
Jean-Baptiste Gaya
Laure Soulier
Matthieu Cord
MoMe
120
158
0
07 Jun 2023
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data
Chongyi Zheng
Benjamin Eysenbach
Homer Walke
Patrick Yin
Kuan Fang
Ruslan Salakhutdinov
Sergey Levine
OffRLSSL
92
6
0
06 Jun 2023
A Study of Global and Episodic Bonuses for Exploration in Contextual
  MDPs
A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs
Mikael Henaff
Minqi Jiang
Roberta Raileanu
90
13
0
05 Jun 2023
Explore to Generalize in Zero-Shot RL
Explore to Generalize in Zero-Shot RL
E. Zisselman
Itai Lavie
Daniel Soudry
Aviv Tamar
102
17
0
05 Jun 2023
Evaluating Continual Learning on a Home Robot
Evaluating Continual Learning on a Home Robot
Sam Powers
Abhi Gupta
Chris Paxton
CLL
96
3
0
04 Jun 2023
TorchRL: A data-driven decision-making library for PyTorch
TorchRL: A data-driven decision-making library for PyTorch
Albert Bou
Matteo Bettini
Sebastian Dittert
Vikash Kumar
Shagun Sodhani
Xiaomeng Yang
Gianni De Fabritiis
Vincent Moens
OffRLAI4CE
126
41
0
01 Jun 2023
Accelerating Reinforcement Learning with Value-Conditional State Entropy
  Exploration
Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration
Dongyoung Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
78
22
0
31 May 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Rameswar Panda
Marc G. Bellemare
Rishabh Agarwal
Pablo Samuel Castro
OffRL
124
102
0
30 May 2023
Exploring the Promise and Limits of Real-Time Recurrent Learning
Exploring the Promise and Limits of Real-Time Recurrent Learning
Kazuki Irie
Anand Gopalakrishnan
Jürgen Schmidhuber
75
16
0
30 May 2023
Doing the right thing for the right reason: Evaluating artificial moral
  cognition by probing cost insensitivity
Doing the right thing for the right reason: Evaluating artificial moral cognition by probing cost insensitivity
Yiran Mao
Madeline G. Reinecke
M. Kunesch
Edgar A. Duénez-Guzmán
Ramona Comanescu
Julia Haas
Joel Z Leibo
66
2
0
29 May 2023
RLAD: Reinforcement Learning from Pixels for Autonomous Driving in Urban
  Environments
RLAD: Reinforcement Learning from Pixels for Autonomous Driving in Urban Environments
Daniel Coelho
Miguel Oliveira
Vítor M. F. Santos
50
4
0
29 May 2023
DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
Yunhao Tang
Tadashi Kozuno
Mark Rowland
Anna Harutyunyan
Rémi Munos
Bernardo Avila-Pires
Michal Valko
37
0
0
29 May 2023
Coherent Soft Imitation Learning
Coherent Soft Imitation Learning
Joe Watson
Sandy H. Huang
Nicholas Heess
93
12
0
25 May 2023
Deep Reinforcement Learning with Plasticity Injection
Deep Reinforcement Learning with Plasticity Injection
Evgenii Nikishin
Junhyuk Oh
Georg Ostrovski
Clare Lyle
Razvan Pascanu
Will Dabney
André Barreto
OffRL
66
52
0
24 May 2023
Barkour: Benchmarking Animal-level Agility with Quadruped Robots
Barkour: Benchmarking Animal-level Agility with Quadruped Robots
Ken Caluwaerts
Atil Iscen
J. Kew
Wenhao Yu
Tingnan Zhang
...
J. Seto
Carolina Parada
Vikas Sindhwani
Vincent Vanhoucke
Jie Tan
75
63
0
24 May 2023
Co-Learning Empirical Games and World Models
Co-Learning Empirical Games and World Models
Max O. Smith
Michael P. Wellman
119
2
0
23 May 2023
Policy Representation via Diffusion Probability Model for Reinforcement
  Learning
Policy Representation via Diffusion Probability Model for Reinforcement Learning
Long Yang
Zhixiong Huang
Fenghao Lei
Yucun Zhong
Yiming Yang
Cong Fang
Shiting Wen
Binbin Zhou
Zhouchen Lin
DiffM
111
53
0
22 May 2023
Learning Diverse Risk Preferences in Population-based Self-play
Learning Diverse Risk Preferences in Population-based Self-play
Y. Jiang
Qihan Liu
Xiaoteng Ma
Chenghao Li
Yiqin Yang
Jun Yang
Bin Liang
Qianchuan Zhao
134
6
0
19 May 2023
Sharing Lifelong Reinforcement Learning Knowledge via Modulating Masks
Sharing Lifelong Reinforcement Learning Knowledge via Modulating Masks
Saptarshi Nath
Christos Peridis
Eseoghene Ben-Iwhiwhu
Xinran Liu
Shirin Dora
Cong Liu
Soheil Kolouri
Andrea Soltoggio
CLL
76
10
0
18 May 2023
Previous
123456...181920
Next