
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 981 papers shown
Learning Representations in Reinforcement Learning: An Information Bottleneck Approach
Yingjun Pei
Xinwen Hou
SSL
39
10
0
12 Nov 2019
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation
Shangtong Zhang
Bo Liu
Hengshuai Yao
Shimon Whiteson
OffRL
29
8
0
11 Nov 2019
Multi-Agent Connected Autonomous Driving using Deep Reinforcement Learning
Praveen Palanisamy
45
142
0
11 Nov 2019
DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning
Bharathan Balaji
S. Mallya
Sahika Genc
Saurabh Gupta
Leo Dirac
...
Yunzhe Tao
Brian Townsend
E. Calleja
Sunil Muralidhara
Dhanasekar Karuppasamy
21
56
0
05 Nov 2019
Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards
Alexander R. Trott
Stephan Zheng
Caiming Xiong
R. Socher
66
109
0
04 Nov 2019
Gradient-based Adaptive Markov Chain Monte Carlo
Michalis K. Titsias
P. Dellaportas
BDL
39
22
0
04 Nov 2019
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames
Erik Wijmans
Abhishek Kadian
Ari S. Morcos
Stefan Lee
Irfan Essa
Devi Parikh
Manolis Savva
Dhruv Batra
42
469
0
01 Nov 2019
Generalization of Reinforcement Learners with Working and Episodic Memory
Meire Fortunato
Melissa Tan
Ryan Faulkner
Steven Hansen
Adria Puigdomenech Badia
Gavin Buttimore
Charlie Deck
Joel Z Leibo
Charles Blundell
27
70
0
29 Oct 2019
Asynchronous Methods for Model-Based Reinforcement Learning
Yunzhi Zhang
I. Clavera
Bo-Yu Tsai
Pieter Abbeel
OffRL
19
27
0
28 Oct 2019
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Tianhe Yu
Deirdre Quillen
Zhanpeng He
Ryan Julian
Avnish Narayan
Hayden Shively
Adithya Bellathur
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
101
1,132
0
24 Oct 2019
Improving the Gating Mechanism of Recurrent Neural Networks
Albert Gu
Çağlar Gülçehre
T. Paine
Matthew W. Hoffman
Razvan Pascanu
AI4CE
19
2
0
22 Oct 2019
Collaborative Graph Walk for Semi-supervised Multi-Label Node Classification
Uchenna Akujuobi
Yufei Han
Qiannan Zhang
Xiangliang Zhang
25
16
0
22 Oct 2019
Modelling Generalized Forces with Reinforcement Learning for Sim-to-Real Transfer
Rae Jeong
Jackie Kay
Francesco Romano
Thomas Lampe
Thomas Rothörl
A. Abdolmaleki
Tom Erez
Yuval Tassa
F. Nori
6
23
0
21 Oct 2019
Dealing with Sparse Rewards in Reinforcement Learning
J. Hare
21
77
0
21 Oct 2019
Regularization Matters in Policy Optimization
Zhuang Liu
Xuanlin Li
Bingyi Kang
Trevor Darrell
OffRL
25
33
0
21 Oct 2019
Dynamic Subgoal-based Exploration via Bayesian Optimization
Yijia Wang
Matthias Poloczek
Daniel R. Jiang
34
3
0
21 Oct 2019
RTFM: Generalising to Novel Environment Dynamics via Reading
Victor Zhong
Tim Rocktaschel
Edward Grefenstette
LLMAG
OffRL
AI4CE
19
54
0
18 Oct 2019
A Hybrid Compact Neural Architecture for Visual Place Recognition
Marvin Chancán
Luis Hernandez-Nunez
A. Narendra
A. Barron
Michael Milford
25
55
0
15 Oct 2019
Stabilizing Transformers for Reinforcement Learning
Emilio Parisotto
H. F. Song
Jack W. Rae
Razvan Pascanu
Çağlar Gülçehre
...
Aidan Clark
Seb Noury
M. Botvinick
N. Heess
R. Hadsell
OffRL
22
360
0
13 Oct 2019
CityLearn: Diverse Real-World Environments for Sample-Efficient Navigation Policy Learning
Marvin Chancán
Michael Milford
SSL
27
5
0
10 Oct 2019
Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models
Arunkumar Byravan
Jost Tobias Springenberg
A. Abdolmaleki
Roland Hafner
Michael Neunert
Thomas Lampe
Noah Y. Siegel
N. Heess
Martin Riedmiller
OffRL
11
41
0
09 Oct 2019
MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions
V. Sivakumar
Olivier Delalleau
Tim Rocktaschel
Alexander H. Miller
Heinrich Küttler
Nantas Nardelli
Michael G. Rabbat
Joelle Pineau
Sebastian Riedel
18
36
0
09 Oct 2019
Policy Optimization Through Approximate Importance Sampling
Marcin Tomczak
Dongho Kim
Peter Vrancx
Kyungmin Kim
17
4
0
09 Oct 2019
TorchBeast: A PyTorch Platform for Distributed RL
Heinrich Küttler
Nantas Nardelli
Thibaut Lavril
Marco Selvatici
V. Sivakumar
Tim Rocktaschel
Edward Grefenstette
OffRL
19
58
0
08 Oct 2019
QuaRL: Quantization for Fast and Environmentally Sustainable Reinforcement Learning
Srivatsan Krishnan
Maximilian Lam
Sharad Chitlangia
Zishen Wan
Gabriel Barth-Maron
Aleksandra Faust
Vijay Janapa Reddi
MQ
29
23
0
02 Oct 2019
Environmental drivers of systematicity and generalization in a situated agent
Felix Hill
Andrew Kyle Lampinen
R. Schneider
S. Clark
M. Botvinick
James L. McClelland
Adam Santoro
OOD
14
103
0
01 Oct 2019
SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning
Linxi Fan
Yuke Zhu
Jiren Zhu
Zihua Liu
Orien Zeng
Anchit Gupta
Joan Creus-Costa
Silvio Savarese
Li Fei-Fei
OffRL
GNN
43
3
0
27 Sep 2019
Automated curricula through setter-solver interactions
S. Racanière
Andrew Kyle Lampinen
Adam Santoro
David P. Reichert
Vlad Firoiu
Timothy Lillicrap
31
53
0
27 Sep 2019
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
H. F. Song
A. Abdolmaleki
Jost Tobias Springenberg
Aidan Clark
Hubert Soyer
...
Dhruva Tirumala
N. Heess
Dan Belov
Martin Riedmiller
M. Botvinick
37
121
0
26 Sep 2019
MERL: Multi-Head Reinforcement Learning
Yannis Flet-Berliac
Philippe Preux
OffRL
11
13
0
26 Sep 2019
Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt
Matteo Hessel
Karen Simonyan
OffRL
27
68
0
25 Sep 2019
The Animal-AI Environment: Training and Testing Animal-Like Artificial Cognition
Benjamin Beyret
José Hernández-Orallo
Lucy G. Cheke
Marta Halina
Murray Shanahan
Matthew Crosby
19
35
0
12 Sep 2019
Discovery of Useful Questions as Auxiliary Tasks
Vivek Veeriah
Matteo Hessel
Zhongwen Xu
Richard L. Lewis
Janarthanan Rajendran
Junhyuk Oh
H. V. Hasselt
David Silver
Satinder Singh
LLMAG
22
86
0
10 Sep 2019
Logic and the $2$-Simplicial Transformer
James Clift
D. Doryn
Daniel Murfet
James Wallbridge
NAI
21
3
0
02 Sep 2019
An Open-Source Framework for Adaptive Traffic Signal Control
Wade Genders
S. Razavi
14
29
0
01 Sep 2019
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence
Lingxiao Wang
Qi Cai
Zhuoran Yang
Zhaoran Wang
25
236
0
29 Aug 2019
Dynamics-aware Embeddings
William F. Whitney
Rajat Agarwal
Kyunghyun Cho
Abhinav Gupta
SSL
25
53
0
25 Aug 2019
Feature Partitioning for Efficient Multi-Task Architectures
Alejandro Newell
Lu Jiang
Chong-Jun Wang
Li-Jia Li
Jia Deng
30
17
0
12 Aug 2019
Free-Lunch Saliency via Attention in Atari Agents
Dmitry Nikulin
A. Ianina
Vladimir Aliev
Sergey I. Nikolenko
FAtt
25
24
0
07 Aug 2019
Benchmarking Bonus-Based Exploration Methods on the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
12
40
0
06 Aug 2019
Google Research Football: A Novel Reinforcement Learning Environment
Karol Kurach
Anton Raichuk
Piotr Stańczyk
Michal Zajac
Olivier Bachem
...
C. Riquelme
Damien Vincent
Marcin Michalski
Olivier Bousquet
Sylvain Gelly
54
398
0
25 Jul 2019
Variance Reduction in Actor Critic Methods (ACM)
Eric Benhamou
OffRL
18
143
0
23 Jul 2019
Accelerating Reinforcement Learning through GPU Atari Emulation
Steven Dalton
I. Frosio
M. Garland
ELM
27
9
0
19 Jul 2019
Proximal Policy Optimization with Mixed Distributed Training
Zhenyu Zhang
Xiangfeng Luo
Tong Liu
Shaorong Xie
Jianshu Wang
Wei Wang
Heng Chang
Yan Peng
OffRL
24
21
0
15 Jul 2019
Learning Safe Unlabeled Multi-Robot Planning with Motion Constraints
Arbaaz Khan
Chi Zhang
Shuo Li
J. Wu
Brent Schlotfeldt
Sarah Tang
Alejandro Ribeiro
Osbert Bastani
Vijay Kumar
14
28
0
11 Jul 2019
On Inductive Biases in Deep Reinforcement Learning
Matteo Hessel
H. V. Hasselt
Joseph Modayil
David Silver
AI4CE
30
41
0
05 Jul 2019
Attentive Multi-Task Deep Reinforcement Learning
Timo Bram
Gino Brunner
Oliver Richter
Roger Wattenhofer
CLL
23
18
0
05 Jul 2019
Generalizing from a few environments in safety-critical reinforcement learning
Zachary Kenton
Angelos Filos
Owain Evans
Y. Gal
10
16
0
02 Jul 2019
Growing Action Spaces
Gregory Farquhar
Laura Gustafson
Zeming Lin
Shimon Whiteson
Nicolas Usunier
Gabriel Synnaeve
14
38
0
28 Jun 2019
Learning Policies through Quantile Regression
Oliver Richter
Roger Wattenhofer
16
0
0
27 Jun 2019