ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.01561
  4. Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
ArXivPDFHTML

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 982 papers shown
Title
Launchpad: A Programming Model for Distributed Machine Learning Research
Launchpad: A Programming Model for Distributed Machine Learning Research
Fan Yang
Gabriel Barth-Maron
Piotr Stańczyk
Matthew Hoffman
Siqi Liu
M. Kroiss
Aedan Pope
Alban Rrustemi
18
24
0
07 Jun 2021
Towards robust and domain agnostic reinforcement learning competitions
Towards robust and domain agnostic reinforcement learning competitions
William H. Guss
Stephanie Milani
Nicholay Topin
Brandon Houghton
Sharada Mohanty
...
Lu Liu
Daichi Nishio
Toi Tsuneda
Karolis Ramanauskas
Gabija Juceviciute
OOD
27
2
0
07 Jun 2021
Same State, Different Task: Continual Reinforcement Learning without
  Interference
Same State, Different Task: Continual Reinforcement Learning without Interference
Samuel Kessler
Jack Parker-Holder
Philip J. Ball
S. Zohren
Stephen J. Roberts
CLL
OffRL
21
46
0
05 Jun 2021
MALib: A Parallel Framework for Population-based Multi-agent
  Reinforcement Learning
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Ming Zhou
Bo Liu
Hanjing Wang
Muning Wen
Runzhe Wu
Ying Wen
Yaodong Yang
Weinan Zhang
Jun Wang
OffRL
27
46
0
05 Jun 2021
Heuristic-Guided Reinforcement Learning
Heuristic-Guided Reinforcement Learning
Ching-An Cheng
Andrey Kolobov
Adith Swaminathan
OffRL
40
61
0
05 Jun 2021
Differentiable Architecture Search for Reinforcement Learning
Differentiable Architecture Search for Reinforcement Learning
Yingjie Miao
Xingyou Song
John D. Co-Reyes
Daiyi Peng
Summer Yue
E. Brevdo
Aleksandra Faust
20
4
0
04 Jun 2021
Towards Deeper Deep Reinforcement Learning with Spectral Normalization
Towards Deeper Deep Reinforcement Learning with Spectral Normalization
Johan Bjorck
Carla P. Gomes
Kilian Q. Weinberger
19
23
0
02 Jun 2021
An Empirical Comparison of Off-policy Prediction Learning Algorithms on
  the Collision Task
An Empirical Comparison of Off-policy Prediction Learning Algorithms on the Collision Task
Sina Ghiassian
R. Sutton
AAML
OffRL
19
5
0
02 Jun 2021
An Entropy Regularization Free Mechanism for Policy-based Reinforcement
  Learning
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
Changnan Xiao
Haosen Shi
Jiajun Fan
Shihong Deng
26
5
0
01 Jun 2021
Reward is enough for convex MDPs
Reward is enough for convex MDPs
Tom Zahavy
Brendan O'Donoghue
Guillaume Desjardins
Satinder Singh
72
73
0
01 Jun 2021
Did I do that? Blame as a means to identify controlled effects in
  reinforcement learning
Did I do that? Blame as a means to identify controlled effects in reinforcement learning
Oriol Corcoll
Youssef Mohamed
Raul Vicente
24
3
0
01 Jun 2021
Goal Misgeneralization in Deep Reinforcement Learning
Goal Misgeneralization in Deep Reinforcement Learning
L. Langosco
Jack Koch
Lee D. Sharkey
J. Pfau
Laurent Orseau
David M. Krueger
30
78
0
28 May 2021
Towards mental time travel: a hierarchical memory for reinforcement
  learning agents
Towards mental time travel: a hierarchical memory for reinforcement learning agents
Andrew Kyle Lampinen
Stephanie C. Y. Chan
Andrea Banino
Felix Hill
24
47
0
28 May 2021
AndroidEnv: A Reinforcement Learning Platform for Android
AndroidEnv: A Reinforcement Learning Platform for Android
Daniel Toyama
P. Hamel
Anita Gergely
Gheorghe Comanici
Amelia Glaese
Zafarali Ahmed
Tyler Jackson
Shibl Mourad
Doina Precup
VLM
SSeg
19
70
0
27 May 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear
  Function Approximation
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation
Zaiwei Chen
S. Khodadadian
S. T. Maguluri
OffRL
68
29
0
26 May 2021
Gym-$μ$RTS: Toward Affordable Full Game Real-time Strategy Games
  Research with Deep Reinforcement Learning
Gym-μμμRTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement Learning
Sheng-Jun Huang
Santiago Ontañón
Chris Bamford
Lukasz Grela
OffRL
16
36
0
21 May 2021
Don't Do What Doesn't Matter: Intrinsic Motivation with Action
  Usefulness
Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness
Mathieu Seurin
Florian Strub
Philippe Preux
Olivier Pietquin
32
9
0
20 May 2021
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Yue Wu
Shuangfei Zhai
Nitish Srivastava
J. Susskind
Jian Zhang
Ruslan Salakhutdinov
Hanlin Goh
EDL
OffRL
OnRL
21
184
0
17 May 2021
Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Tom Schaul
Georg Ostrovski
Iurii Kemaev
Diana Borsa
15
19
0
11 May 2021
CASA: Bridging the Gap between Policy Improvement and Policy Evaluation
  with Conflict Averse Policy Iteration
CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration
Changnan Xiao
Haosen Shi
Jiajun Fan
Shihong Deng
Haiyan Yin
19
0
0
09 May 2021
Agent-Centric Representations for Multi-Agent Reinforcement Learning
Agent-Centric Representations for Multi-Agent Reinforcement Learning
Wenling Shang
L. Espeholt
Anton Raichuk
Tim Salimans
EgoV
32
10
0
19 Apr 2021
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
Dmitry Kalashnikov
Jacob Varley
Yevgen Chebotar
Benjamin Swanson
Rico Jonschkowski
Chelsea Finn
Sergey Levine
Karol Hausman
OffRL
47
272
0
16 Apr 2021
Generalising Discrete Action Spaces with Conditional Action Trees
Generalising Discrete Action Spaces with Conditional Action Trees
Christopher Bamford
Alvaro Ovalle
13
8
0
15 Apr 2021
A Novel Approach to Curiosity and Explainable Reinforcement Learning via
  Interpretable Sub-Goals
A Novel Approach to Curiosity and Explainable Reinforcement Learning via Interpretable Sub-Goals
C. V. Rossum
Candice Feinberg
Adam Abu Shumays
Kyle Baxter
Benedek Bartha
GAN
LLMAG
LRM
26
1
0
14 Apr 2021
Online and Offline Reinforcement Learning by Planning with a Learned
  Model
Online and Offline Reinforcement Learning by Planning with a Learned Model
Julian Schrittwieser
Thomas Hubert
Amol Mandhane
M. Barekatain
Ioannis Antonoglou
David Silver
OffRL
31
114
0
13 Apr 2021
Podracer architectures for scalable Reinforcement Learning
Podracer architectures for scalable Reinforcement Learning
Matteo Hessel
M. Kroiss
Aidan Clark
Iurii Kemaev
John Quan
Thomas Keck
Fabio Viola
H. V. Hasselt
24
38
0
13 Apr 2021
Muesli: Combining Improvements in Policy Optimization
Muesli: Combining Improvements in Policy Optimization
Matteo Hessel
Ivo Danihelka
Fabio Viola
A. Guez
Simon Schmitt
Laurent Sifre
T. Weber
David Silver
H. V. Hasselt
24
66
0
13 Apr 2021
Auxiliary Tasks and Exploration Enable ObjectNav
Auxiliary Tasks and Exploration Enable ObjectNav
Joel Ye
Dhruv Batra
Abhishek Das
Erik Wijmans
36
91
0
08 Apr 2021
Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive
  Navigation
Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive Navigation
Jinyoung Choi
C. Dance
Jung-Eun Kim
Seulbin Hwang
Kyungsik Park
UQCV
23
26
0
07 Apr 2021
Domain Generalization with MixStyle
Domain Generalization with MixStyle
Kaiyang Zhou
Yongxin Yang
Yu Qiao
Tao Xiang
76
746
0
05 Apr 2021
Efficient Transformers in Reinforcement Learning using Actor-Learner
  Distillation
Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation
Emilio Parisotto
Ruslan Salakhutdinov
42
44
0
04 Apr 2021
Deep Reinforcement Learning for Constrained Field Development
  Optimization in Subsurface Two-phase Flow
Deep Reinforcement Learning for Constrained Field Development Optimization in Subsurface Two-phase Flow
Y. Nasir
Jincong He
Chaoshun Hu
Shusei Tanaka
Kainan Wang
X. Wen
AI4CE
14
18
0
31 Mar 2021
Measuring Sample Efficiency and Generalization in Reinforcement Learning
  Benchmarks: NeurIPS 2020 Procgen Benchmark
Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark
Sharada Mohanty
Jyotish Poonganam
Adrien Gaidon
Andrey Kolobov
Blake Wulfe
...
Jacob Hilton
William H. Guss
Sahika Genc
John Schulman
K. Cobbe
34
22
0
29 Mar 2021
SegVisRL: Visuomotor Development for a Lunar Rover for Hazard Avoidance
  using Camera Images
SegVisRL: Visuomotor Development for a Lunar Rover for Hazard Avoidance using Camera Images
Tamir Blum
Gabin Paillet
Watcharawut Masawat
Mickaël Laîné
Kazuya Yoshida
13
1
0
26 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with
  Curiosity Contrastive Forward Dynamics Model
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
23
17
0
15 Mar 2021
Large Batch Simulation for Deep Reinforcement Learning
Large Batch Simulation for Deep Reinforcement Learning
Brennan Shacklett
Erik Wijmans
Aleksei Petrenko
Manolis Savva
Dhruv Batra
V. Koltun
Kayvon Fatahalian
3DV
OffRL
AI4CE
31
26
0
12 Mar 2021
Model-free Policy Learning with Reward Gradients
Model-free Policy Learning with Reward Gradients
Qingfeng Lan
Samuele Tosatto
Homayoon Farrahi
Rupam Mahmood
19
6
0
09 Mar 2021
A multi-agent reinforcement learning model of reputation and cooperation
  in human groups
A multi-agent reinforcement learning model of reputation and cooperation in human groups
Kevin R. McKee
Edward Hughes
Tina Zhu
Martin Chadwick
Raphael Köster
Antonio García Castañeda
Charlie Beattie
T. Graepel
M. Botvinick
Joel Z. Leibo
8
6
0
08 Mar 2021
Provably Efficient Cooperative Multi-Agent Reinforcement Learning with
  Function Approximation
Provably Efficient Cooperative Multi-Agent Reinforcement Learning with Function Approximation
Abhimanyu Dubey
Alex Pentland
30
23
0
08 Mar 2021
Causal Analysis of Agent Behavior for AI Safety
Causal Analysis of Agent Behavior for AI Safety
Grégoire Delétang
Jordi Grau-Moya
Miljan Martic
Tim Genewein
Tom McGrath
Vladimir Mikulik
M. Kunesch
Shane Legg
Pedro A. Ortega
CML
32
6
0
05 Mar 2021
Reinforcement Learning Trajectory Generation and Control for Aggressive
  Perching on Vertical Walls with Quadrotors
Reinforcement Learning Trajectory Generation and Control for Aggressive Perching on Vertical Walls with Quadrotors
Chen-Huan Pi
Kai-Chun Hu
Yu-ting Huang
Stone Cheng
16
2
0
04 Mar 2021
Improving Computational Efficiency in Visual Reinforcement Learning via
  Stored Embeddings
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings
Lili Chen
Kimin Lee
A. Srinivas
Pieter Abbeel
OffRL
24
11
0
04 Mar 2021
Self-play Learning Strategies for Resource Assignment in Open-RAN
  Networks
Self-play Learning Strategies for Resource Assignment in Open-RAN Networks
Xiaoyang Wang
Jonathan D. Thomas
Robert Piechocki
S. Kapoor
Raúl Santos-Rodríguez
Arjun Parekh
24
24
0
03 Mar 2021
Inference-Based Deterministic Messaging For Multi-Agent Communication
Inference-Based Deterministic Messaging For Multi-Agent Communication
Varun Bhatt
M. Buro
28
4
0
03 Mar 2021
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
Chao Yu
Akash Velu
Eugene Vinitsky
Jiaxuan Gao
Yu Wang
Alexandre M. Bayen
Yi Wu
OffRL
45
1,207
0
02 Mar 2021
Low-Precision Reinforcement Learning: Running Soft Actor-Critic in Half
  Precision
Low-Precision Reinforcement Learning: Running Soft Actor-Critic in Half Precision
Johan Bjorck
Xiangyu Chen
Christopher De Sa
Carla P. Gomes
Kilian Q. Weinberger
23
6
0
26 Feb 2021
PsiPhi-Learning: Reinforcement Learning with Demonstrations using
  Successor Features and Inverse Temporal Difference Learning
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning
Angelos Filos
Clare Lyle
Y. Gal
Sergey Levine
Natasha Jaques
Gregory Farquhar
26
22
0
24 Feb 2021
Synthetic Returns for Long-Term Credit Assignment
Synthetic Returns for Long-Term Credit Assignment
David Raposo
Samuel Ritter
Adam Santoro
Greg Wayne
T. Weber
M. Botvinick
H. V. Hasselt
Francis Song
AI4TS
29
34
0
24 Feb 2021
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Victor Campos
Pablo Sprechmann
Steven Hansen
André Barreto
Steven Kapturowski
Alex Vitvitskyi
Adria Puigdomenech Badia
Charles Blundell
OffRL
OnRL
46
25
0
24 Feb 2021
PFRL: Pose-Free Reinforcement Learning for 6D Pose Estimation
PFRL: Pose-Free Reinforcement Learning for 6D Pose Estimation
Jianzhun Shao
Yuhang Jiang
Gu Wang
Zhigang Li
Xiangyang Ji
33
29
0
24 Feb 2021
Previous
123...111213...181920
Next