ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.01561
  4. Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures
v1v2v3 (latest)

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 1,000 papers shown
Title
Predictive Coding for Boosting Deep Reinforcement Learning with Sparse
  Rewards
Predictive Coding for Boosting Deep Reinforcement Learning with Sparse Rewards
Xingyu Lu
Stas Tiomkin
Pieter Abbeel
OffRL
72
5
0
21 Dec 2019
Mastering Complex Control in MOBA Games with Deep Reinforcement Learning
Mastering Complex Control in MOBA Games with Deep Reinforcement Learning
Deheng Ye
Zhao Liu
Mingfei Sun
Bei Shi
P. Zhao
...
Tengfei Shi
Liang Wang
Qiang Fu
Wei Yang
Lanxiao Huang
67
324
0
20 Dec 2019
Soft Q Network
Soft Q Network
Jingbin Liu
Shuai Liu
Xinyang Gu
OffRL
51
2
0
20 Dec 2019
Relational Mimic for Visual Adversarial Imitation Learning
Relational Mimic for Visual Adversarial Imitation Learning
Lionel Blondé
Yichuan Tang
Jian Zhang
Russ Webb
40
0
0
18 Dec 2019
Adapting Behaviour for Learning Progress
Adapting Behaviour for Learning Progress
Tom Schaul
Diana Borsa
David Ding
David Szepesvari
Georg Ostrovski
Will Dabney
Simon Osindero
160
19
0
14 Dec 2019
Long-Term Planning and Situational Awareness in OpenAI Five
Long-Term Planning and Situational Awareness in OpenAI Five
Jonathan Raiman
Susan Zhang
Filip Wolski
49
10
0
13 Dec 2019
Dota 2 with Large Scale Deep Reinforcement Learning
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNNVLMCLLAI4CELRM
181
1,840
0
13 Dec 2019
Biases for Emergent Communication in Multi-agent Reinforcement Learning
Biases for Emergent Communication in Multi-agent Reinforcement Learning
Tom Eccles
Yoram Bachrach
Guy Lever
Angeliki Lazaridou
T. Graepel
AI4CE
83
70
0
11 Dec 2019
Zero-shot generalization using cascaded system-representations
Zero-shot generalization using cascaded system-representations
A. Malik
OffRL
24
2
0
11 Dec 2019
VALAN: Vision and Language Agent Navigation
VALAN: Vision and Language Agent Navigation
L. Lansing
Vihan Jain
Harsh Mehta
Haoshuo Huang
Eugene Ie
LM&RoAI4TS
58
8
0
06 Dec 2019
Observational Overfitting in Reinforcement Learning
Observational Overfitting in Reinforcement Learning
Xingyou Song
Yiding Jiang
Stephen Tu
Yilun Du
Behnam Neyshabur
OffRL
134
140
0
06 Dec 2019
Dream to Control: Learning Behaviors by Latent Imagination
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
200
1,378
0
03 Dec 2019
Leveraging Procedural Generation to Benchmark Reinforcement Learning
Leveraging Procedural Generation to Benchmark Reinforcement Learning
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
135
557
0
03 Dec 2019
Policy Optimization Reinforcement Learning with Entropy Regularization
Policy Optimization Reinforcement Learning with Entropy Regularization
Jingbin Liu
Xinyang Gu
Shuai Liu
107
4
0
02 Dec 2019
IMPACT: Importance Weighted Asynchronous Architectures with Clipped
  Target Networks
IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks
Michael Luo
Jiahao Yao
Richard Liaw
Eric Liang
Ion Stoica
72
15
0
30 Nov 2019
Simulation-based reinforcement learning for real-world autonomous
  driving
Simulation-based reinforcement learning for real-world autonomous driving
B. Osinski
Adam Jakubowski
Piotr Milos
Pawel Ziecina
Christopher Galias
S. Homoceanu
Henryk Michalewski
112
122
0
29 Nov 2019
End-to-End Model-Free Reinforcement Learning for Urban Driving using
  Implicit Affordances
End-to-End Model-Free Reinforcement Learning for Urban Driving using Implicit Affordances
Marin Toromanoff
É. Wirbel
Fabien Moutarde
OffRL
177
209
0
25 Nov 2019
Catch & Carry: Reusable Neural Controllers for Vision-Guided Whole-Body
  Tasks
Catch & Carry: Reusable Neural Controllers for Vision-Guided Whole-Body Tasks
J. Merel
S. Tunyasuvunakool
Arun Ahuja
Yuval Tassa
Leonard Hasenclever
Vu Pham
Tom Erez
Greg Wayne
N. Heess
81
9
0
15 Nov 2019
Compressive Transformers for Long-Range Sequence Modelling
Compressive Transformers for Long-Range Sequence Modelling
Jack W. Rae
Anna Potapenko
Siddhant M. Jayakumar
Timothy Lillicrap
RALMVLMKELM
105
656
0
13 Nov 2019
Learning Representations in Reinforcement Learning:An Information
  Bottleneck Approach
Learning Representations in Reinforcement Learning:An Information Bottleneck Approach
Yingjun Pei
Xinwen Hou
SSL
76
10
0
12 Nov 2019
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function
  Approximation
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation
Shangtong Zhang
Bo Liu
Hengshuai Yao
Shimon Whiteson
OffRL
143
8
0
11 Nov 2019
Multi-Agent Connected Autonomous Driving using Deep Reinforcement
  Learning
Multi-Agent Connected Autonomous Driving using Deep Reinforcement Learning
Praveen Palanisamy
101
146
0
11 Nov 2019
DeepRacer: Educational Autonomous Racing Platform for Experimentation
  with Sim2Real Reinforcement Learning
DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning
Bharathan Balaji
S. Mallya
Sahika Genc
Saurabh Gupta
Leo Dirac
...
Yunzhe Tao
Brian Townsend
E. Calleja
Sunil Muralidhara
Dhanasekar Karuppasamy
75
57
0
05 Nov 2019
Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing
  Shaped Rewards
Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards
Alexander R. Trott
Stephan Zheng
Caiming Xiong
R. Socher
125
112
0
04 Nov 2019
Gradient-based Adaptive Markov Chain Monte Carlo
Gradient-based Adaptive Markov Chain Monte Carlo
Michalis K. Titsias
P. Dellaportas
BDL
102
22
0
04 Nov 2019
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion
  Frames
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames
Erik Wijmans
Abhishek Kadian
Ari S. Morcos
Stefan Lee
Irfan Essa
Devi Parikh
Manolis Savva
Dhruv Batra
97
485
0
01 Nov 2019
Generalization of Reinforcement Learners with Working and Episodic
  Memory
Generalization of Reinforcement Learners with Working and Episodic Memory
Meire Fortunato
Melissa Tan
Ryan Faulkner
Steven Hansen
Adria Puigdomenech Badia
Gavin Buttimore
Charlie Deck
Joel Z Leibo
Charles Blundell
119
70
0
29 Oct 2019
Asynchronous Methods for Model-Based Reinforcement Learning
Asynchronous Methods for Model-Based Reinforcement Learning
Yunzhi Zhang
I. Clavera
Bo-Yu Tsai
Pieter Abbeel
OffRL
60
27
0
28 Oct 2019
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta
  Reinforcement Learning
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Tianhe Yu
Deirdre Quillen
Zhanpeng He
Ryan Julian
Avnish Narayan
Hayden Shively
Adithya Bellathur
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
325
1,182
0
24 Oct 2019
Improving the Gating Mechanism of Recurrent Neural Networks
Improving the Gating Mechanism of Recurrent Neural Networks
Albert Gu
Çağlar Gülçehre
T. Paine
Matthew W. Hoffman
Razvan Pascanu
AI4CE
26
2
0
22 Oct 2019
Collaborative Graph Walk for Semi-supervised Multi-Label Node
  Classification
Collaborative Graph Walk for Semi-supervised Multi-Label Node Classification
Uchenna Akujuobi
Yufei Han
Qiannan Zhang
Xiangliang Zhang
55
17
0
22 Oct 2019
Modelling Generalized Forces with Reinforcement Learning for Sim-to-Real
  Transfer
Modelling Generalized Forces with Reinforcement Learning for Sim-to-Real Transfer
Rae Jeong
Jackie Kay
Francesco Romano
Thomas Lampe
Thomas Rothörl
A. Abdolmaleki
Tom Erez
Yuval Tassa
F. Nori
63
23
0
21 Oct 2019
Dealing with Sparse Rewards in Reinforcement Learning
Dealing with Sparse Rewards in Reinforcement Learning
J. Hare
64
80
0
21 Oct 2019
Regularization Matters in Policy Optimization
Regularization Matters in Policy Optimization
Zhuang Liu
Xuanlin Li
Bingyi Kang
Trevor Darrell
OffRL
83
33
0
21 Oct 2019
Dynamic Subgoal-based Exploration via Bayesian Optimization
Dynamic Subgoal-based Exploration via Bayesian Optimization
Yijia Wang
Matthias Poloczek
Daniel R. Jiang
80
3
0
21 Oct 2019
RTFM: Generalising to Novel Environment Dynamics via Reading
RTFM: Generalising to Novel Environment Dynamics via Reading
Victor Zhong
Tim Rocktaschel
Edward Grefenstette
LLMAGOffRLAI4CE
94
54
0
18 Oct 2019
A Hybrid Compact Neural Architecture for Visual Place Recognition
A Hybrid Compact Neural Architecture for Visual Place Recognition
Marvin Chancán
Luis Hernandez-Nunez
A. Narendra
A. Barron
Michael Milford
58
57
0
15 Oct 2019
Stabilizing Transformers for Reinforcement Learning
Stabilizing Transformers for Reinforcement Learning
Emilio Parisotto
H. F. Song
Jack W. Rae
Razvan Pascanu
Çağlar Gülçehre
...
Aidan Clark
Seb Noury
M. Botvinick
N. Heess
R. Hadsell
OffRL
110
368
0
13 Oct 2019
CityLearn: Diverse Real-World Environments for Sample-Efficient
  Navigation Policy Learning
CityLearn: Diverse Real-World Environments for Sample-Efficient Navigation Policy Learning
Marvin Chancán
Michael Milford
SSL
68
5
0
10 Oct 2019
Imagined Value Gradients: Model-Based Policy Optimization with
  Transferable Latent Dynamics Models
Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models
Arunkumar Byravan
Jost Tobias Springenberg
A. Abdolmaleki
Roland Hafner
Michael Neunert
Thomas Lampe
Noah Y. Siegel
N. Heess
Martin Riedmiller
OffRL
83
41
0
09 Oct 2019
MVFST-RL: An Asynchronous RL Framework for Congestion Control with
  Delayed Actions
MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions
V. Sivakumar
Olivier Delalleau
Tim Rocktaschel
Alexander H. Miller
Heinrich Küttler
Nantas Nardelli
Michael G. Rabbat
Joelle Pineau
Sebastian Riedel
104
36
0
09 Oct 2019
Policy Optimization Through Approximate Importance Sampling
Policy Optimization Through Approximate Importance Sampling
Marcin Tomczak
Dongho Kim
Peter Vrancx
Kyungmin Kim
29
4
0
09 Oct 2019
TorchBeast: A PyTorch Platform for Distributed RL
TorchBeast: A PyTorch Platform for Distributed RL
Heinrich Küttler
Nantas Nardelli
Thibaut Lavril
Marco Selvatici
V. Sivakumar
Tim Rocktaschel
Edward Grefenstette
OffRL
94
58
0
08 Oct 2019
QuaRL: Quantization for Fast and Environmentally Sustainable
  Reinforcement Learning
QuaRL: Quantization for Fast and Environmentally Sustainable Reinforcement Learning
Srivatsan Krishnan
Maximilian Lam
Sharad Chitlangia
Zishen Wan
Gabriel Barth-Maron
Aleksandra Faust
Vijay Janapa Reddi
MQ
44
26
0
02 Oct 2019
Environmental drivers of systematicity and generalization in a situated
  agent
Environmental drivers of systematicity and generalization in a situated agent
Felix Hill
Andrew Kyle Lampinen
R. Schneider
S. Clark
M. Botvinick
James L. McClelland
Adam Santoro
OOD
139
107
0
01 Oct 2019
SURREAL-System: Fully-Integrated Stack for Distributed Deep
  Reinforcement Learning
SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning
Linxi Fan
Yuke Zhu
Jiren Zhu
Zihua Liu
Orien Zeng
Anchit Gupta
Joan Creus-Costa
Silvio Savarese
Li Fei-Fei
OffRLGNN
91
3
0
27 Sep 2019
Automated curricula through setter-solver interactions
Automated curricula through setter-solver interactions
S. Racanière
Andrew Kyle Lampinen
Adam Santoro
David P. Reichert
Vlad Firoiu
Timothy Lillicrap
81
53
0
27 Sep 2019
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete
  and Continuous Control
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
H. F. Song
A. Abdolmaleki
Jost Tobias Springenberg
Aidan Clark
Hubert Soyer
...
Dhruva Tirumala
N. Heess
Dan Belov
Martin Riedmiller
M. Botvinick
116
126
0
26 Sep 2019
MERL: Multi-Head Reinforcement Learning
MERL: Multi-Head Reinforcement Learning
Yannis Flet-Berliac
Philippe Preux
OffRL
124
13
0
26 Sep 2019
Off-Policy Actor-Critic with Shared Experience Replay
Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt
Matteo Hessel
Karen Simonyan
OffRL
80
68
0
25 Sep 2019
Previous
123...1617181920
Next