
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
arXiv: 1802.01561

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 981 papers shown
Title
MVP: Unified Motion and Visual Self-Supervised Learning for Large-Scale Robotic Navigation
Marvin Chancán
Michael Milford
SSL
33
8
0
02 Mar 2020
Environment-agnostic Multitask Learning for Natural Language Grounded Navigation
Junfeng Fang
Vihan Jain
Eugene Ie
William Yang Wang
Zornitsa Kozareva
Sujith Ravi
LM&Ro
43
63
0
01 Mar 2020
Fully Asynchronous Policy Evaluation in Distributed Reinforcement Learning over Networks
Xingyu Sha
Jiaqi Zhang
Keyou You
Kaipeng Zhang
Tamer Basar
OffRL
6
22
0
01 Mar 2020
A Self-Tuning Actor-Critic Algorithm
Tom Zahavy
Zhongwen Xu
Vivek Veeriah
Matteo Hessel
Junhyuk Oh
H. V. Hasselt
David Silver
Satinder Singh
28
13
0
28 Feb 2020
On Catastrophic Interference in Atari 2600 Games
W. Fedus
Dibya Ghosh
John D. Martin
Marc G. Bellemare
Yoshua Bengio
Hugo Larochelle
18
26
0
28 Feb 2020
Towards Modular Algorithm Induction
Daniel A. Abolafia
Rishabh Singh
Manzil Zaheer
Charles Sutton
9
2
0
27 Feb 2020
RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments
Roberta Raileanu
Tim Rocktäschel
13
170
0
27 Feb 2020
Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games
Edward Hughes
Thomas W. Anthony
Tom Eccles
Joel Z Leibo
David Balduzzi
Yoram Bachrach
14
20
0
27 Feb 2020
Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement
Benjamin Eysenbach
Xinyang Geng
Sergey Levine
Ruslan Salakhutdinov
OffRL
18
86
0
25 Feb 2020
From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization
Julien Perolat
Rémi Munos
Jean-Baptiste Lespiau
Shayegan Omidshafiei
Mark Rowland
...
David Balduzzi
Bart De Vylder
Georgios Piliouras
Marc Lanctot
K. Tuyls
12
84
0
19 Feb 2020
Value-driven Hindsight Modelling
A. Guez
Fabio Viola
T. Weber
Lars Buesing
Steven Kapturowski
Doina Precup
David Silver
N. Heess
OffRL
32
12
0
19 Feb 2020
Adaptive Experience Selection for Policy Gradient
S. Mohamad
Giovanni Montana
39
0
0
17 Feb 2020
Never Give Up: Learning Directed Exploration Strategies
Adria Puigdomenech Badia
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Bilal Piot
...
O. Tieleman
Martín Arjovsky
Alexander Pritzel
Andrew Bolt
Charles Blundell
28
291
0
14 Feb 2020
Hoplite: Efficient and Fault-Tolerant Collective Communication for Task-Based Distributed Systems
Siyuan Zhuang
Zhuohan Li
Danyang Zhuo
Stephanie Wang
Eric Liang
Robert Nishihara
Philipp Moritz
Ion Stoica
27
23
0
13 Feb 2020
Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills
Victor Campos
Alexander R. Trott
Caiming Xiong
R. Socher
Xavier Giró-i-Nieto
Jordi Torres
OffRL
19
150
0
10 Feb 2020
Causally Correct Partial Models for Reinforcement Learning
Danilo Jimenez Rezende
Ivo Danihelka
George Papamakarios
Nan Rosemary Ke
Ray Jiang
...
Jane X. Wang
Jovana Mitrović
F. Besse
Ioannis Antonoglou
Lars Buesing
AI4TS
24
32
0
07 Feb 2020
Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits
Jack Parker-Holder
Vu Nguyen
Stephen J. Roberts
OffRL
75
83
0
06 Feb 2020
Social diversity and social preferences in mixed-motive reinforcement learning
Kevin R. McKee
I. Gemp
Brian McWilliams
Edgar A. Duéñez-Guzmán
Edward Hughes
Joel Z Leibo
20
80
0
06 Feb 2020
Finite-Sample Analysis of Stochastic Approximation Using Smooth Convex Envelopes
Zaiwei Chen
S. T. Maguluri
Sanjay Shakkottai
Karthikeyan Shanmugam
53
33
0
03 Feb 2020
Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning
Peter Henderson
Jie Hu
Joshua Romoff
Emma Brunskill
Dan Jurafsky
Joelle Pineau
34
441
0
31 Jan 2020
Towards Learning Multi-agent Negotiations via Self-Play
Yichuan Tang
25
33
0
28 Jan 2020
Rotation, Translation, and Cropping for Zero-Shot Generalization
Chang Ye
Ahmed Khalifa
Philip Bontrager
Julian Togelius
32
38
0
27 Jan 2020
Silly rules improve the capacity of agents to learn stable enforcement and compliance behaviors
Raphael Köster
Dylan Hadfield-Menell
Gillian K. Hadfield
Joel Z Leibo
8
10
0
25 Jan 2020
Q-Learning in enormous action spaces via amortized approximate maximization
T. Wiele
David Warde-Farley
A. Mnih
Volodymyr Mnih
29
60
0
22 Jan 2020
Gradient Surgery for Multi-Task Learning
Tianhe Yu
Saurabh Kumar
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn
41
1,175
0
19 Jan 2020
FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using Human Feedback
Baicen Xiao
Qifan Lu
Bhaskar Ramasubramanian
Andrew Clark
L. Bushnell
Radha Poovendran
13
25
0
19 Jan 2020
Algorithms in Multi-Agent Systems: A Holistic Perspective from Reinforcement Learning and Game Theory
Yunlong Lu
Kai Yan
AI4CE
15
13
0
17 Jan 2020
Population-Guided Parallel Policy Search for Reinforcement Learning
Whiyoung Jung
Giseung Park
Y. Sung
OffRL
24
38
0
09 Jan 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
25
174
0
09 Jan 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
38
34
0
23 Dec 2019
Predictive Coding for Boosting Deep Reinforcement Learning with Sparse Rewards
Xingyu Lu
Stas Tiomkin
Pieter Abbeel
OffRL
39
4
0
21 Dec 2019
Mastering Complex Control in MOBA Games with Deep Reinforcement Learning
Deheng Ye
Zhao Liu
Mingfei Sun
Bei Shi
P. Zhao
...
Tengfei Shi
Liang Wang
Qiang Fu
Wei Yang
Lanxiao Huang
29
313
0
20 Dec 2019
Soft Q Network
Jingbin Liu
Shuai Liu
Xinyang Gu
OffRL
26
2
0
20 Dec 2019
Relational Mimic for Visual Adversarial Imitation Learning
Lionel Blondé
Yichuan Tang
Jian Zhang
Russ Webb
36
0
0
18 Dec 2019
Adapting Behaviour for Learning Progress
Tom Schaul
Diana Borsa
David Ding
David Szepesvari
Georg Ostrovski
Will Dabney
Simon Osindero
22
18
0
14 Dec 2019
Long-Term Planning and Situational Awareness in OpenAI Five
Jonathan Raiman
Susan Zhang
Filip Wolski
14
10
0
13 Dec 2019
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
46
1,799
0
13 Dec 2019
Biases for Emergent Communication in Multi-agent Reinforcement Learning
Tom Eccles
Yoram Bachrach
Guy Lever
Angeliki Lazaridou
T. Graepel
AI4CE
19
70
0
11 Dec 2019
Zero-shot generalization using cascaded system-representations
A. Malik
OffRL
19
2
0
11 Dec 2019
VALAN: Vision and Language Agent Navigation
L. Lansing
Vihan Jain
Harsh Mehta
Haoshuo Huang
Eugene Ie
LM&Ro
AI4TS
11
8
0
06 Dec 2019
Observational Overfitting in Reinforcement Learning
Xingyou Song
Yiding Jiang
Stephen Tu
Yilun Du
Behnam Neyshabur
OffRL
33
138
0
06 Dec 2019
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
42
1,313
0
03 Dec 2019
Leveraging Procedural Generation to Benchmark Reinforcement Learning
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
45
541
0
03 Dec 2019
Policy Optimization Reinforcement Learning with Entropy Regularization
Jingbin Liu
Xinyang Gu
Shuai Liu
28
4
0
02 Dec 2019
IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks
Michael Luo
Jiahao Yao
Richard Liaw
Eric Liang
Ion Stoica
27
15
0
30 Nov 2019
Simulation-based reinforcement learning for real-world autonomous driving
B. Osinski
Adam Jakubowski
Piotr Milos
Pawel Ziecina
Christopher Galias
S. Homoceanu
Henryk Michalewski
35
122
0
29 Nov 2019
End-to-End Model-Free Reinforcement Learning for Urban Driving using Implicit Affordances
Marin Toromanoff
É. Wirbel
Fabien Moutarde
OffRL
44
205
0
25 Nov 2019
Catch & Carry: Reusable Neural Controllers for Vision-Guided Whole-Body Tasks
J. Merel
S. Tunyasuvunakool
Arun Ahuja
Yuval Tassa
Leonard Hasenclever
Vu Pham
Tom Erez
Greg Wayne
N. Heess
36
9
0
15 Nov 2019
Compressive Transformers for Long-Range Sequence Modelling
Jack W. Rae
Anna Potapenko
Siddhant M. Jayakumar
Timothy Lillicrap
RALM
VLM
KELM
13
623
0
13 Nov 2019