IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 981 papers shown
Compositional Transfer in Hierarchical Reinforcement Learning
Markus Wulfmeier
A. Abdolmaleki
Roland Hafner
Jost Tobias Springenberg
Michael Neunert
Tim Hertweck
Thomas Lampe
Noah Y. Siegel
N. Heess
Martin Riedmiller
30
27
0
26 Jun 2019
Shaping Belief States with Generative Environment Models for RL
Karol Gregor
Danilo Jimenez Rezende
F. Besse
Yan Wu
Hamza Merzic
Aaron van den Oord
OffRL
AI4CE
16
118
0
21 Jun 2019
Cross-View Policy Learning for Street Navigation
Ang Li
Huiyi Hu
Piotr Wojciech Mirowski
Mehrdad Farajtabar
30
27
0
13 Jun 2019
Fast Task Inference with Variational Intrinsic Successor Features
Steven Hansen
Will Dabney
André Barreto
T. Wiele
David Warde-Farley
Volodymyr Mnih
BDL
44
151
0
12 Jun 2019
Reinforcement Learning for Integer Programming: Learning to Cut
Yunhao Tang
Shipra Agrawal
Yuri Faenza
AI4CE
24
165
0
11 Jun 2019
Importance Resampling for Off-policy Prediction
M. Schlegel
Wesley Chung
Daniel Graves
Jian Qian
Martha White
OffRL
11
41
0
11 Jun 2019
Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Mahmoud Assran
Joshua Romoff
Nicolas Ballas
Joelle Pineau
Michael G. Rabbat
28
32
0
09 Jun 2019
Empirical Likelihood for Contextual Bandits
Nikos Karampatziakis
John Langford
Paul Mineiro
OffRL
23
9
0
07 Jun 2019
Towards Interpretable Reinforcement Learning Using Attention Augmented Agents
Alex Mott
Daniel Zoran
Mike Chrzanowski
Daan Wierstra
Danilo Jimenez Rezende
26
188
0
06 Jun 2019
How to Initialize your Network? Robust Initialization for WeightNorm & ResNets
Devansh Arpit
Victor Campos
Yoshua Bengio
18
56
0
05 Jun 2019
Options as responses: Grounding behavioural hierarchies in multi-agent RL
A. Vezhnevets
Yuhuai Wu
Rémi Leblond
Joel Z Leibo
AI4CE
20
17
0
04 Jun 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
4
1,034
0
03 Jun 2019
Neural Replicator Dynamics
Daniel Hennes
Dustin Morrill
Shayegan Omidshafiei
Rémi Munos
Julien Perolat
...
A. Gruslys
Jean-Baptiste Lespiau
Paavo Parmas
Edgar A. Duénez-Guzmán
K. Tuyls
21
16
0
01 Jun 2019
Interval timing in deep reinforcement learning agents
B. Deverett
Ryan Faulkner
Meire Fortunato
Greg Wayne
Joel Z Leibo
17
14
0
31 May 2019
Unsupervised Model Selection for Variational Disentangled Representation Learning
Sunny Duan
Loic Matthey
Andre Saraiva
Nicholas Watters
Christopher P. Burgess
Alexander Lerchner
I. Higgins
OOD
DRL
6
78
0
29 May 2019
An Explicitly Relational Neural Network Architecture
Murray Shanahan
Kyriacos Nikiforou
Antonia Creswell
Christos Kaplanis
David Barrett
M. Garnelo
NAI
3DV
GAN
25
68
0
24 May 2019
Combining Experience Replay with Exploration by Random Network Distillation
Francesco Sovrano
18
15
0
18 May 2019
Optimizing Sequential Medical Treatments with Auto-Encoding Heuristic Search in POMDPs
Luchen Li
Matthieu Komorowski
Aldo A. Faisal
OffRL
29
13
0
17 May 2019
Trajectory-Based Off-Policy Deep Reinforcement Learning
Andreas Doerr
Michael Volpp
Marc Toussaint
Sebastian Trimpe
Christian Daniel
OffRL
29
2
0
14 May 2019
Smoothing Policies and Safe Policy Gradients
Matteo Papini
Matteo Pirotta
Marcello Restelli
26
29
0
08 May 2019
Reinforced Genetic Algorithm Learning for Optimizing Computation Graphs
Aditya Sanjay Paliwal
Felix Gimeno
Vinod Nair
Yujia Li
Miles Lubin
Pushmeet Kohli
Oriol Vinyals
OffRL
GNN
16
64
0
07 May 2019
Dimension-Wise Importance Sampling Weight Clipping for Sample-Efficient Reinforcement Learning
Seungyul Han
Y. Sung
OffRL
6
20
0
07 May 2019
Information asymmetry in KL-regularized RL
Alexandre Galashov
Siddhant M. Jayakumar
Leonard Hasenclever
Dhruva Tirumala
Jonathan Richard Schwarz
Guillaume Desjardins
Wojciech M. Czarnecki
Yee Whye Teh
Razvan Pascanu
N. Heess
OffRL
17
102
0
03 May 2019
Challenges of Real-World Reinforcement Learning
Gabriel Dulac-Arnold
D. Mankowitz
Todd Hester
OffRL
37
542
0
29 Apr 2019
Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Tom Schaul
Diana Borsa
Joseph Modayil
Razvan Pascanu
16
63
0
25 Apr 2019
Towards Combining On-Off-Policy Methods for Real-World Applications
Kai-Chun Hu
Chen-Huan Pi
Ting Han Wei
I-Chen Wu
Stone Cheng
Yi-Wei Dai
Wei-Yuan Ye
OffRL
6
2
0
24 Apr 2019
Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning
Yuji Kanagawa
Tomoyuki Kaneko
21
12
0
17 Apr 2019
Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL
Yannis Flet-Berliac
Philippe Preux
6
2
0
08 Apr 2019
Creating Pro-Level AI for a Real-Time Fighting Game Using Deep Reinforcement Learning
In-Suk Oh
Seungeun Rho
Sangbin Moon
Seongho Son
Hyoil Lee
Jinyun Chung
23
52
0
08 Apr 2019
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer
E. Beeching
Christian Wolf
J. Dibangoye
Olivier Simonin
OffRL
LRM
35
25
0
03 Apr 2019
Meta-Learning surrogate models for sequential decision making
Alexandre Galashov
Jonathan Richard Schwarz
Hyunjik Kim
M. Garnelo
D. Saxton
Pushmeet Kohli
S. M. Ali Eslami
Yee Whye Teh
BDL
OffRL
28
26
0
28 Mar 2019
Generalized Off-Policy Actor-Critic
Shangtong Zhang
Wendelin Bohmer
Shimon Whiteson
OffRL
CML
19
43
0
27 Mar 2019
Optimization Methods for Interpretable Differentiable Decision Trees in Reinforcement Learning
I. D. Rodriguez
Taylor W. Killian
Ivan Dario Jimenez Rodriguez
Sung-Hyun Son
Matthew C. Gombolay
OffRL
15
12
0
22 Mar 2019
Learning Reciprocity in Complex Sequential Social Dilemmas
Tom Eccles
Edward Hughes
János Kramár
S. Wheelwright
Joel Z Leibo
9
49
0
19 Mar 2019
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Dhruva Tirumala
Hyeonwoo Noh
Alexandre Galashov
Leonard Hasenclever
Arun Ahuja
Greg Wayne
Razvan Pascanu
Yee Whye Teh
N. Heess
OffRL
11
45
0
18 Mar 2019
Scheduled Intrinsic Drive: A Hierarchical Take on Intrinsically Motivated Exploration
Jingwei Zhang
Niklas Wetzel
Nicolai Dorka
Joschka Boedecker
Wolfram Burgard
11
26
0
18 Mar 2019
A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning
Wesley A Suttle
Zhuoran Yang
Kaipeng Zhang
Zhaoran Wang
Tamer Basar
Ji Liu
OffRL
10
62
0
15 Mar 2019
The StreetLearn Environment and Dataset
Piotr Wojciech Mirowski
Andras Banki-Horvath
Keith Anderson
Denis Teplyashin
Karl Moritz Hermann
...
Matthew Koichi Grimes
Karen Simonyan
Koray Kavukcuoglu
Andrew Zisserman
R. Hadsell
3DV
26
64
0
04 Mar 2019
Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments
Zhizheng Zhang
Jiale Chen
Zhibo Chen
Weiping Li
OffRL
27
60
0
03 Mar 2019
Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research
Joel Z Leibo
Edward Hughes
Marc Lanctot
T. Graepel
27
105
0
02 Mar 2019
Learning To Follow Directions in Street View
Karl Moritz Hermann
Mateusz Malinowski
Piotr Wojciech Mirowski
Andras Banki-Horvath
Keith Anderson
R. Hadsell
SSL
26
66
0
01 Mar 2019
Model-Based Reinforcement Learning for Atari
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
...
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
OffRL
24
840
0
01 Mar 2019
Neural Packet Classification
Eric Liang
Hang Zhu
Xin Jin
Ion Stoica
OffRL
35
120
0
27 Feb 2019
Leveraging Communication Topologies Between Learning Agents in Deep Reinforcement Learning
D. Adjodah
D. Calacci
Abhimanyu Dubey
Anirudh Goyal
P. Krafft
Esteban Moro Egido
Alex Pentland
AI4CE
22
8
0
16 Feb 2019
Neural-encoding Human Experts' Domain Knowledge to Warm Start Reinforcement Learning
Andrew Silva
Matthew C. Gombolay
OffRL
27
20
0
15 Feb 2019
Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
Devin Schwab
Tobias Springenberg
M. Martins
Thomas Lampe
Michael Neunert
A. Abdolmaleki
Tim Hertweck
Roland Hafner
F. Nori
Martin Riedmiller
21
22
0
13 Feb 2019
Contextual Recurrent Neural Networks
Sam Wenke
J. Fleming
11
5
0
09 Feb 2019
Metaoptimization on a Distributed System for Deep Reinforcement Learning
Greg Heinrich
I. Frosio
OffRL
21
2
0
07 Feb 2019
Distilling Policy Distillation
Wojciech M. Czarnecki
Razvan Pascanu
Simon Osindero
Siddhant M. Jayakumar
G. Swirszcz
Max Jaderberg
16
131
0
06 Feb 2019
The Hanabi Challenge: A New Frontier for AI Research
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
...
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
15
351
0
01 Feb 2019