ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.01561
  4. Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
ArXivPDFHTML

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 982 papers shown
Title
From Eye-blinks to State Construction: Diagnostic Benchmarks for Online
  Representation Learning
From Eye-blinks to State Construction: Diagnostic Benchmarks for Online Representation Learning
Banafsheh Rafiee
Zaheer Abbas
Sina Ghiassian
Raksha Kumaraswamy
R. Sutton
Elliot A. Ludvig
Adam White
OffRL
19
17
0
09 Nov 2020
Hybrid Supervised Reinforced Model for Dialogue Systems
Hybrid Supervised Reinforced Model for Dialogue Systems
Carlos Miranda
Y. Kessaci
BDL
OffRL
11
0
0
04 Nov 2020
A Study of Policy Gradient on a Class of Exactly Solvable Models
A Study of Policy Gradient on a Class of Exactly Solvable Models
Gavin McCracken
Colin Daniels
Rosie Zhao
Anna M. Brandenberger
Prakash Panangaden
Doina Precup
19
0
0
03 Nov 2020
Instance based Generalization in Reinforcement Learning
Instance based Generalization in Reinforcement Learning
Martín Bertrán
Natalia Martínez
Mariano Phielipp
Guillermo Sapiro
OffRL
34
16
0
02 Nov 2020
Cooperative Heterogeneous Deep Reinforcement Learning
Cooperative Heterogeneous Deep Reinforcement Learning
Han Zheng
Pengfei Wei
Jing Jiang
Guodong Long
Qinghua Lu
Chengqi Zhang
51
12
0
02 Nov 2020
Reinforcement Learning of Causal Variables Using Mediation Analysis
Reinforcement Learning of Causal Variables Using Mediation Analysis
Tue Herlau
Rasmus Larsen
OOD
CML
34
8
0
29 Oct 2020
Understanding the Pathologies of Approximate Policy Evaluation when
  Combined with Greedification in Reinforcement Learning
Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning
Kenny Young
R. Sutton
13
8
0
28 Oct 2020
Refactoring Policy for Compositional Generalizability using
  Self-Supervised Object Proposals
Refactoring Policy for Compositional Generalizability using Self-Supervised Object Proposals
Tongzhou Mu
Jiayuan Gu
Zhiwei Jia
Hao Tang
Hao Su
27
13
0
26 Oct 2020
Meta-trained agents implement Bayes-optimal agents
Meta-trained agents implement Bayes-optimal agents
Vladimir Mikulik
Grégoire Delétang
Tom McGrath
Tim Genewein
Miljan Martic
Shane Legg
Pedro A. Ortega
OOD
FedML
37
41
0
21 Oct 2020
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for
  Payment Fraud Systems in Retail Banking
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking
Hongda Shen
Eren Kurshan
40
20
0
21 Oct 2020
Improving Generalization in Reinforcement Learning with Mixture
  Regularization
Improving Generalization in Reinforcement Learning with Mixture Regularization
Kaixin Wang
Bingyi Kang
Jie Shao
Jiashi Feng
112
117
0
21 Oct 2020
Negotiating Team Formation Using Deep Reinforcement Learning
Negotiating Team Formation Using Deep Reinforcement Learning
Yoram Bachrach
Richard Everett
Edward Hughes
Angeliki Lazaridou
Joel Z. Leibo
Marc Lanctot
Michael Bradley Johanson
Wojciech M. Czarnecki
T. Graepel
48
35
0
20 Oct 2020
Model-free conventions in multi-agent reinforcement learning with
  heterogeneous preferences
Model-free conventions in multi-agent reinforcement learning with heterogeneous preferences
Raphael Köster
Kevin R. McKee
Richard Everett
Laura Weidinger
William S. Isaac
Edward Hughes
Edgar A. Duénez-Guzmán
T. Graepel
M. Botvinick
Joel Z. Leibo
32
22
0
18 Oct 2020
A Learning Approach to Robot-Agnostic Force-Guided High Precision
  Assembly
A Learning Approach to Robot-Agnostic Force-Guided High Precision Assembly
Jieliang Luo
Hui Li
20
18
0
15 Oct 2020
Local Search for Policy Iteration in Continuous Control
Local Search for Policy Iteration in Continuous Control
Jost Tobias Springenberg
N. Heess
D. Mankowitz
J. Merel
Arunkumar Byravan
...
Julian Schrittwieser
Yuval Tassa
J. Buchli
Dan Belov
Martin Riedmiller
OffRL
22
15
0
12 Oct 2020
Learning Intrinsic Symbolic Rewards in Reinforcement Learning
Learning Intrinsic Symbolic Rewards in Reinforcement Learning
Hassam Sheikh
Shauharda Khadka
Santiago Miret
Somdeb Majumdar
OffRL
29
7
0
08 Oct 2020
Human-Level Performance in No-Press Diplomacy via Equilibrium Search
Human-Level Performance in No-Press Diplomacy via Equilibrium Search
Jonathan Gray
Adam Lerer
A. Bakhtin
Noam Brown
15
50
0
06 Oct 2020
Episodic Memory for Learning Subjective-Timescale Models
Episodic Memory for Learning Subjective-Timescale Models
Alexey Zakharov
Matthew Crosby
Zafeirios Fountas
16
4
0
03 Oct 2020
Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for
  Physically Embedded 3D Sokoban
Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban
Peter Karkus
M. Berk Mirza
A. Guez
Andrew Jaegle
Timothy Lillicrap
Lars Buesing
N. Heess
T. Weber
OffRL
22
8
0
03 Oct 2020
D3C: Reducing the Price of Anarchy in Multi-Agent Learning
D3C: Reducing the Price of Anarchy in Multi-Agent Learning
I. Gemp
Kevin R. McKee
Richard Everett
Edgar A. Duénez-Guzmán
Yoram Bachrach
David Balduzzi
Andrea Tacchetti
17
17
0
01 Oct 2020
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Vihang Patil
M. Hofmarcher
Marius-Constantin Dinu
Matthias Dorfer
P. Blies
Johannes Brandstetter
Jose A. Arjona-Medina
Sepp Hochreiter
27
42
0
29 Sep 2020
Lucid Dreaming for Experience Replay: Refreshing Past States with the
  Current Policy
Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy
Yunshu Du
Garrett A. Warnell
A. Gebremedhin
Peter Stone
Matthew E. Taylor
34
10
0
29 Sep 2020
Enhancing Continuous Control of Mobile Robots for End-to-End Visual
  Active Tracking
Enhancing Continuous Control of Mobile Robots for End-to-End Visual Active Tracking
Alessandro Devo
Alberto Dionigi
G. Costante
14
25
0
28 Sep 2020
Distributed Structured Actor-Critic Reinforcement Learning for Universal
  Dialogue Management
Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management
Zhi Chen
Lu Chen
Xiaoyuan Liu
Kai Yu
41
20
0
22 Sep 2020
Efficient Reinforcement Learning Development with RLzoo
Efficient Reinforcement Learning Development with RLzoo
Zihan Ding
Tianyang Yu
Yanhua Huang
Hongming Zhang
Guo Li
Quancheng Guo
Luo Mai
Hao Dong
OffRL
OnRL
15
6
0
18 Sep 2020
Physically Embedded Planning Problems: New Challenges for Reinforcement
  Learning
Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
M. Berk Mirza
Andrew Jaegle
Jonathan J. Hunt
A. Guez
S. Tunyasuvunakool
...
Peter Karkus
S. Racanière
Lars Buesing
Timothy Lillicrap
N. Heess
AI4CE
31
12
0
11 Sep 2020
Multi-Task Learning with Deep Neural Networks: A Survey
Multi-Task Learning with Deep Neural Networks: A Survey
M. Crawshaw
CVBM
55
610
0
10 Sep 2020
Importance Weighted Policy Learning and Adaptation
Importance Weighted Policy Learning and Adaptation
Alexandre Galashov
Jakub Sygnowski
Guillaume Desjardins
Jan Humplik
Leonard Hasenclever
Rae Jeong
Yee Whye Teh
N. Heess
OffRL
21
1
0
10 Sep 2020
Phasic Policy Gradient
Phasic Policy Gradient
K. Cobbe
Jacob Hilton
Oleg Klimov
John Schulman
OffRL
15
153
0
09 Sep 2020
Grounded Language Learning Fast and Slow
Grounded Language Learning Fast and Slow
Felix Hill
O. Tieleman
Tamara von Glehn
Nathaniel Wong
Hamza Merzic
S. Clark
LM&Ro
37
77
0
03 Sep 2020
Sample-Efficient Automated Deep Reinforcement Learning
Sample-Efficient Automated Deep Reinforcement Learning
Jörg Franke
Gregor Koehler
André Biedenkapp
Frank Hutter
31
39
0
03 Sep 2020
Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments
  using A3C learning and Residual Recurrent Neural Networks
Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments using A3C learning and Residual Recurrent Neural Networks
Shreshth Tuli
Shashikant Ilager
K. Ramamohanarao
Rajkumar Buyya
27
176
0
01 Sep 2020
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning
  Systems
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems
Vinicius G. Goecks
33
11
0
30 Aug 2020
Deep Reinforcement Learning for Field Development Optimization
Deep Reinforcement Learning for Field Development Optimization
Y. Nasir
16
1
0
05 Aug 2020
WordCraft: An Environment for Benchmarking Commonsense Agents
WordCraft: An Environment for Benchmarking Commonsense Agents
Minqi Jiang
Jelena Luketina
Nantas Nardelli
Pasquale Minervini
Philip Torr
Shimon Whiteson
Tim Rocktaschel
LLMAG
OffRL
22
22
0
17 Jul 2020
Discovering Reinforcement Learning Algorithms
Discovering Reinforcement Learning Algorithms
Junhyuk Oh
Matteo Hessel
Wojciech M. Czarnecki
Zhongwen Xu
H. V. Hasselt
Satinder Singh
David Silver
29
126
0
17 Jul 2020
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
27
77
0
16 Jul 2020
Distributed Reinforcement Learning of Targeted Grasping with Active
  Vision for Mobile Manipulators
Distributed Reinforcement Learning of Targeted Grasping with Active Vision for Mobile Manipulators
Yasuhiro Fujita
Kota Uenishi
Avinash Ummadisingu
P. Nagarajan
Shimpei Masuda
M. Castro
32
18
0
16 Jul 2020
Lifelong Learning using Eigentasks: Task Separation, Skill Acquisition,
  and Selective Transfer
Lifelong Learning using Eigentasks: Task Separation, Skill Acquisition, and Selective Transfer
Aswin Raghavan
Jesse Hostetler
Indranil Sur
Abrar Rahman
Ajay Divakaran
CLL
24
7
0
14 Jul 2020
Relational-Grid-World: A Novel Relational Reasoning Environment and An
  Agent Model for Relational Information Extraction
Relational-Grid-World: A Novel Relational Reasoning Environment and An Agent Model for Relational Information Extraction
Faruk Küçüksubasi
Elif Surer
24
2
0
12 Jul 2020
Learning Retrospective Knowledge with Reverse Reinforcement Learning
Learning Retrospective Knowledge with Reverse Reinforcement Learning
Shangtong Zhang
Vivek Veeriah
Shimon Whiteson
OffRL
AI4TS
16
13
0
09 Jul 2020
Tracking-by-Trackers with a Distilled and Reinforced Model
Tracking-by-Trackers with a Distilled and Reinforced Model
Matteo Dunnhofer
N. Martinel
C. Micheloni
VOT
OffRL
27
4
0
08 Jul 2020
Guided Exploration with Proximal Policy Optimization using a Single
  Demonstration
Guided Exploration with Proximal Policy Optimization using a Single Demonstration
Gabriele Libardi
Gianni De Fabritiis
12
24
0
07 Jul 2020
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Joshua Romoff
Peter Henderson
David Kanaa
Emmanuel Bengio
Ahmed Touati
Pierre-Luc Bacon
Joelle Pineau
26
3
0
06 Jul 2020
Scaling Imitation Learning in Minecraft
Scaling Imitation Learning in Minecraft
Artemij Amiranashvili
Nicolai Dorka
Wolfram Burgard
V. Koltun
Thomas Brox
MLAU
21
15
0
06 Jul 2020
Integrating Distributed Architectures in Highly Modular RL Libraries
Integrating Distributed Architectures in Highly Modular RL Libraries
Albert Bou
Sebastian Dittert
Gianni De Fabritiis
31
0
0
06 Jul 2020
Verifiably Safe Exploration for End-to-End Reinforcement Learning
Verifiably Safe Exploration for End-to-End Reinforcement Learning
Nathan Hunt
Nathan Fulton
Sara Magliacane
Nghia Hoang
Subhro Das
Armando Solar-Lezama
OffRL
17
49
0
02 Jul 2020
Gradient Temporal-Difference Learning with Regularized Corrections
Gradient Temporal-Difference Learning with Regularized Corrections
Sina Ghiassian
Andrew Patterson
Shivam Garg
Dhawal Gupta
Adam White
Martha White
16
42
0
01 Jul 2020
Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
Adam Stooke
Valentin Dalibard
Siddhant M. Jayakumar
Wojciech M. Czarnecki
Max Jaderberg
22
1
0
26 Jun 2020
The NetHack Learning Environment
The NetHack Learning Environment
Heinrich Küttler
Nantas Nardelli
Alexander H. Miller
Roberta Raileanu
Marco Selvatici
Edward Grefenstette
Tim Rocktaschel
28
177
0
24 Jun 2020
Previous
123...131415...181920
Next