ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.01561
  4. Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures
v1v2v3 (latest)

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 1,000 papers shown
Title
Flatland-RL : Multi-Agent Reinforcement Learning on Trains
Flatland-RL : Multi-Agent Reinforcement Learning on Trains
Sharada Mohanty
Erik Nygren
Florian Laurent
Manuel Schneider
Christian Scheller
...
Christian Baumberger
Gereon Vienken
Irene Sturm
Guillaume Sartoretti
G. Spigler
OffRL
98
58
0
10 Dec 2020
Imitating Interactive Intelligence
Imitating Interactive Intelligence
Josh Abramson
Arun Ahuja
Iain Barr
Arthur Brussee
Federico Carnevale
...
Greg Wayne
Duncan Williams
Nathaniel Wong
Chen Yan
Rui Zhu
LM&Ro
106
71
0
10 Dec 2020
An Efficient Asynchronous Method for Integrating Evolutionary and
  Gradient-based Policy Search
An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search
Kyunghyun Lee
Byeong-uk Lee
Ukcheol Shin
In So Kweon
163
23
0
10 Dec 2020
The Architectural Implications of Distributed Reinforcement Learning on
  CPU-GPU Systems
The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems
A. Inci
Evgeny Bolotin
Yaosheng Fu
Gal Dalal
Shie Mannor
D. Nellans
Diana Marculescu
AI4CE
48
13
0
08 Dec 2020
Planning from Pixels using Inverse Dynamics Models
Planning from Pixels using Inverse Dynamics Models
Keiran Paster
Sheila A. McIlraith
Jimmy Ba
BDL
75
41
0
04 Dec 2020
Optimizing the Neural Architecture of Reinforcement Learning Agents
Optimizing the Neural Architecture of Reinforcement Learning Agents
Nina Mazyavkina
S. Moustafa
I. Trofimov
Evgeny Burnaev
AI4CE
78
4
0
30 Nov 2020
Reinforcement Learning for Robust Missile Autopilot Design
Reinforcement Learning for Robust Missile Autopilot Design
Bernardo Cortez
25
2
0
26 Nov 2020
TLeague: A Framework for Competitive Self-Play based Distributed
  Multi-Agent Reinforcement Learning
TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Peng Sun
Jiechao Xiong
Lei Han
Xinghai Sun
Shuxing Li
Jiawei Xu
Meng Fang
Zhengyou Zhang
OffRLLRM
81
19
0
25 Nov 2020
RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem
RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem
Eric Liang
Zhanghao Wu
Michael Luo
Sven Mika
Joseph E. Gonzalez
Ion Stoica
AI4CE
74
12
0
25 Nov 2020
Towards Playing Full MOBA Games with Deep Reinforcement Learning
Towards Playing Full MOBA Games with Deep Reinforcement Learning
Deheng Ye
Guibin Chen
Wen Zhang
Sheng Chen
Bo Yuan
...
Tengfei Shi
Qiang Fu
Wei Yang
Lanxiao Huang
Wei Liu
97
188
0
25 Nov 2020
Enhanced Scene Specificity with Sparse Dynamic Value Estimation
Enhanced Scene Specificity with Sparse Dynamic Value Estimation
Jaskirat Singh
Liang Zheng
43
0
0
25 Nov 2020
Generative Adversarial Simulator
Generative Adversarial Simulator
Jonathan Raiman
GAN
19
0
0
23 Nov 2020
Distributed Deep Reinforcement Learning: An Overview
Distributed Deep Reinforcement Learning: An Overview
Mohammad Reza Samsami
Hossein Alimadad
OffRL
43
27
0
22 Nov 2020
Using Unity to Help Solve Intelligence
Using Unity to Help Solve Intelligence
Tom Ward
Andrew Bolt
Nik Hemmings
Simon Carter
Manuel Sanchez
...
Jay Lemmon
J. Coe
Piotr Trochim
T. Handley
Adrian Bolton
71
18
0
18 Nov 2020
Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and
  Benchmarking
Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and Benchmarking
Fabio Pardo
OffRL
69
31
0
15 Nov 2020
Critic PI2: Master Continuous Planning via Policy Improvement with Path
  Integrals and Deep Actor-Critic Reinforcement Learning
Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning
Jiajun Fan
He Ba
Xian Guo
Jianye Hao
OffRL
49
5
0
13 Nov 2020
Hierarchical Reinforcement Learning for Relay Selection and Power
  Optimization in Two-Hop Cooperative Relay Network
Hierarchical Reinforcement Learning for Relay Selection and Power Optimization in Two-Hop Cooperative Relay Network
Yuanzhe Geng
Erwu Liu
Rui Wang
Yiming Liu
22
0
0
10 Nov 2020
Deep reinforcement learning for RAN optimization and control
Deep reinforcement learning for RAN optimization and control
Yu Chen
Jie Chen
G. Krishnamurthi
Huijing Yang
Huahui Wang
Wenjie Zhao
39
1
0
09 Nov 2020
From Eye-blinks to State Construction: Diagnostic Benchmarks for Online
  Representation Learning
From Eye-blinks to State Construction: Diagnostic Benchmarks for Online Representation Learning
Banafsheh Rafiee
Zaheer Abbas
Sina Ghiassian
Raksha Kumaraswamy
R. Sutton
Elliot A. Ludvig
Adam White
OffRL
67
17
0
09 Nov 2020
Hybrid Supervised Reinforced Model for Dialogue Systems
Hybrid Supervised Reinforced Model for Dialogue Systems
Carlos Miranda
Y. Kessaci
BDLOffRL
25
0
0
04 Nov 2020
A Study of Policy Gradient on a Class of Exactly Solvable Models
A Study of Policy Gradient on a Class of Exactly Solvable Models
Gavin McCracken
Colin Daniels
Rosie Zhao
Anna M. Brandenberger
Prakash Panangaden
Doina Precup
45
0
0
03 Nov 2020
Instance based Generalization in Reinforcement Learning
Instance based Generalization in Reinforcement Learning
Martín Bertrán
Natalia Martínez
Mariano Phielipp
Guillermo Sapiro
OffRL
105
17
0
02 Nov 2020
Cooperative Heterogeneous Deep Reinforcement Learning
Cooperative Heterogeneous Deep Reinforcement Learning
Han Zheng
Pengfei Wei
Jing Jiang
Guodong Long
Qinghua Lu
Chengqi Zhang
96
12
0
02 Nov 2020
Reinforcement Learning of Causal Variables Using Mediation Analysis
Reinforcement Learning of Causal Variables Using Mediation Analysis
Tue Herlau
Rasmus Larsen
OODCML
66
8
0
29 Oct 2020
Understanding the Pathologies of Approximate Policy Evaluation when
  Combined with Greedification in Reinforcement Learning
Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning
Kenny Young
R. Sutton
48
9
0
28 Oct 2020
Refactoring Policy for Compositional Generalizability using
  Self-Supervised Object Proposals
Refactoring Policy for Compositional Generalizability using Self-Supervised Object Proposals
Tongzhou Mu
Jiayuan Gu
Zhiwei Jia
Hao Tang
Hao Su
77
13
0
26 Oct 2020
Meta-trained agents implement Bayes-optimal agents
Meta-trained agents implement Bayes-optimal agents
Vladimir Mikulik
Grégoire Delétang
Tom McGrath
Tim Genewein
Miljan Martic
Shane Legg
Pedro A. Ortega
OODFedML
93
43
0
21 Oct 2020
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for
  Payment Fraud Systems in Retail Banking
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking
Hongda Shen
Eren Kurshan
72
21
0
21 Oct 2020
Improving Generalization in Reinforcement Learning with Mixture
  Regularization
Improving Generalization in Reinforcement Learning with Mixture Regularization
Kaixin Wang
Bingyi Kang
Jie Shao
Jiashi Feng
188
120
0
21 Oct 2020
Negotiating Team Formation Using Deep Reinforcement Learning
Negotiating Team Formation Using Deep Reinforcement Learning
Yoram Bachrach
Richard Everett
Edward Hughes
Angeliki Lazaridou
Joel Z Leibo
Marc Lanctot
Michael Bradley Johanson
Wojciech M. Czarnecki
T. Graepel
109
36
0
20 Oct 2020
Model-free conventions in multi-agent reinforcement learning with
  heterogeneous preferences
Model-free conventions in multi-agent reinforcement learning with heterogeneous preferences
Raphael Köster
Kevin R. McKee
Richard Everett
Laura Weidinger
William S. Isaac
Edward Hughes
Edgar A. Duénez-Guzmán
T. Graepel
M. Botvinick
Joel Z Leibo
90
23
0
18 Oct 2020
A Learning Approach to Robot-Agnostic Force-Guided High Precision
  Assembly
A Learning Approach to Robot-Agnostic Force-Guided High Precision Assembly
Jieliang Luo
Hui Li
101
18
0
15 Oct 2020
Local Search for Policy Iteration in Continuous Control
Local Search for Policy Iteration in Continuous Control
Jost Tobias Springenberg
N. Heess
D. Mankowitz
J. Merel
Arunkumar Byravan
...
Julian Schrittwieser
Yuval Tassa
J. Buchli
Dan Belov
Martin Riedmiller
OffRL
82
15
0
12 Oct 2020
Learning Intrinsic Symbolic Rewards in Reinforcement Learning
Learning Intrinsic Symbolic Rewards in Reinforcement Learning
Hassam Sheikh
Shauharda Khadka
Santiago Miret
Somdeb Majumdar
OffRL
69
7
0
08 Oct 2020
Human-Level Performance in No-Press Diplomacy via Equilibrium Search
Human-Level Performance in No-Press Diplomacy via Equilibrium Search
Jonathan Gray
Adam Lerer
A. Bakhtin
Noam Brown
123
51
0
06 Oct 2020
Episodic Memory for Learning Subjective-Timescale Models
Episodic Memory for Learning Subjective-Timescale Models
Alexey Zakharov
Matthew Crosby
Zafeirios Fountas
41
4
0
03 Oct 2020
Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for
  Physically Embedded 3D Sokoban
Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban
Peter Karkus
M. Berk Mirza
A. Guez
Andrew Jaegle
Timothy Lillicrap
Lars Buesing
N. Heess
T. Weber
OffRL
67
8
0
03 Oct 2020
D3C: Reducing the Price of Anarchy in Multi-Agent Learning
D3C: Reducing the Price of Anarchy in Multi-Agent Learning
I. Gemp
Kevin R. McKee
Richard Everett
Edgar A. Duénez-Guzmán
Yoram Bachrach
David Balduzzi
Andrea Tacchetti
122
18
0
01 Oct 2020
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Vihang Patil
M. Hofmarcher
Marius-Constantin Dinu
Matthias Dorfer
P. Blies
Johannes Brandstetter
Jose A. Arjona-Medina
Sepp Hochreiter
140
44
0
29 Sep 2020
Lucid Dreaming for Experience Replay: Refreshing Past States with the
  Current Policy
Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy
Yunshu Du
Garrett A. Warnell
A. Gebremedhin
Peter Stone
Matthew E. Taylor
58
11
0
29 Sep 2020
Enhancing Continuous Control of Mobile Robots for End-to-End Visual
  Active Tracking
Enhancing Continuous Control of Mobile Robots for End-to-End Visual Active Tracking
Alessandro Devo
Alberto Dionigi
G. Costante
29
27
0
28 Sep 2020
Distributed Structured Actor-Critic Reinforcement Learning for Universal
  Dialogue Management
Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management
Zhi Chen
Lu Chen
Xiaoyuan Liu
Kai Yu
87
20
0
22 Sep 2020
Efficient Reinforcement Learning Development with RLzoo
Efficient Reinforcement Learning Development with RLzoo
Zihan Ding
Tianyang Yu
Yanhua Huang
Hongming Zhang
Guo Li
Quancheng Guo
Kai Zou
Hao Dong
OffRLOnRL
44
6
0
18 Sep 2020
Physically Embedded Planning Problems: New Challenges for Reinforcement
  Learning
Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
M. Berk Mirza
Andrew Jaegle
Jonathan J. Hunt
A. Guez
S. Tunyasuvunakool
...
Peter Karkus
S. Racanière
Lars Buesing
Timothy Lillicrap
N. Heess
AI4CE
79
12
0
11 Sep 2020
Multi-Task Learning with Deep Neural Networks: A Survey
Multi-Task Learning with Deep Neural Networks: A Survey
M. Crawshaw
CVBM
234
630
0
10 Sep 2020
Importance Weighted Policy Learning and Adaptation
Importance Weighted Policy Learning and Adaptation
Alexandre Galashov
Jakub Sygnowski
Guillaume Desjardins
Jan Humplik
Leonard Hasenclever
Rae Jeong
Yee Whye Teh
N. Heess
OffRL
88
1
0
10 Sep 2020
Phasic Policy Gradient
Phasic Policy Gradient
K. Cobbe
Jacob Hilton
Oleg Klimov
John Schulman
OffRL
100
160
0
09 Sep 2020
Grounded Language Learning Fast and Slow
Grounded Language Learning Fast and Slow
Felix Hill
O. Tieleman
Tamara von Glehn
Nathaniel Wong
Hamza Merzic
S. Clark
LM&Ro
166
81
0
03 Sep 2020
Sample-Efficient Automated Deep Reinforcement Learning
Sample-Efficient Automated Deep Reinforcement Learning
Jörg Franke
Gregor Koehler
André Biedenkapp
Frank Hutter
114
42
0
03 Sep 2020
Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments
  using A3C learning and Residual Recurrent Neural Networks
Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments using A3C learning and Residual Recurrent Neural Networks
Shreshth Tuli
Shashikant Ilager
K. Ramamohanarao
Rajkumar Buyya
69
179
0
01 Sep 2020
Previous
123...131415...181920
Next