Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.01561
Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 982 papers shown
Title
Communication Efficient Parallel Reinforcement Learning
Mridul Agarwal
Bhargav Ganguly
Vaneet Aggarwal
58
9
0
22 Feb 2021
Decoupling Value and Policy for Generalization in Reinforcement Learning
Roberta Raileanu
Rob Fergus
DRL
OffRL
24
95
0
20 Feb 2021
On Proximal Policy Optimization's Heavy-tailed Gradients
Saurabh Garg
Joshua Zhanson
Emilio Parisotto
Adarsh Prasad
J. Zico Kolter
Zachary Chase Lipton
Sivaraman Balakrishnan
Ruslan Salakhutdinov
Pradeep Ravikumar
27
11
0
20 Feb 2021
Adaptive Rational Activations to Boost Deep Reinforcement Learning
Quentin Delfosse
P. Schramowski
Martin Mundt
Alejandro Molina
Kristian Kersting
42
15
0
18 Feb 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm
S. Khodadadian
Zaiwei Chen
S. T. Maguluri
CML
OffRL
74
26
0
18 Feb 2021
End-to-End Egospheric Spatial Memory
Daniel Lenton
Stephen James
R. Clark
Andrew J. Davison
24
5
0
15 Feb 2021
Sparse Attention Guided Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning
Jaskirat Singh
Liang Zheng
OffRL
21
3
0
14 Feb 2021
Modelling Cooperation in Network Games with Spatio-Temporal Complexity
Michiel A. Bakker
Richard Everett
Laura Weidinger
Iason Gabriel
William S. Isaac
Joel Z. Leibo
Edward Hughes
16
5
0
13 Feb 2021
Discovery of Options via Meta-Learned Subgoals
Vivek Veeriah
Tom Zahavy
Matteo Hessel
Zhongwen Xu
Junhyuk Oh
Iurii Kemaev
H. V. Hasselt
David Silver
Satinder Singh
29
33
0
12 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
25
12
0
09 Feb 2021
Reverb: A Framework For Experience Replay
Albin Cassirer
Gabriel Barth-Maron
E. Brevdo
Sabela Ramos
Toby Boyd
Thibault Sottiaux
M. Kroiss
VLM
OffRL
32
38
0
09 Feb 2021
Adversarially Guided Actor-Critic
Yannis Flet-Berliac
Johan Ferret
Olivier Pietquin
Philippe Preux
M. Geist
35
71
0
08 Feb 2021
Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning
Zhengyao Jiang
Pasquale Minervini
Minqi Jiang
Tim Rocktaschel
AI4CE
24
7
0
08 Feb 2021
Alchemy: A benchmark and analysis toolkit for meta-reinforcement learning agents
Jane X. Wang
Michael King
Nicolas Porcel
Z. Kurth-Nelson
Tina Zhu
...
Neil C. Rabinowitz
Loic Matthey
Demis Hassabis
Alexander Lerchner
M. Botvinick
OffRL
23
30
0
04 Feb 2021
Neural Recursive Belief States in Multi-Agent Reinforcement Learning
Pol Moreno
Edward Hughes
Kevin R. McKee
Bernardo Avila-Pires
T. Weber
31
21
0
03 Feb 2021
A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants
Zaiwei Chen
S. T. Maguluri
Sanjay Shakkottai
Karthikeyan Shanmugam
OffRL
105
54
0
02 Feb 2021
Acting in Delayed Environments with Non-Stationary Markov Policies
E. Derman
Gal Dalal
Shie Mannor
27
34
0
28 Jan 2021
Finite Sample Analysis of Two-Time-Scale Natural Actor-Critic Algorithm
S. Khodadadian
Thinh T. Doan
Justin Romberg
S. T. Maguluri
40
42
0
26 Jan 2021
GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning
Juhyoung Lee
Sangyeob Kim
Sangjin Kim
Wooyoung Jo
H. Yoo
OffRL
29
9
0
24 Jan 2021
Evaluating Soccer Player: from Live Camera to Deep Reinforcement Learning
Paul Garnier
T. Gregoir
OffRL
27
12
0
13 Jan 2021
Asymmetric self-play for automatic goal discovery in robotic manipulation
OpenAI OpenAI
Matthias Plappert
Raul Sampedro
Tao Xu
Ilge Akkaya
...
Hyeonwoo Noh
Lilian Weng
Qiming Yuan
Casey Chu
Wojciech Zaremba
SSL
82
76
0
13 Jan 2021
Geometric Entropic Exploration
Z. Guo
M. G. Azar
Alaa Saade
S. Thakoor
Bilal Piot
Bernardo Avila-Pires
Michal Valko
Thomas Mesnard
Tor Lattimore
Rémi Munos
38
30
0
06 Jan 2021
Reinforcement Learning with Latent Flow
Wenling Shang
Xiaofei Wang
A. Srinivas
Aravind Rajeswaran
Yang Gao
Pieter Abbeel
Michael Laskin
OffRL
31
23
0
06 Jan 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
60
73
0
01 Jan 2021
Towards Understanding Asynchronous Advantage Actor-critic: Convergence and Linear Speedup
Han Shen
Kaipeng Zhang
Min-Fong Hong
Tianyi Chen
35
28
0
31 Dec 2020
Towards Continual Reinforcement Learning: A Review and Perspectives
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLL
OffRL
56
311
0
25 Dec 2020
Auto-Agent-Distiller: Towards Efficient Deep Reinforcement Learning Agents via Neural Architecture Search
Y. Fu
Zhongzhi Yu
Yongan Zhang
Yingyan Lin
27
4
0
24 Dec 2020
Augmenting Policy Learning with Routines Discovered from a Single Demonstration
Zelin Zhao
Chuang Gan
Jiajun Wu
Xiaoxiao Guo
J. Tenenbaum
OffRL
24
5
0
23 Dec 2020
Learning to Play Imperfect-Information Games by Imitating an Oracle Planner
Rinu Boney
Alexander Ilin
Arno Solin
Jarno Seppänen
9
3
0
22 Dec 2020
High-Throughput Synchronous Deep RL
Iou-Jen Liu
Raymond A. Yeh
Alex Schwing
OffRL
30
12
0
17 Dec 2020
Planning from Pixels in Atari with Learned Symbolic Representations
Andrea Dittadi
Frederik K. Drachmann
Thomas Bolander
28
11
0
16 Dec 2020
How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget
Erik Wijmans
Irfan Essa
Dhruv Batra
3DPC
32
10
0
11 Dec 2020
Flatland-RL : Multi-Agent Reinforcement Learning on Trains
Sharada Mohanty
Erik Nygren
Florian Laurent
Manuel Schneider
Christian Scheller
...
Christian Baumberger
Gereon Vienken
Irene Sturm
Guillaume Sartoretti
G. Spigler
OffRL
47
58
0
10 Dec 2020
Imitating Interactive Intelligence
Josh Abramson
Arun Ahuja
Iain Barr
Arthur Brussee
Federico Carnevale
...
Greg Wayne
Duncan Williams
Nathaniel Wong
Chen Yan
Rui Zhu
LM&Ro
24
71
0
10 Dec 2020
An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search
Kyunghyun Lee
Byeong-uk Lee
Ukcheol Shin
In So Kweon
30
22
0
10 Dec 2020
The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems
A. Inci
Evgeny Bolotin
Yaosheng Fu
Gal Dalal
Shie Mannor
D. Nellans
Diana Marculescu
AI4CE
17
13
0
08 Dec 2020
Planning from Pixels using Inverse Dynamics Models
Keiran Paster
Sheila A. McIlraith
Jimmy Ba
BDL
14
41
0
04 Dec 2020
Optimizing the Neural Architecture of Reinforcement Learning Agents
Nina Mazyavkina
S. Moustafa
I. Trofimov
Evgeny Burnaev
AI4CE
17
4
0
30 Nov 2020
Reinforcement Learning for Robust Missile Autopilot Design
Bernardo Cortez
16
2
0
26 Nov 2020
TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Peng Sun
Jiechao Xiong
Lei Han
Xinghai Sun
Shuxing Li
Jiawei Xu
Meng Fang
Zhengyou Zhang
OffRL
LRM
33
19
0
25 Nov 2020
RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem
Eric Liang
Zhanghao Wu
Michael Luo
Sven Mika
Joseph E. Gonzalez
Ion Stoica
AI4CE
23
9
0
25 Nov 2020
Towards Playing Full MOBA Games with Deep Reinforcement Learning
Deheng Ye
Guibin Chen
Wen Zhang
Sheng Chen
Bo Yuan
...
Tengfei Shi
Qiang Fu
Wei Yang
Lanxiao Huang
Wei Liu
22
180
0
25 Nov 2020
Enhanced Scene Specificity with Sparse Dynamic Value Estimation
Jaskirat Singh
Liang Zheng
29
0
0
25 Nov 2020
Generative Adversarial Simulator
Jonathan Raiman
GAN
13
0
0
23 Nov 2020
Distributed Deep Reinforcement Learning: An Overview
Mohammad Reza Samsami
Hossein Alimadad
OffRL
14
27
0
22 Nov 2020
Using Unity to Help Solve Intelligence
Tom Ward
Andrew Bolt
Nik Hemmings
Simon Carter
Manuel Sanchez
...
Jay Lemmon
J. Coe
Piotr Trochim
T. Handley
Adrian Bolton
27
18
0
18 Nov 2020
Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and Benchmarking
Fabio Pardo
OffRL
10
31
0
15 Nov 2020
Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning
Jiajun Fan
He Ba
Xian Guo
Jianye Hao
OffRL
21
5
0
13 Nov 2020
Hierarchical Reinforcement Learning for Relay Selection and Power Optimization in Two-Hop Cooperative Relay Network
Yuanzhe Geng
Erwu Liu
Rui Wang
Yiming Liu
16
0
0
10 Nov 2020
Deep reinforcement learning for RAN optimization and control
Yu Chen
Jie Chen
G. Krishnamurthi
Huijing Yang
Huahui Wang
Wenjie Zhao
19
1
0
09 Nov 2020
Previous
1
2
3
...
12
13
14
...
18
19
20
Next