arXiv:1802.01561
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Papers citing
"IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 982 papers shown
On the Use and Misuse of Absorbing States in Multi-agent Reinforcement Learning
Andrew Cohen
Ervin Teng
Vincent-Pierre Berges
Ruo-Ping Dong
Hunter Henry
Marwan Mattar
Alexander Zook
Sujoy Ganguly
24
33
0
10 Nov 2021
Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning
Wenlong Huang
Igor Mordatch
Pieter Abbeel
Deepak Pathak
45
63
0
04 Nov 2021
A System for General In-Hand Object Re-Orientation
Tao Chen
Jie Xu
Pulkit Agrawal
45
251
0
04 Nov 2021
Towards an Understanding of Default Policies in Multitask Policy Optimization
Theodore H. Moskovitz
Michael Arbel
Jack Parker-Holder
Aldo Pacchiano
30
9
0
04 Nov 2021
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch
Shangtong Zhang
Rémi Tachet des Combes
Romain Laroche
35
10
0
04 Nov 2021
Human-Level Control without Server-Grade Hardware
Brett Daley
Chris Amato
BDL
OffRL
13
0
0
01 Nov 2021
Generalized Proximal Policy Optimization with Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
42
47
0
29 Oct 2021
Wasserstein Distance Maximizing Intrinsic Control
Ishan Durugkar
Steven Hansen
Stephen Spencer
Volodymyr Mnih
26
6
0
28 Oct 2021
Self-Consistent Models and Values
Gregory Farquhar
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
38
8
0
25 Oct 2021
A Distributed Deep Reinforcement Learning Technique for Application Placement in Edge and Fog Computing Environments
M. Goudarzi
M. Palaniswami
Rajkumar Buyya
OffRL
40
85
0
24 Oct 2021
Fully Distributed Actor-Critic Architecture for Multitask Deep Reinforcement Learning
John Harwell
Angel Sylvester
Aleksi Tukiainen
Enrique Munoz de Cote
31
4
0
23 Oct 2021
A Versatile and Efficient Reinforcement Learning Framework for Autonomous Driving
Guan-Bo Wang
Haoyi Niu
Desheng Zhu
Jianming Hu
Xianyuan Zhan
Guyue Zhou
OffRL
27
2
0
22 Oct 2021
Statistical discrimination in learning agents
Edgar A. Duéñez-Guzmán
Kevin R. McKee
Yiran Mao
Ben Coppin
Silvia Chiappa
...
Yoram Bachrach
Suzanne Sadedin
William S. Isaac
K. Tuyls
Joel Z. Leibo
47
7
0
21 Oct 2021
On games and simulators as a platform for development of artificial intelligence for command and control
Vinicius G. Goecks
Nicholas R. Waytowich
Derrik E. Asher
Song Jun Park
Mark R. Mittrick
...
Anne Logie
Mark S. Dennison
T. Trout
Priya Narayanan
Alexander Kott
41
26
0
21 Oct 2021
SILG: The Multi-environment Symbolic Interactive Language Grounding Benchmark
Victor Zhong
Austin W. Hanjie
Sida Wang
Karthik Narasimhan
Luke Zettlemoyer
19
12
0
20 Oct 2021
CORA: Benchmarks, Baselines, and Metrics as a Platform for Continual Reinforcement Learning Agents
Sam Powers
Eliot Xing
Eric Kolve
Roozbeh Mottaghi
Abhinav Gupta
OffRL
36
38
0
19 Oct 2021
Variance Reduction based Experience Replay for Policy Optimization
Hua Zheng
Wei Xie
M. Feng
OffRL
41
2
0
17 Oct 2021
Collaborating with Humans without Human Data
D. Strouse
Kevin R. McKee
M. Botvinick
Edward Hughes
Richard Everett
124
161
0
15 Oct 2021
Containerized Distributed Value-Based Multi-Agent Reinforcement Learning
Siyang Wu
Tonghan Wang
Chenghao Li
Yang Hu
Chongjie Zhang
OffRL
29
1
0
15 Oct 2021
Safe Driving via Expert Guided Policy Optimization
Zhenghao Peng
Quanyi Li
Chunxiao Liu
Bolei Zhou
OffRL
31
41
0
13 Oct 2021
Feudal Reinforcement Learning by Reading Manuals
Kai Wang
Zhonghao Wang
Mo Yu
Humphrey Shi
OffRL
48
0
0
13 Oct 2021
Learning to Coordinate in Multi-Agent Systems: A Coordinated Actor-Critic Algorithm and Finite-Time Guarantees
Siliang Zeng
Tianyi Chen
Alfredo García
Mingyi Hong
52
11
0
11 Oct 2021
Learning a subspace of policies for online adaptation in Reinforcement Learning
Jean-Baptiste Gaya
Laure Soulier
Ludovic Denoyer
OffRL
40
15
0
11 Oct 2021
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
Tianwei Ni
Benjamin Eysenbach
Ruslan Salakhutdinov
26
103
0
11 Oct 2021
Medical Dead-ends and Learning to Identify High-risk States and Treatments
Mehdi Fatemi
Taylor W. Killian
J. Subramanian
Marzyeh Ghassemi
OffRL
36
37
0
08 Oct 2021
No-Press Diplomacy from Scratch
A. Bakhtin
David J. Wu
Adam Lerer
Noam Brown
100
42
0
06 Oct 2021
Colmena: Scalable Machine-Learning-Based Steering of Ensemble Simulations for High Performance Computing
Logan T. Ward
Ganesh Sivaraman
J. G. Pauloski
Y. Babuji
Ryan Chard
...
R. Assary
Kyle Chard
L. Curtiss
R. Thakur
Ian Foster
27
39
0
06 Oct 2021
CARL: A Benchmark for Contextual and Adaptive Reinforcement Learning
C. Benjamins
Theresa Eimer
Frederik Schubert
André Biedenkapp
Bodo Rosenhahn
Frank Hutter
Marius Lindauer
OffRL
41
23
0
05 Oct 2021
Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley Values
Alexandre Heuillet
Fabien Couthouis
Natalia Díaz Rodríguez
27
57
0
04 Oct 2021
Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations
Chi Zhang
S. Kuppannagari
Viktor Prasanna
OffRL
21
7
0
03 Oct 2021
An Unsupervised Video Game Playstyle Metric via State Discretization
Chiu-Chou Lin
W. Chiu
I-Chen Wu
16
3
0
03 Oct 2021
Batch size-invariance for policy optimization
Jacob Hilton
K. Cobbe
John Schulman
27
11
0
01 Oct 2021
Divergence-Regularized Multi-Agent Actor-Critic
Kefan Su
Zongqing Lu
46
25
0
01 Oct 2021
Genealogical Population-Based Training for Hyperparameter Optimization
Antoine Scardigli
P. Fournier
Matteo Vilucchio
D. Naccache
GP
22
0
0
30 Sep 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
67
55
0
28 Sep 2021
Faster Improvement Rate Population Based Training
Valentin Dalibard
Max Jaderberg
25
10
0
28 Sep 2021
Learning to Superoptimize Real-world Programs
Alex Shypula
Pengcheng Yin
Jeremy Lacomis
Claire Le Goues
Edward N. Schwartz
Graham Neubig
NAI
121
10
0
28 Sep 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktäschel
OffRL
238
89
0
27 Sep 2021
Applying supervised and reinforcement learning methods to create neural-network-based agents for playing StarCraft II
Michal Opanowicz
23
0
0
26 Sep 2021
Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients
Baturay Saglam
Furkan B. Mutlu
Dogan C. Cicek
Suleyman Serdar Kozat
OffRL
22
3
0
24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
21
30
0
24 Sep 2021
On Bonus-Based Exploration Methods in the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
24
58
0
22 Sep 2021
Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods
Baturay Saglam
Enes Duran
Dogan C. Cicek
Furkan B. Mutlu
Suleyman Serdar Kozat
OffRL
50
12
0
22 Sep 2021
Autonomous Blimp Control using Deep Reinforcement Learning
Y. Liu
Eric Price
Pascal Goldschmid
Michael J. Black
Aamir Ahmad
AI4CE
32
3
0
22 Sep 2021
Learning Natural Language Generation from Scratch
Alice Martin Donati
Guillaume Quispe
Charles Ollion
Sylvain Le Corff
Florian Strub
Olivier Pietquin
LRM
36
4
0
20 Sep 2021
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Chris Cummins
Bram Wasti
Jiadong Guo
Brandon Cui
Jason Ansel
...
Jia-Wei Liu
O. Teytaud
Benoit Steiner
Yuandong Tian
Hugh Leather
35
69
0
17 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
37
77
0
16 Sep 2021
Direct Advantage Estimation
Hsiao-Ru Pan
Nico Gürtler
Alexander Neitz
Bernhard Schölkopf
OffRL
CML
14
11
0
13 Sep 2021
An Empirical Comparison of Off-policy Prediction Learning Algorithms in the Four Rooms Environment
Sina Ghiassian
R. Sutton
AAML
OffRL
21
6
0
10 Sep 2021
Bootstrapped Meta-Learning
Sebastian Flennerhag
Yannick Schroecker
Tom Zahavy
Hado van Hasselt
David Silver
Satinder Singh
43
59
0
09 Sep 2021