arXiv:1802.01561
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Papers citing
"IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 982 papers shown
Title
Semantic Exploration from Language Abstractions and Pretrained Representations
Allison C. Tam
Neil C. Rabinowitz
Andrew Kyle Lampinen
Nicholas A. Roy
Stephanie C. Y. Chan
D. Strouse
Jane X. Wang
Andrea Banino
Felix Hill
LM&Ro
44
68
0
08 Apr 2022
Federated Reinforcement Learning with Environment Heterogeneity
Hao Jin
Yang Peng
Wenhao Yang
Shusen Wang
Zhihua Zhang
65
68
0
06 Apr 2022
Imitate and Repurpose: Learning Reusable Robot Movement Skills From Human and Animal Behaviors
Steven Bohez
S. Tunyasuvunakool
Philemon Brakel
Fereshteh Sadeghi
Leonard Hasenclever
...
Nathan Batchelor
Federico Casarini
J. Merel
R. Hadsell
N. Heess
43
51
0
31 Mar 2022
PerfectDou: Dominating DouDizhu with Perfect Information Distillation
Yang Guan
Minghuan Liu
Weijun Hong
Weinan Zhang
Fei Fang
Guangjun Zeng
Yue Lin
33
26
0
30 Mar 2022
Marginalized Operators for Off-policy Reinforcement Learning
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
32
0
0
30 Mar 2022
Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Yufeng Yuan
Rupam Mahmood
OffRL
36
19
0
23 Mar 2022
Insights From the NeurIPS 2021 NetHack Challenge
Eric Hambro
Sharada Mohanty
Dmitrii Babaev
Mi-Ra Byeon
Dipam Chakraborty
...
Dan Rothermel
Mikayel Samvelyan
Dmitry Sorokin
Maciej Sypetkowski
Michal Sypetkowski
25
19
0
22 Mar 2022
Tactile Pose Estimation and Policy Learning for Unknown Object Manipulation
Tarik Kelestemur
Robert Platt
T. Padır
33
32
0
21 Mar 2022
Symmetry-Based Representations for Artificial and Biological General Intelligence
I. Higgins
S. Racanière
Danilo Jimenez Rezende
AI4CE
39
44
0
17 Mar 2022
Zipfian environments for Reinforcement Learning
Stephanie C. Y. Chan
Andrew Kyle Lampinen
Pierre Harvey Richemond
Felix Hill
OffRL
17
15
0
15 Mar 2022
Switch Trajectory Transformer with Distributional Value Approximation for Multi-Task Reinforcement Learning
Qinjie Lin
Han Liu
B. Sengupta
OffRL
32
11
0
14 Mar 2022
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
41
226
0
09 Mar 2022
The Unsurprising Effectiveness of Pre-Trained Vision Models for Control
Simone Parisi
Aravind Rajeswaran
Senthil Purushwalkam
Abhinav Gupta
LM&Ro
36
187
0
07 Mar 2022
Hierarchically Structured Scheduling and Execution of Tasks in a Multi-Agent Environment
Diogo S. Carvalho
B. Sengupta
33
2
0
06 Mar 2022
AutoDIME: Automatic Design of Interesting Multi-Agent Environments
I. Kanitscheider
Harrison Edwards
32
0
0
04 Mar 2022
Avalanche RL: a Continual Reinforcement Learning Library
Nicolo Lucchesi
Antonio Carta
Vincenzo Lomonaco
Davide Bacciu
42
6
0
28 Feb 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
22
9
0
24 Feb 2022
Improving Intrinsic Exploration with Language Abstractions
Jesse Mu
Victor Zhong
Roberta Raileanu
Minqi Jiang
Noah D. Goodman
Tim Rocktäschel
Edward Grefenstette
103
63
0
17 Feb 2022
MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned
Anssi Kanervisto
Stephanie Milani
Karolis Ramanauskas
Nicholay Topin
Zichuan Lin
...
François Fleuret
Alexander Nikulin
Yury Belousov
Oleg Svidchenko
A. Shpilman
OffRL
60
32
0
17 Feb 2022
Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms
Romain Laroche
Rémi Tachet des Combes
51
2
0
15 Feb 2022
Compute Trends Across Three Eras of Machine Learning
J. Sevilla
Lennart Heim
A. Ho
T. Besiroglu
Marius Hobbhahn
Pablo Villalobos
39
272
0
11 Feb 2022
A Modern Self-Referential Weight Matrix That Learns to Modify Itself
Kazuki Irie
Imanol Schlag
Róbert Csordás
Jürgen Schmidhuber
21
26
0
11 Feb 2022
PRIMA: Planner-Reasoner Inside a Multi-task Reasoning Agent
Daoming Lyu
Bo Liu
Jianshu Chen
LRM
38
1
0
01 Feb 2022
Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary Strategies
Carlos Güemes-Palau
Paul Almasan
Shihan Xiao
Xiangle Cheng
Xiang Shi
Pere Barlet-Ros
A. Cabellos-Aparicio
37
9
0
01 Feb 2022
You May Not Need Ratio Clipping in PPO
Mingfei Sun
Vitaly Kurin
Guoqing Liu
Sam Devlin
Tao Qin
Katja Hofmann
Shimon Whiteson
21
15
0
31 Jan 2022
DeepRNG: Towards Deep Reinforcement Learning-Assisted Generative Testing of Software
Chuan-Yung Tsai
Graham W. Taylor
11
2
0
29 Jan 2022
Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation
Martín Bertrán
Walter A. Talbott
Nitish Srivastava
J. Susskind
50
3
0
28 Jan 2022
Leveraging class abstraction for commonsense reinforcement learning via residual policy gradient methods
Niklas Höpner
Ilaria Tiddi
H. V. Hoof
42
3
0
28 Jan 2022
Chaining Value Functions for Off-Policy Learning
Simon Schmitt
John Shawe-Taylor
Hado van Hasselt
OffRL
28
2
0
17 Jan 2022
Weakly Supervised Scene Text Detection using Deep Reinforcement Learning
Emanuel Metzenthin
Christian Bartz
Christoph Meinel
OffRL
38
2
0
13 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
38
100
0
11 Jan 2022
The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models
Alexander Pan
Kush S. Bhatia
Jacob Steinhardt
58
172
0
10 Jan 2022
Hidden Agenda: a Social Deduction Game with Diverse Learned Equilibria
Kavya Kopparapu
Edgar A. Duéñez-Guzmán
Jayd Matyas
A. Vezhnevets
J. Agapiou
Kevin R. McKee
Richard Everett
J. Marecki
Joel Z. Leibo
T. Graepel
22
7
0
05 Jan 2022
Evaluating the Robustness of Deep Reinforcement Learning for Autonomous Policies in a Multi-agent Urban Driving Environment
Aizaz Sharif
D. Marijan
24
5
0
22 Dec 2021
Graph augmented Deep Reinforcement Learning in the GameRLand3D environment
E. Beeching
Maxim Peter
Philippe Marcotte
Jilles Dibangoye
Olivier Simonin
Joshua Romoff
Christian Wolf
16
5
0
22 Dec 2021
Feature-Attending Recurrent Modules for Generalization in Reinforcement Learning
Wilka Carvalho
Andrew Kyle Lampinen
Kyriacos Nikiforou
Felix Hill
Murray Shanahan
OffRL
45
0
0
15 Dec 2021
Learning Generalizable Behavior via Visual Rewrite Rules
Yiheng Xie
Mingxuan Li
Shangqun Yu
Michael Littman
DRL
19
1
0
09 Dec 2021
CoMPS: Continual Meta Policy Search
Glen Berseth
Zhiwei Zhang
Grace Zhang
Chelsea Finn
Sergey Levine
CLL
OffRL
33
16
0
08 Dec 2021
A Review for Deep Reinforcement Learning in Atari: Benchmarks, Challenges, and Solutions
Jiajun Fan
OffRL
34
19
0
08 Dec 2021
Tell me why! Explanations support learning relational and causal structure
Andrew Kyle Lampinen
Nicholas A. Roy
Ishita Dasgupta
Stephanie C. Y. Chan
Allison C. Tam
...
Chen Yan
Adam Santoro
Neil C. Rabinowitz
Jane X. Wang
Felix Hill
40
45
0
07 Dec 2021
Godot Reinforcement Learning Agents
E. Beeching
Jilles Dibangoye
Olivier Simonin
Christian Wolf
GP
OnRL
24
5
0
07 Dec 2021
Combining Learning from Human Feedback and Knowledge Engineering to Solve Hierarchical Tasks in Minecraft
Vinicius G. Goecks
Nicholas R. Waytowich
David Watkins
Bharat Prakash
18
7
0
07 Dec 2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Linghui Meng
Muning Wen
Yaodong Yang
Chenyang Le
Xiyun Li
Weinan Zhang
Ying Wen
Haifeng Zhang
Jun Wang
Bo Xu
OffRL
31
38
0
06 Dec 2021
Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
Bogdan Mazoure
Ilya Kostrikov
Ofir Nachum
Jonathan Tompson
OffRL
51
21
0
29 Nov 2021
Interesting Object, Curious Agent: Learning Task-Agnostic Exploration
Simone Parisi
Victoria Dean
Deepak Pathak
Abhinav Gupta
LM&Ro
44
50
0
25 Nov 2021
How does AI play football? An analysis of RL and real-world football strategies
Atom Scott
Keisuke Fujii
Masaki Onishi
78
13
0
24 Nov 2021
Off-Policy Correction For Multi-Agent Reinforcement Learning
Michał Zawalski
Błażej Osiński
Henryk Michalewski
Piotr Miłoś
OffRL
32
2
0
22 Nov 2021
Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari
Dominik Schmidt
Thomas Schmied
OffRL
28
12
0
19 Nov 2021
AI in Human-computer Gaming: Techniques, Challenges and Opportunities
Qiyue Yin
Jun Yang
Kaiqi Huang
Meijing Zhao
Wancheng Ni
Bin Liang
Yan Huang
Shu Wu
Liangsheng Wang
23
20
0
15 Nov 2021
RLOps: Development Life-cycle of Reinforcement Learning Aided Open RAN
Peizheng Li
Jonathan D. Thomas
Xiaoyang Wang
Ahmed Khalil
A. Ahmad
...
S. Kapoor
Arjun Parekh
A. Doufexi
Arman Shojaeifard
Robert Piechocki
AI4TS
16
37
0
12 Nov 2021