1802.01561
v3 (latest)
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 1,000 papers shown
Title
Self-Activating Neural Ensembles for Continual Reinforcement Learning
Sam Powers
Eliot Xing
Abhinav Gupta
KELM
CLL
86
5
0
31 Dec 2022
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya Zhang
Yixin Chen
Dacheng Tao
OffRL
148
30
0
29 Dec 2022
Behavioral Cloning via Search in Video PreTraining Latent Space
Federico Malato
Florian Leopold
Amogh Raut
Ville Hautamaki
Andrew Melnik
LM&Ro
40
10
0
27 Dec 2022
Learning Generalizable Representations for Reinforcement Learning via Adaptive Meta-learner of Behavioral Similarities
Jianda Chen
Sinno Jialin Pan
SSL
56
6
0
26 Dec 2022
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective
Ying Wen
Bo Liu
M. Zhou
Shufang Hou
Zhe Cao
Chenyang Le
Jingxiao Chen
Zheng Tian
Weinan Zhang
Jun Wang
AI4CE
73
11
0
24 Dec 2022
Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Yiren Lu
Justin Fu
George Tucker
Xinlei Pan
Eli Bronstein
...
Brandyn White
Aleksandra Faust
Shimon Whiteson
Drago Anguelov
Sergey Levine
OffRL
111
97
0
21 Dec 2022
Lifelong Reinforcement Learning with Modulating Masks
Eseoghene Ben-Iwhiwhu
Saptarshi Nath
Praveen K. Pilly
Soheil Kolouri
Andrea Soltoggio
CLL
OffRL
100
23
0
21 Dec 2022
Offline Reinforcement Learning for Visual Navigation
Dhruv Shah
Arjun Bhorkar
Hrish Leen
Ilya Kostrikov
Nicholas Rhinehart
Sergey Levine
OffRL
61
30
0
16 Dec 2022
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning
Zhao Mandi
Homanga Bharadhwaj
Vincent Moens
Shuran Song
Aravind Rajeswaran
Vikash Kumar
LM&Ro
115
77
0
12 Dec 2022
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Hsin-En Su
Yen-Ju Chen
Ping-Chun Hsieh
Xi Liu
OffRL
70
0
0
10 Dec 2022
System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games
Indranil Sur
Z. Daniels
Abrar Rahman
Kamil Faber
Gianmarco J. Gallardo
...
Roberto Corizzo
Ajay Divakaran
M. Piacentino
Jesse Hostetler
Aswin Raghavan
CLL
OffRL
78
4
0
08 Dec 2022
Launchpad: Learning to Schedule Using Offline and Online RL Methods
V. Venkataswamy
J. E. Grigsby
A. Grimshaw
Yanjun Qi
OffRL
OnRL
63
1
0
01 Dec 2022
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
76
13
0
01 Dec 2022
Multi-Task Imitation Learning for Linear Dynamical Systems
Thomas T. Zhang
Katie Kang
Bruce D. Lee
Claire Tomlin
Sergey Levine
Stephen Tu
Nikolai Matni
117
24
0
01 Dec 2022
ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format
Qi Zhu
Christian Geishauser
Hsien-Chin Lin
Carel van Niekerk
Baolin Peng
...
Dazhen Wan
Xiaochen Zhu
Jianfeng Gao
Milica Gašić
Minlie Huang
108
23
0
30 Nov 2022
The Effectiveness of World Models for Continual Reinforcement Learning
Samuel Kessler
M. Ostaszewski
Michal Bortkiewicz
M. Żarski
Maciej Wołczyk
Jack Parker-Holder
Stephen J. Roberts
Piotr Miłoś
KELM
OffRL
CLL
85
8
0
29 Nov 2022
Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes
Aviral Kumar
Rishabh Agarwal
Xinyang Geng
George Tucker
Sergey Levine
OffRL
137
51
0
28 Nov 2022
AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning
Hongjie Zhang
OffRL
40
0
0
28 Nov 2022
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duéñez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
124
34
0
24 Nov 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
108
0
0
23 Nov 2022
Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Josh Abramson
Arun Ahuja
Federico Carnevale
Petko Georgiev
Alex Goldin
...
Tamara von Glehn
Greg Wayne
Nathaniel Wong
Chen Yan
Rui Zhu
81
29
0
21 Nov 2022
Exploring through Random Curiosity with General Value Functions
Aditya A. Ramesh
Louis Kirsch
Sjoerd van Steenkiste
Jürgen Schmidhuber
110
10
0
18 Nov 2022
Explainability Via Causal Self-Talk
Nicholas A. Roy
Junkyung Kim
Neil C. Rabinowitz
CML
87
7
0
17 Nov 2022
Dynamic Collaborative Multi-Agent Reinforcement Learning Communication for Autonomous Drone Reforestation
P. D. Siedler
AI4CE
67
4
0
14 Nov 2022
Global and Local Analysis of Interestingness for Competency-Aware Deep Reinforcement Learning
Pedro Sequeira
Jesse Hostetler
Melinda Gervasio
51
0
0
11 Nov 2022
Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization
Burcu Küçükoğlu
Walraaf Borkent
Bodo Rueckauer
Nasir Ahmad
Umut Güçlü
Marcel van Gerven
102
2
0
11 Nov 2022
Foundation Models for Semantic Novelty in Reinforcement Learning
Tarun Gupta
Peter Karkus
Tong Che
Danfei Xu
Marco Pavone
VLM
OffRL
LRM
72
9
0
09 Nov 2022
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
130
13
0
08 Nov 2022
Curriculum-based Asymmetric Multi-task Reinforcement Learning
H. Huang
Deheng Ye
Li Shen
Wen Liu
88
14
0
07 Nov 2022
On learning history based policies for controlling Markov decision processes
Gandharv Patil
Aditya Mahajan
Doina Precup
OffRL
94
5
0
06 Nov 2022
Contrastive Value Learning: Implicit Models for Simple Offline RL
Bogdan Mazoure
Benjamin Eysenbach
Ofir Nachum
Jonathan Tompson
SSL
OffRL
112
9
0
03 Nov 2022
Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions
Alexey Skrynnik
Zoya Volovikova
Marc-Alexandre Côté
Anton Voronov
Artem Zholus
...
Milagro Teruel
Ahmed Hassan Awadallah
Aleksandr I. Panov
Andrey Kravchenko
Julia Kiseleva
LM&Ro
107
11
0
01 Nov 2022
Learning to Navigate Wikipedia by Taking Random Walks
Manzil Zaheer
Kenneth Marino
Will Grathwohl
John Schultz
Wendy Shang
Sheila Babayan
Arun Ahuja
Ishita Dasgupta
Christine Kaeser-Chen
Rob Fergus
52
5
0
31 Oct 2022
Towards Versatile Embodied Navigation
Hongru Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
100
25
0
30 Oct 2022
Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning
Ziluo Ding
Wanpeng Zhang
Junpeng Yue
Xiangjun Wang
Tiejun Huang
Zongqing Lu
LLMAG
AI4CE
37
5
0
25 Oct 2022
Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds
Joshua Albrecht
Abraham J. Fetterman
Bryden Fogelman
Ellie Kitanidis
Bartosz Wróblewski
...
Michael Rosenthal
Maksis Knutins
Zachary Polizzi
James B. Simon
Kanjun Qiu
OffRL
87
23
0
24 Oct 2022
Evaluating Long-Term Memory in 3D Mazes
J. Pašukonis
Timothy Lillicrap
Danijar Hafner
3DV
88
23
0
24 Oct 2022
Rethinking Value Function Learning for Generalization in Reinforcement Learning
Seungyong Moon
JunYeong Lee
Hyun Oh Song
OOD
OffRL
69
16
0
18 Oct 2022
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
72
2
0
18 Oct 2022
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments
Xi Chen
Tianyuan Shi
Qing Zhao
Yuchen Sun
Yunfei Gao
Xiangjun Wang
62
2
0
14 Oct 2022
A Scalable Finite Difference Method for Deep Reinforcement Learning
Matthew Allen
John C. Raisbeck
Hakho Lee
50
0
0
14 Oct 2022
Bootstrap Advantage Estimation for Policy Optimization in Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
OffRL
23
0
0
13 Oct 2022
Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts
Muhammed Murat Özbek
S. Yildirim
Muhammet Aksoy
Eric Kernin
E. Koyuncu
64
5
0
13 Oct 2022
Object-Category Aware Reinforcement Learning
Qi Yi
Rui Zhang
Shaohui Peng
Jiaming Guo
Xingui Hu
Zidong Du
Xishan Zhang
Qi Guo
Yunji Chen
CML
LRM
81
7
0
13 Oct 2022
Reinforcement Learning with Automated Auxiliary Loss Search
Tairan He
Yuge Zhang
Kan Ren
Minghuan Liu
Che Wang
Weinan Zhang
Yuqing Yang
Dongsheng Li
113
16
0
12 Oct 2022
Contrastive Retrospection: honing in on critical steps for rapid learning and generalization in RL
Chen Sun
Wannan Yang
Thomas Jiralerspong
Dane Malenfant
Benjamin Alsbury-Nealy
Yoshua Bengio
Blake A. Richards
OffRL
64
2
0
12 Oct 2022
Exploration via Elliptical Episodic Bonuses
Mikael Henaff
Roberta Raileanu
Minqi Jiang
Tim Rocktäschel
OffRL
112
41
0
11 Oct 2022
Discovered Policy Optimisation
Chris Xiaoxuan Lu
J. Kuba
Alistair Letcher
Luke Metz
Christian Schroeder de Witt
Jakob N. Foerster
OffRL
111
79
0
11 Oct 2022
LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
DaeJin Jo
Sungwoong Kim
D. W. Nam
Taehwan Kwon
Seungeun Rho
Jongmin Kim
Donghoon Lee
OffRL
73
10
0
11 Oct 2022
Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of Trials
Aviral Kumar
Anika Singh
F. Ebert
Mitsuhiko Nakamoto
Yanlai Yang
Chelsea Finn
Sergey Levine
OffRL
OnRL
218
71
0
11 Oct 2022