Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.01561
Cited By
v1
v2
v3 (latest)
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 1,000 papers shown
Title
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
Shihan Deng
Weikai Xu
Hongda Sun
Wei Liu
Tao Tan
...
Ang Li
Jian Luan
Bin Wang
Rui Yan
Shuo Shang
LLMAG
96
21
0
01 Jul 2024
SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning
Matthias Weissenbacher
Rishabh Agarwal
Yoshinobu Kawahara
OffRL
64
1
0
21 Jun 2024
Advantage Alignment Algorithms
Juan Agustin Duque
Milad Aghajohari
Tim Cooijmans
Tianyu Zhang
Rameswar Panda
Gauthier Gidel
Aaron Courville
84
2
0
20 Jun 2024
Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning
Max Weltevrede
Felix Kaubek
M. Spaan
Wendelin Bohmer
109
0
0
12 Jun 2024
PufferLib: Making Reinforcement Learning Libraries and Environments Play Nice
Joseph Suarez
AI4CE
97
4
0
11 Jun 2024
World Models with Hints of Large Language Models for Goal Achieving
Zeyuan Liu
Ziyu Huan
Xiyao Wang
Jiafei Lyu
Jian Tao
Xiu Li
Furong Huang
Huazhe Xu
LM&Ro
LRM
AI4CE
83
2
0
11 Jun 2024
Massively Multiagent Minigames for Training Generalist Agents
Kyoung Whan Choe
Ryan Sullivan
Joseph Suárez
AI4CE
73
0
0
07 Jun 2024
Transductive Off-policy Proximal Policy Optimization
Yaozhong Gan
Renye Yan
Xiaoyang Tan
Zhe Wu
Junliang Xing
OffRL
74
3
0
06 Jun 2024
A Bayesian Approach to Online Planning
Nir Greshler
David Ben-Eli
Carmel Rabinovitz
Gabi Guetta
Liran Gispan
Guy Zohar
Aviv Tamar
33
0
0
04 Jun 2024
Learning-based legged locomotion; state of the art and future perspectives
Sehoon Ha
Joonho Lee
M. van de Panne
Zhaoming Xie
Wenhao Yu
Majid Khadiv
147
20
0
03 Jun 2024
FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning
Yuwei Fu
Haichao Zhang
Di Wu
Wei Xu
Benoit Boulet
VLM
123
15
0
02 Jun 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
176
0
0
29 May 2024
Highway Reinforcement Learning
Yuhui Wang
M. Strupl
Francesco Faccio
Qingyuan Wu
Haozhe Liu
Michal Grudzieñ
Xiaoyang Tan
Jürgen Schmidhuber
OffRL
73
4
0
28 May 2024
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement Learning
Aneesh Muppidi
Zhiyu Zhang
Heng Yang
78
6
0
26 May 2024
Pausing Policy Learning in Non-stationary Reinforcement Learning
Hyunin Lee
Ming Jin
Javad Lavaei
Somayeh Sojoudi
OffRL
117
2
0
25 May 2024
Neural Network Compression for Reinforcement Learning Tasks
Dmitry A. Ivanov
D. Larionov
Oleg V. Maslennikov
V. Voevodin
OffRL
AI4CE
83
2
0
13 May 2024
A Methodology-Oriented Study of Catastrophic Forgetting in Incremental Deep Neural Networks
Ashutosh Kumar
Sonali Agarwal
D. J. Hemanth
74
0
0
11 May 2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. DÓro
Evgenii Nikishin
Rameswar Panda
101
1
0
07 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGen
LM&Ro
176
48
0
06 May 2024
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
Wenjia Meng
Qian Zheng
Long Yang
Yilong Yin
Gang Pan
OffRL
84
0
0
04 May 2024
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure
Zhicheng Zhang
Yancheng Liang
Yi Wu
Fei Fang
72
2
0
01 May 2024
A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning
Zun Li
Michael P. Wellman
75
1
0
30 Apr 2024
Shared learning of powertrain control policies for vehicle fleets
Lindsey Kerbel
B. Ayalew
Andrej Ivanco
74
1
0
27 Apr 2024
Learning to Beat ByteRL: Exploitability of Collectible Card Game Agents
Radovan Haluška
Martin Schmid
LLMAG
86
0
0
25 Apr 2024
Impedance Matching: Enabling an RL-Based Running Jump in a Quadruped Robot
Neil Guan
Shangqun Yu
Shifan Zhu
Donghyun Kim
92
0
0
23 Apr 2024
Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning
Hui Bai
Ran Cheng
88
6
0
12 Apr 2024
GO4Align: Group Optimization for Multi-Task Alignment
Jiayi Shen
Cheems Wang
Zehao Xiao
Nanne van Noord
M. Worring
93
6
0
09 Apr 2024
Securing the Skies: An IRS-Assisted AoI-Aware Secure Multi-UAV System with Efficient Task Offloading
Poorvi Joshi
Alakesh Kalita
Gurusamy Mohan
39
0
0
06 Apr 2024
Compressed Federated Reinforcement Learning with a Generative Model
Ali Beikmohammadi
Sarit Khirirat
Sindri Magnússon
FedML
125
3
0
26 Mar 2024
EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
Abhaysinh Zala
Jaemin Cho
Han Lin
Jaehong Yoon
Mohit Bansal
91
13
0
18 Mar 2024
Scaling Instructable Agents Across Many Simulated Worlds
Sima Team
Maria Abi Raad
Arun Ahuja
Catarina Barros
F. Besse
...
Daan Wierstra
Duncan Williams
Nathaniel Wong
Sarah York
Nick Young
LM&Ro
210
42
0
13 Mar 2024
Mastering Memory Tasks with World Models
Mohammad Reza Samsami
Artem Zholus
Janarthanan Rajendran
Sarath Chandar
CLL
OffRL
102
28
0
07 Mar 2024
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Jesse Farebrother
Jordi Orbay
Q. Vuong
Adrien Ali Taïga
Yevgen Chebotar
...
Sergey Levine
Pablo Samuel Castro
Aleksandra Faust
Aviral Kumar
Rishabh Agarwal
OffRL
105
67
0
06 Mar 2024
Scalable Volt-VAR Optimization using RLlib-IMPALA Framework: A Reinforcement Learning Approach
Alaa Selim
Yanzhu Ye
Junbo Zhao
Bo Yang
27
0
0
24 Feb 2024
Skill or Luck? Return Decomposition via Advantage Functions
Hsiao-Ru Pan
Bernhard Schölkopf
OffRL
45
5
0
20 Feb 2024
In value-based deep reinforcement learning, a pruned network is a good network
J. Obando-Ceron
Rameswar Panda
Pablo Samuel Castro
OffRL
121
26
0
19 Feb 2024
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Quentin Gallouedec
E. Beeching
Clément Romac
Emmanuel Dellandrea
47
11
0
15 Feb 2024
Mixtures of Experts Unlock Parameter Scaling for Deep RL
J. Obando-Ceron
Ghada Sokar
Timon Willi
Clare Lyle
Jesse Farebrother
Jakob N. Foerster
Gintare Karolina Dziugaite
Doina Precup
Pablo Samuel Castro
180
43
0
13 Feb 2024
NavFormer: A Transformer Architecture for Robot Target-Driven Navigation in Unknown and Dynamic Environments
Haitong Wang
Aaron Hao Tan
G. Nejat
102
14
0
09 Feb 2024
Off-policy Distributional Q(
λ
λ
λ
): Distributional RL without Importance Sampling
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Avila-Pires
Will Dabney
OffRL
60
1
0
08 Feb 2024
Private Knowledge Sharing in Distributed Learning: A Survey
Yasas Supeksala
Dinh C. Nguyen
Ming Ding
Thilina Ranbaduge
Calson Chua
Jun Zhang
Jun Li
H. Vincent Poor
96
0
0
08 Feb 2024
A computational approach to visual ecology with deep reinforcement learning
Sacha Sokoloski
Jure Majnik
Philipp Berens
25
0
0
07 Feb 2024
Learning Diverse Policies with Soft Self-Generated Guidance
Guojian Wang
Faguo Wu
Xiao Zhang
Jianxiang Liu
OffRL
63
4
0
07 Feb 2024
Just Cluster It: An Approach for Exploration in High-Dimensions using Clustering and Pre-Trained Representations
Stefan Sylvius Wagner
Stefan Harmeling
65
2
0
05 Feb 2024
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Shengyi Huang
Quentin Gallouedec
Florian Felten
Antonin Raffin
Rousslan Fernand Julien Dossa
...
Alexander Nikulin
Xiao Hu
Tianlin Liu
Jongwook Choi
Brent Yi
OffRL
91
10
0
05 Feb 2024
Augmenting Replay in World Models for Continual Reinforcement Learning
Luke Yang
L. Kuhlmann
Gideon Kowadlo
VLM
KELM
CLL
OffRL
148
0
0
30 Jan 2024
Zero-shot Imitation Policy via Search in Demonstration Dataset
Federico Malato
Florian Leopold
Andrew Melnik
Ville Hautamaki
LM&Ro
OffRL
48
7
0
29 Jan 2024
Visual Imitation Learning with Calibrated Contrastive Representation
Yunke Wang
Linwei Tao
Bo Du
Yutian Lin
Chang Xu
68
0
0
21 Jan 2024
Reinforcement Learning from LLM Feedback to Counteract Goal Misgeneralization
Houda Nait El Barj
Théophile Sautory
106
2
0
14 Jan 2024
Scaling Is All You Need: Autonomous Driving with JAX-Accelerated Reinforcement Learning
Moritz Harmel
Anubhav Paras
Andreas Pasternak
Nicholas Roy
Gary Linscott
LRM
98
1
0
23 Dec 2023
Previous
1
2
3
4
5
6
...
18
19
20
Next