1802.01561
Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Papers citing
"IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 981 papers shown
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure
Zhicheng Zhang
Yancheng Liang
Yi Wu
Fei Fang
31
2
0
01 May 2024
A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning
Zun Li
Michael P. Wellman
42
1
0
30 Apr 2024
Shared learning of powertrain control policies for vehicle fleets
Lindsey Kerbel
B. Ayalew
Andrej Ivanco
31
0
0
27 Apr 2024
Learning to Beat ByteRL: Exploitability of Collectible Card Game Agents
Radovan Haluška
Martin Schmid
LLMAG
45
0
0
25 Apr 2024
Impedance Matching: Enabling an RL-Based Running Jump in a Quadruped Robot
Neil Guan
Shangqun Yu
Shifan Zhu
Donghyun Kim
37
0
0
23 Apr 2024
Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning
Hui Bai
Ran Cheng
55
4
0
12 Apr 2024
GO4Align: Group Optimization for Multi-Task Alignment
Jiayi Shen
Cheems Wang
Zehao Xiao
Nanne van Noord
M. Worring
37
4
0
09 Apr 2024
Securing the Skies: An IRS-Assisted AoI-Aware Secure Multi-UAV System with Efficient Task Offloading
Poorvi Joshi
Alakesh Kalita
Gurusamy Mohan
32
0
0
06 Apr 2024
Compressed Federated Reinforcement Learning with a Generative Model
Ali Beikmohammadi
Sarit Khirirat
Sindri Magnússon
FedML
39
2
0
26 Mar 2024
EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
Abhaysinh Zala
Jaemin Cho
Han Lin
Jaehong Yoon
Mohit Bansal
41
13
0
18 Mar 2024
Scaling Instructable Agents Across Many Simulated Worlds
Sima Team
Maria Abi Raad
Arun Ahuja
Catarina Barros
F. Besse
...
Daan Wierstra
Duncan Williams
Nathaniel Wong
Sarah York
Nick Young
LM&Ro
115
39
0
13 Mar 2024
Mastering Memory Tasks with World Models
Mohammad Reza Samsami
Artem Zholus
Janarthanan Rajendran
Sarath Chandar
CLL
OffRL
34
23
0
07 Mar 2024
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Jesse Farebrother
Jordi Orbay
Q. Vuong
Adrien Ali Taïga
Yevgen Chebotar
...
Sergey Levine
Pablo Samuel Castro
Aleksandra Faust
Aviral Kumar
Rishabh Agarwal
OffRL
61
57
0
06 Mar 2024
Scalable Volt-VAR Optimization using RLlib-IMPALA Framework: A Reinforcement Learning Approach
Alaa Selim
Yanzhu Ye
Junbo Zhao
Bo Yang
19
0
0
24 Feb 2024
Skill or Luck? Return Decomposition via Advantage Functions
Hsiao-Ru Pan
Bernhard Schölkopf
OffRL
25
3
0
20 Feb 2024
In value-based deep reinforcement learning, a pruned network is a good network
J. Obando-Ceron
Rameswar Panda
Pablo Samuel Castro
OffRL
48
18
0
19 Feb 2024
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Quentin Gallouedec
E. Beeching
Clément Romac
Emmanuel Dellandrea
26
11
0
15 Feb 2024
Mixtures of Experts Unlock Parameter Scaling for Deep RL
J. Obando-Ceron
Ghada Sokar
Timon Willi
Clare Lyle
Jesse Farebrother
Jakob N. Foerster
Gintare Karolina Dziugaite
Doina Precup
Pablo Samuel Castro
63
31
0
13 Feb 2024
NavFormer: A Transformer Architecture for Robot Target-Driven Navigation in Unknown and Dynamic Environments
Haitong Wang
Aaron Hao Tan
G. Nejat
49
12
0
09 Feb 2024
Off-policy Distributional Q(λ): Distributional RL without Importance Sampling
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Avila-Pires
Will Dabney
OffRL
15
1
0
08 Feb 2024
Private Knowledge Sharing in Distributed Learning: A Survey
Yasas Supeksala
Dinh C. Nguyen
Ming Ding
Thilina Ranbaduge
Calson Chua
Jun Zhang
Jun Li
H. Vincent Poor
49
0
0
08 Feb 2024
A computational approach to visual ecology with deep reinforcement learning
Sacha Sokoloski
Jure Majnik
Philipp Berens
11
0
0
07 Feb 2024
Learning Diverse Policies with Soft Self-Generated Guidance
Guojian Wang
Faguo Wu
Xiao Zhang
Jianxiang Liu
OffRL
31
4
0
07 Feb 2024
Just Cluster It: An Approach for Exploration in High-Dimensions using Clustering and Pre-Trained Representations
Stefan Sylvius Wagner
Stefan Harmeling
29
2
0
05 Feb 2024
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Shengyi Huang
Quentin Gallouedec
Florian Felten
Antonin Raffin
Rousslan Fernand Julien Dossa
...
Alexander Nikulin
Xiao Hu
Tianlin Liu
Jongwook Choi
Brent Yi
OffRL
29
8
0
05 Feb 2024
Augmenting Replay in World Models for Continual Reinforcement Learning
Luke Yang
L. Kuhlmann
Gideon Kowadlo
VLM
KELM
CLL
OffRL
44
0
0
30 Jan 2024
Zero-shot Imitation Policy via Search in Demonstration Dataset
Federico Malato
Florian Leopold
Andrew Melnik
Ville Hautamaki
LM&Ro
OffRL
26
6
0
29 Jan 2024
Visual Imitation Learning with Calibrated Contrastive Representation
Yunke Wang
Linwei Tao
Bo Du
Yutian Lin
Chang Xu
28
0
0
21 Jan 2024
Reinforcement Learning from LLM Feedback to Counteract Goal Misgeneralization
Houda Nait El Barj
Théophile Sautory
27
2
0
14 Jan 2024
Scaling Is All You Need: Autonomous Driving with JAX-Accelerated Reinforcement Learning
Moritz Harmel
Anubhav Paras
Andreas Pasternak
Nicholas Roy
Gary Linscott
LRM
21
1
0
23 Dec 2023
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
35
26
0
19 Dec 2023
Learning to Act without Actions
Dominik Schmidt
Minqi Jiang
OffRL
34
31
0
17 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
36
8
0
15 Dec 2023
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
Hao Li
Xue Yang
Zhaokai Wang
Xizhou Zhu
Jie Zhou
Yu Qiao
Xiaogang Wang
Hongsheng Li
Lewei Lu
Jifeng Dai
43
32
0
14 Dec 2023
The Effective Horizon Explains Deep RL Performance in Stochastic Environments
Cassidy Laidlaw
Banghua Zhu
Stuart J. Russell
Anca Dragan
36
2
0
13 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
80
5
0
13 Dec 2023
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Jing Hou
Guang Chen
Ruiqi Zhang
Zhijun Li
Shangding Gu
Changjun Jiang
OffRL
32
2
0
11 Dec 2023
Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding
Talfan Evans
Shreya Pathak
Hamza Merzic
Jonathan Schwarz
Ryutaro Tanno
Olivier J. Hénaff
23
16
0
08 Dec 2023
Efficient Parallel Reinforcement Learning Framework using the Reactor Model
Jacky Kwok
Marten Lohstroh
Edward A. Lee
26
0
0
07 Dec 2023
Colour versus Shape Goal Misgeneralization in Reinforcement Learning: A Case Study
Karolis Ramanauskas
Özgür Simsek
29
0
0
05 Dec 2023
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games
Lukas Schäfer
Logan Jones
Anssi Kanervisto
Yuhan Cao
Tabish Rashid
Raluca Georgescu
David Bignell
Siddhartha Sen
Andrea Trevino Gavito
Sam Devlin
93
3
0
04 Dec 2023
Harnessing Discrete Representations For Continual Reinforcement Learning
Edan Meyer
Adam White
Marlos C. Machado
OffRL
46
4
0
02 Dec 2023
Handling Cost and Constraints with Off-Policy Deep Reinforcement Learning
Jared Markowitz
Jesse Silverberg
Gary Collins
OffRL
23
0
0
30 Nov 2023
Replay across Experiments: A Natural Extension of Off-Policy RL
Dhruva Tirumala
Thomas Lampe
José Enrique Chen
Tuomas Haarnoja
Sandy Huang
...
Tim Hertweck
Leonard Hasenclever
Martin Riedmiller
N. Heess
Markus Wulfmeier
OffRL
40
8
0
27 Nov 2023
Agent as Cerebrum, Controller as Cerebellum: Implementing an Embodied LMM-based Agent on Drones
Haoran Zhao
Fengxing Pan
Huqiuyue Ping
Yaoming Zhou
AI4CE
50
12
0
25 Nov 2023
Probabilistic Inference in Reinforcement Learning Done Right
Jean Tarbouriech
Tor Lattimore
Brendan O'Donoghue
BDL
OffRL
30
4
0
22 Nov 2023
minimax: Efficient Baselines for Autocurricula in JAX
Minqi Jiang
Michael Dennis
Edward Grefenstette
Tim Rocktaschel
27
8
0
21 Nov 2023
On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling
Nicholas Corrado
Josiah P. Hanna
OffRL
20
1
0
14 Nov 2023
An introduction to reinforcement learning for neuroscience
Kristopher T. Jensen
OOD
OffRL
AI4CE
36
1
0
13 Nov 2023
Towards Continual Reinforcement Learning for Quadruped Robots
G. Minelli
V. Vassiliades
CLL
33
1
0
12 Nov 2023