ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.01561
  4. Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
ArXivPDFHTML

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 981 papers shown
Title
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Michael Noukhovitch
Shengyi Huang
Sophie Xhonneux
Arian Hosseini
Rishabh Agarwal
Rameswar Panda
OffRL
85
6
0
23 Oct 2024
Towards Map-Agnostic Policies for Adaptive Informative Path Planning
Towards Map-Agnostic Policies for Adaptive Informative Path Planning
Julius Ruckin
David Morilla-Cabello
C. Stachniss
Eduardo Montijano
Marija Popović
38
0
0
22 Oct 2024
Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
Yanjun Chen
Jiahui Geng
Xianghui Wang
Zhiqiang Xu
Xiaoyu Shen
Wei Zhang
19
0
0
22 Oct 2024
TF-DDRL: A Transformer-enhanced Distributed DRL Technique for Scheduling
  IoT Applications in Edge and Cloud Computing Environments
TF-DDRL: A Transformer-enhanced Distributed DRL Technique for Scheduling IoT Applications in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
OffRL
35
3
0
18 Oct 2024
Mitigating Suboptimality of Deterministic Policy Gradients in Complex
  Q-functions
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Ayush Jain
Norio Kosaka
Xinhu Li
Kyung-Min Kim
Erdem Bıyık
Joseph J. Lim
OffRL
21
0
0
15 Oct 2024
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement
  Learning
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Hyunseung Kim
Jun Jet Tai
K. Subramanian
Peter R. Wurman
Jaegul Choo
Peter Stone
Takuma Seno
OffRL
78
7
0
13 Oct 2024
Retrieval-Augmented Decision Transformer: External Memory for In-context
  RL
Retrieval-Augmented Decision Transformer: External Memory for In-context RL
Thomas Schmied
Fabian Paischer
Vihang Patil
M. Hofmarcher
Razvan Pascanu
Sepp Hochreiter
OffRL
48
6
0
09 Oct 2024
Training Interactive Agent in Large FPS Game Map with Rule-enhanced
  Reinforcement Learning
Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning
Chen Zhang
Huan Hu
Yuan Zhou
Qiyang Cao
Ruochen Liu
Wenya Wei
Elvis S. Liu
AI4CE
27
0
0
07 Oct 2024
Breaking the mold: The challenge of large scale MARL specialization
Breaking the mold: The challenge of large scale MARL specialization
Stefan Juang
Hugh Cao
Arielle Zhou
Ruochen Liu
Nevin L. Zhang
Elvis Liu
26
1
0
03 Oct 2024
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Ghada Sokar
J. Obando-Ceron
Rameswar Panda
Hugo Larochelle
Pablo Samuel Castro
MoE
165
2
0
02 Oct 2024
Autonomous Network Defence using Reinforcement Learning
Autonomous Network Defence using Reinforcement Learning
Myles Foley
Chris Hicks
Kate Highnam
V. Mavroudis
AAML
21
29
0
26 Sep 2024
Exploring Semantic Clustering in Deep Reinforcement Learning for Video
  Games
Exploring Semantic Clustering in Deep Reinforcement Learning for Video Games
Liang Zhang
Justin Lieffers
A. Pyarelal
29
0
0
25 Sep 2024
The unknotting number, hard unknot diagrams, and reinforcement learning
The unknotting number, hard unknot diagrams, and reinforcement learning
Taylor Applebaum
Sam Blackwell
Alex Davies
Thomas Edlich
András Juhász
Marc Lackenby
Nenad Tomašev
Daniel Zheng
21
3
0
13 Sep 2024
DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with
  multi-fingered robots
DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots
Maria Bauzá
José Enrique Chen
Valentin Dalibard
Nimrod Gileadi
Roland Hafner
...
Martin Riedmiller
Jon Scholz
Konstantinos Bousmalis
Francesco Nori
Nicolas Heess
34
5
0
10 Sep 2024
Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Esraa Elelimy
Adam White
Michael Bowling
Martha White
OffRL
40
2
0
02 Sep 2024
Semantically Controllable Augmentations for Generalizable Robot Learning
Semantically Controllable Augmentations for Generalizable Robot Learning
Zoey Chen
Zhao Mandi
Homanga Bharadhwaj
Mohit Sharma
Shuran Song
Abhishek Gupta
Vikash Kumar
LM&Ro
37
5
0
02 Sep 2024
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
Anthony GX-Chen
Kenneth Marino
Rob Fergus
OCL
60
1
0
21 Aug 2024
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
Rafael Rafailov
Kyle Hatch
Anikait Singh
Laura Smith
Aviral Kumar
...
Victor Kolev
Philip J. Ball
Jiajun Wu
Chelsea Finn
Sergey Levine
OffRL
34
3
0
15 Aug 2024
An Introduction to Reinforcement Learning: Fundamental Concepts and
  Practical Applications
An Introduction to Reinforcement Learning: Fundamental Concepts and Practical Applications
Majid Ghasemi
Amir Hossein Moosavi
Ibrahim Sorkhoh
Anjali Agrawal
Fadi Alzhouri
Dariush Ebrahimi
OffRL
52
0
0
13 Aug 2024
Reinforcement Learning based Workflow Scheduling in Cloud and Edge
  Computing Environments: A Taxonomy, Review and Future Directions
Reinforcement Learning based Workflow Scheduling in Cloud and Edge Computing Environments: A Taxonomy, Review and Future Directions
Amanda Jayanetti
Saman K. Halgamuge
Rajkumar Buyya
20
0
0
06 Aug 2024
Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain
  Agnostic Framework for Data-Driven Scientific Research
Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research
Tian Lan
Huan Wang
Caiming Xiong
Silvio Savarese
AI4CE
34
0
0
01 Aug 2024
How to Choose a Reinforcement-Learning Algorithm
How to Choose a Reinforcement-Learning Algorithm
Fabian Bongratz
Vladimir Golkov
Lukas Mautner
Luca Della Libera
Frederik Heetmeyer
Felix Czaja
Julian Rodemann
Daniel Cremers
34
1
0
30 Jul 2024
SAPG: Split and Aggregate Policy Gradients
SAPG: Split and Aggregate Policy Gradients
Jayesh Singla
Ananye Agarwal
Deepak Pathak
OffRL
OnRL
42
3
0
29 Jul 2024
Dataset Distillation for Offline Reinforcement Learning
Dataset Distillation for Offline Reinforcement Learning
Jonathan Light
Yuanzhe Liu
Ziniu Hu
DD
40
2
0
29 Jul 2024
NAVIX: Scaling MiniGrid Environments with JAX
NAVIX: Scaling MiniGrid Environments with JAX
Eduardo Pignatelli
Jarek Liesen
R. T. Lange
Chris Xiaoxuan Lu
Pablo Samuel Castro
Laura Toni
45
3
0
28 Jul 2024
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement
  Learning
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
Andrew Patterson
Samuel Neumann
Raksha Kumaraswamy
Martha White
Adam White
26
2
0
26 Jul 2024
Proximal Policy Distillation
Proximal Policy Distillation
Giacomo Spigler
OffRL
28
1
0
21 Jul 2024
Instruction Following with Goal-Conditioned Reinforcement Learning in
  Virtual Environments
Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments
Zoya Volovikova
A. Skrynnik
Petr Kuderov
Aleksandr I. Panov
LLMAG
LM&Ro
49
1
0
12 Jul 2024
Structural Design Through Reinforcement Learning
Structural Design Through Reinforcement Learning
Thomas Rochefort-Beaudoin
Aurelian Vadean
Niels Aage
S. Achiche
AI4CE
31
0
0
10 Jul 2024
Generalizing soft actor-critic algorithms to discrete action spaces
Generalizing soft actor-critic algorithms to discrete action spaces
Le Zhang
Yong Gu
Xin Zhao
Yanshuo Zhang
Shu Zhao
Yifei Jin
Xinxin Wu
34
0
0
08 Jul 2024
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
Shihan Deng
Weikai Xu
Hongda Sun
Wei Liu
Tao Tan
...
Ang Li
Jian Luan
Bin Wang
Rui Yan
Shuo Shang
LLMAG
52
8
0
01 Jul 2024
SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement
  Learning
SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning
Matthias Weissenbacher
Rishabh Agarwal
Yoshinobu Kawahara
OffRL
34
1
0
21 Jun 2024
Advantage Alignment Algorithms
Advantage Alignment Algorithms
Juan Agustin Duque
Milad Aghajohari
Tim Cooijmans
Tianyu Zhang
Rameswar Panda
Gauthier Gidel
Aaron Courville
30
0
0
20 Jun 2024
Explore-Go: Leveraging Exploration for Generalisation in Deep
  Reinforcement Learning
Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning
Max Weltevrede
Felix Kaubek
M. Spaan
Wendelin Bohmer
57
0
0
12 Jun 2024
PufferLib: Making Reinforcement Learning Libraries and Environments Play
  Nice
PufferLib: Making Reinforcement Learning Libraries and Environments Play Nice
Joseph Suarez
AI4CE
51
2
0
11 Jun 2024
World Models with Hints of Large Language Models for Goal Achieving
World Models with Hints of Large Language Models for Goal Achieving
Zeyuan Liu
Ziyu Huan
Xiyao Wang
Jiafei Lyu
Jian Tao
Xiu Li
Furong Huang
Huazhe Xu
LM&Ro
LRM
AI4CE
48
1
0
11 Jun 2024
Massively Multiagent Minigames for Training Generalist Agents
Massively Multiagent Minigames for Training Generalist Agents
Kyoung Whan Choe
Ryan Sullivan
Joseph Suárez
AI4CE
34
0
0
07 Jun 2024
Transductive Off-policy Proximal Policy Optimization
Transductive Off-policy Proximal Policy Optimization
Yaozhong Gan
Renye Yan
Xiaoyang Tan
Zhe Wu
Junliang Xing
OffRL
29
2
0
06 Jun 2024
A Bayesian Approach to Online Planning
A Bayesian Approach to Online Planning
Nir Greshler
David Ben-Eli
Carmel Rabinovitz
Gabi Guetta
Liran Gispan
Guy Zohar
Aviv Tamar
23
0
0
04 Jun 2024
Learning-based legged locomotion; state of the art and future
  perspectives
Learning-based legged locomotion; state of the art and future perspectives
Sehoon Ha
Joonho Lee
M. van de Panne
Zhaoming Xie
Wenhao Yu
Majid Khadiv
47
17
0
03 Jun 2024
FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning
FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning
Yuwei Fu
Haichao Zhang
Di Wu
Wei Xu
Benoit Boulet
VLM
34
12
0
02 Jun 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
40
0
0
29 May 2024
Highway Reinforcement Learning
Highway Reinforcement Learning
Yuhui Wang
M. Strupl
Francesco Faccio
Qingyuan Wu
Haozhe Liu
Michal Grudzieñ
Xiaoyang Tan
Jürgen Schmidhuber
OffRL
39
4
0
28 May 2024
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement
  Learning
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement Learning
Aneesh Muppidi
Zhiyu Zhang
Heng Yang
39
4
0
26 May 2024
Pausing Policy Learning in Non-stationary Reinforcement Learning
Pausing Policy Learning in Non-stationary Reinforcement Learning
Hyunin Lee
Ming Jin
Javad Lavaei
Somayeh Sojoudi
OffRL
41
2
0
25 May 2024
Neural Network Compression for Reinforcement Learning Tasks
Neural Network Compression for Reinforcement Learning Tasks
Dmitry A. Ivanov
D. Larionov
Oleg V. Maslennikov
V. Voevodin
OffRL
AI4CE
55
0
0
13 May 2024
A Methodology-Oriented Study of Catastrophic Forgetting in Incremental
  Deep Neural Networks
A Methodology-Oriented Study of Catastrophic Forgetting in Incremental Deep Neural Networks
Ashutosh Kumar
Sonali Agarwal
D. J. Hemanth
43
0
0
11 May 2024
The Curse of Diversity in Ensemble-Based Exploration
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. DÓro
Evgenii Nikishin
Rameswar Panda
44
1
0
07 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World
  Models and Beyond
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGen
LM&Ro
87
38
0
06 May 2024
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent
  Baseline
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
Wenjia Meng
Qian Zheng
Long Yang
Yilong Yin
Gang Pan
OffRL
34
0
0
04 May 2024
Previous
12345...181920
Next