ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.01561
  4. Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures
v1v2v3 (latest)

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"

50 / 1,000 papers shown
Title
PIMAEX: Multi-Agent Exploration through Peer Incentivization
Michael Kolle
Johannes Tochtermann
Julian Schonberger
Gerhard Stenzel
Philipp Altmann
Claudia Linnhoff-Popien
105
0
0
03 Jan 2025
Reinforcement Learning with a Focus on Adjusting Policies to Reach
  Targets
Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets
Akane Tsuboya
Yu Kono
Tatsuji Takahashi
70
0
0
23 Dec 2024
Enabling Realtime Reinforcement Learning at Scale with Staggered
  Asynchronous Inference
Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Matthew D Riemer
G. Subbaraj
Glen Berseth
Irina Rish
OffRL
140
2
0
18 Dec 2024
When Should We Prefer State-to-Visual DAgger Over Visual Reinforcement
  Learning?
When Should We Prefer State-to-Visual DAgger Over Visual Reinforcement Learning?
Tongzhou Mu
Zhaoyang Li
Stanisław Wiktor Strzelecki
Xiu Yuan
Yunchao Yao
Litian Liang
H. Su
OffRL
127
2
0
18 Dec 2024
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
184
1
0
16 Dec 2024
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
Xin Liu
Yaran Chen
Haoran Li
SSL
218
0
0
14 Dec 2024
Bilinear Convolution Decomposition for Causal RL Interpretability
Bilinear Convolution Decomposition for Causal RL Interpretability
Narmeen Oozeer
Sinem Erisken
Alice Rigg
74
0
0
01 Dec 2024
ReinFog: A DRL Empowered Framework for Resource Management in Edge and
  Cloud Computing Environments
ReinFog: A DRL Empowered Framework for Resource Management in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
106
1
0
20 Nov 2024
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
72
6
0
17 Nov 2024
Scaling Laws for Pre-training Agents and World Models
Scaling Laws for Pre-training Agents and World Models
Tim Pearce
Tabish Rashid
Dave Bignell
Raluca Georgescu
Sam Devlin
Katja Hofmann
LM&Ro
80
7
0
07 Nov 2024
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from
  Shifted-Dynamics Data
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Chengrui Qu
Laixi Shi
Kishan Panaganti
Pengcheng You
Adam Wierman
OffRLOnRL
100
2
0
06 Nov 2024
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
158
1
0
06 Nov 2024
Sample-Efficient Alignment for LLMs
Sample-Efficient Alignment for LLMs
Zichen Liu
Changyu Chen
Chao Du
Wee Sun Lee
Min Lin
102
4
0
03 Nov 2024
CALE: Continuous Arcade Learning Environment
CALE: Continuous Arcade Learning Environment
Jesse Farebrother
Pablo Samuel Castro
ELM
68
0
0
31 Oct 2024
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
Michael T. Matthews
Michael Beukman
Chris Xiaoxuan Lu
Jakob Foerster
OffRLAI4CE
125
8
0
30 Oct 2024
Predicting Future Actions of Reinforcement Learning Agents
Predicting Future Actions of Reinforcement Learning Agents
Stephen Chung
Scott Niekum
David M. Krueger
80
3
0
29 Oct 2024
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
Günter Klambauer
Razvan Pascanu
Sepp Hochreiter
207
7
0
29 Oct 2024
A Multi-Agent Reinforcement Learning Testbed for Cognitive Radio
  Applications
A Multi-Agent Reinforcement Learning Testbed for Cognitive Radio Applications
Sriniketh Vangaru
Daniel Rosen
Dylan Green
Raphael Rodriguez
Maxwell Wiecek
Amos Johnson
Alyse M. Jones
William C. Headley
66
1
0
28 Oct 2024
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
129
1
0
27 Oct 2024
OGBench: Benchmarking Offline Goal-Conditioned RL
OGBench: Benchmarking Offline Goal-Conditioned RL
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
150
29
0
26 Oct 2024
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Michael Noukhovitch
Shengyi Huang
Sophie Xhonneux
Arian Hosseini
Rishabh Agarwal
Rameswar Panda
OffRL
183
11
0
23 Oct 2024
Towards Map-Agnostic Policies for Adaptive Informative Path Planning
Towards Map-Agnostic Policies for Adaptive Informative Path Planning
Julius Ruckin
David Morilla-Cabello
C. Stachniss
Eduardo Montijano
Marija Popović
88
0
0
22 Oct 2024
Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
Yanjun Chen
Wei Wei
Xianghui Wang
Zhiqiang Xu
Xiaoyu Shen
Wei Zhang
38
0
0
22 Oct 2024
TF-DDRL: A Transformer-enhanced Distributed DRL Technique for Scheduling
  IoT Applications in Edge and Cloud Computing Environments
TF-DDRL: A Transformer-enhanced Distributed DRL Technique for Scheduling IoT Applications in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
OffRL
115
4
0
18 Oct 2024
Mitigating Suboptimality of Deterministic Policy Gradients in Complex
  Q-functions
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Ayush Jain
Norio Kosaka
Xinhu Li
Kyung-Min Kim
Erdem Bıyık
Joseph J. Lim
OffRL
49
0
0
15 Oct 2024
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Hyunseung Kim
Jun Jet Tai
K. Subramanian
Peter R. Wurman
Jaegul Choo
Peter Stone
Takuma Seno
OffRL
189
17
0
13 Oct 2024
Retrieval-Augmented Decision Transformer: External Memory for In-context
  RL
Retrieval-Augmented Decision Transformer: External Memory for In-context RL
Thomas Schmied
Fabian Paischer
Vihang Patil
M. Hofmarcher
Razvan Pascanu
Sepp Hochreiter
OffRL
101
7
0
09 Oct 2024
Training Interactive Agent in Large FPS Game Map with Rule-enhanced
  Reinforcement Learning
Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning
Chen Zhang
Huan Hu
Yuan Zhou
Qiyang Cao
Ruochen Liu
Wenya Wei
Elvis S. Liu
AI4CE
116
0
0
07 Oct 2024
Breaking the mold: The challenge of large scale MARL specialization
Breaking the mold: The challenge of large scale MARL specialization
Stefan Juang
Hugh Cao
Arielle Zhou
Ruochen Liu
Nevin L. Zhang
Elvis Liu
54
1
0
03 Oct 2024
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Ghada Sokar
J. Obando-Ceron
Rameswar Panda
Hugo Larochelle
Pablo Samuel Castro
MoE
338
7
0
02 Oct 2024
Autonomous Network Defence using Reinforcement Learning
Autonomous Network Defence using Reinforcement Learning
Myles Foley
Chris Hicks
Kate Highnam
V. Mavroudis
AAML
55
31
0
26 Sep 2024
Exploring Semantic Clustering in Deep Reinforcement Learning for Video
  Games
Exploring Semantic Clustering in Deep Reinforcement Learning for Video Games
Liang Zhang
Justin Lieffers
A. Pyarelal
115
0
0
25 Sep 2024
The unknotting number, hard unknot diagrams, and reinforcement learning
The unknotting number, hard unknot diagrams, and reinforcement learning
Taylor Applebaum
Sam Blackwell
Alex Davies
Thomas Edlich
András Juhász
Marc Lackenby
Nenad Tomašev
Daniel Zheng
46
3
0
13 Sep 2024
DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with
  multi-fingered robots
DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots
Maria Bauzá
José Enrique Chen
Valentin Dalibard
Nimrod Gileadi
Roland Hafner
...
Martin Riedmiller
Jon Scholz
Konstantinos Bousmalis
Francesco Nori
Nicolas Heess
71
6
0
10 Sep 2024
Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Esraa Elelimy
Adam White
Michael Bowling
Martha White
OffRL
92
3
0
02 Sep 2024
Semantically Controllable Augmentations for Generalizable Robot Learning
Semantically Controllable Augmentations for Generalizable Robot Learning
Zoey Chen
Zhao Mandi
Homanga Bharadhwaj
Mohit Sharma
Shuran Song
Abhishek Gupta
Vikash Kumar
LM&Ro
107
7
0
02 Sep 2024
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
Anthony GX-Chen
Kenneth Marino
Rob Fergus
OCL
139
1
0
21 Aug 2024
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
Rafael Rafailov
Kyle Hatch
Anikait Singh
Laura Smith
Aviral Kumar
...
Victor Kolev
Philip J. Ball
Jiajun Wu
Chelsea Finn
Sergey Levine
OffRL
74
8
0
15 Aug 2024
An Introduction to Reinforcement Learning: Fundamental Concepts and
  Practical Applications
An Introduction to Reinforcement Learning: Fundamental Concepts and Practical Applications
Majid Ghasemi
Amir Hossein Moosavi
Ibrahim Sorkhoh
Anjali Agrawal
Fadi Alzhouri
Dariush Ebrahimi
OffRL
107
1
0
13 Aug 2024
Reinforcement Learning based Workflow Scheduling in Cloud and Edge
  Computing Environments: A Taxonomy, Review and Future Directions
Reinforcement Learning based Workflow Scheduling in Cloud and Edge Computing Environments: A Taxonomy, Review and Future Directions
Amanda Jayanetti
Saman K. Halgamuge
Rajkumar Buyya
32
0
0
06 Aug 2024
Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain
  Agnostic Framework for Data-Driven Scientific Research
Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research
Tian Lan
Huan Wang
Caiming Xiong
Silvio Savarese
AI4CE
81
0
0
01 Aug 2024
How to Choose a Reinforcement-Learning Algorithm
How to Choose a Reinforcement-Learning Algorithm
Fabian Bongratz
Vladimir Golkov
Lukas Mautner
Luca Della Libera
Frederik Heetmeyer
Felix Czaja
Julian Rodemann
Daniel Cremers
75
1
0
30 Jul 2024
SAPG: Split and Aggregate Policy Gradients
SAPG: Split and Aggregate Policy Gradients
Jayesh Singla
Ananye Agarwal
Deepak Pathak
OffRLOnRL
86
5
0
29 Jul 2024
Dataset Distillation for Offline Reinforcement Learning
Dataset Distillation for Offline Reinforcement Learning
Jonathan Light
Yuanzhe Liu
Ziniu Hu
DD
91
3
0
29 Jul 2024
NAVIX: Scaling MiniGrid Environments with JAX
NAVIX: Scaling MiniGrid Environments with JAX
Eduardo Pignatelli
Jarek Liesen
R. T. Lange
Chris Xiaoxuan Lu
Pablo Samuel Castro
Laura Toni
144
4
0
28 Jul 2024
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement
  Learning
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
Andrew Patterson
Samuel Neumann
Raksha Kumaraswamy
Martha White
Adam White
68
2
0
26 Jul 2024
Proximal Policy Distillation
Proximal Policy Distillation
Giacomo Spigler
OffRL
84
1
0
21 Jul 2024
Instruction Following with Goal-Conditioned Reinforcement Learning in
  Virtual Environments
Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments
Zoya Volovikova
A. Skrynnik
Petr Kuderov
Aleksandr I. Panov
LLMAGLM&Ro
88
1
0
12 Jul 2024
Structural Design Through Reinforcement Learning
Structural Design Through Reinforcement Learning
Thomas Rochefort-Beaudoin
Aurelian Vadean
Niels Aage
S. Achiche
AI4CE
40
0
0
10 Jul 2024
Generalizing soft actor-critic algorithms to discrete action spaces
Generalizing soft actor-critic algorithms to discrete action spaces
Le Zhang
Yong Gu
Xin Zhao
Yanshuo Zhang
Shu Zhao
Yifei Jin
Xinxin Wu
93
0
0
08 Jul 2024
Previous
12345...181920
Next