1802.01561
Cited By
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Papers citing
"IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 976 papers shown
Title
Deep reinforcement learning-based longitudinal control strategy for automated vehicles at signalised intersections
Pankaj Kumar
Aditya Mishra
Pranamesh Chakraborty
Subrahmanya Swamy Peruru
19
0
0
13 May 2025
Combining Bayesian Inference and Reinforcement Learning for Agent Decision Making: A Review
Chengmin Zhou
Ville Kyrki
Pasi Fränti
Laura Ruotsalainen
BDL
AI4CE
42
0
0
12 May 2025
CLAM: Continuous Latent Action Models for Robot Learning from Unlabeled Demonstrations
Anthony Liang
Pavel Czempin
Matthew Hong
Yutai Zhou
Erdem Biyik
Stephen Tu
47
0
0
08 May 2025
Onboard Optimization and Learning: A Survey
Monirul Islam Pavel
Siyi Hu
Mahardhika Pratama
Ryszard Kowalczyk
26
0
0
07 May 2025
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
Vincenzo De Paola
Riccardo Zamboni
Mirco Mutti
Marcello Restelli
19
0
0
02 May 2025
Learning to Drive from a World Model
Mitchell Goff
Greg Hogan
George Hotz
Armand du Parc Locmaria
Kacper Raczy
Harald Schäfer
Adeeb Shihadeh
Weixing Zhang
Yassine Yousfi
39
0
0
27 Apr 2025
Using Reinforcement Learning to Integrate Subjective Wellbeing into Climate Adaptation Decision Making
Arthur Vandervoort
Miguel Costa
Morten W. Petersen
Martin Drews
Sonja Haustein
Karyn Morrissey
Francisco C. Pereira
29
0
0
14 Apr 2025
Pay Attention to What and Where? Interpretable Feature Extractor in Vision-based Deep Reinforcement Learning
Tien Pham
Angelo Cangelosi
31
0
0
14 Apr 2025
Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments
Dolton Fernandes
Pramod Kaushik
Harsh Shukla
Bapi Raju Surampudi
19
0
0
08 Apr 2025
World Model Agents with Change-Based Intrinsic Motivation
Jeremias Ferrao
Rafael Cunha
OffRL
MoE
57
0
0
26 Mar 2025
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
Yexin Li
Pring Wong
Hanfang Zhang
Shuo Chen
Siyuan Qi
OffRL
54
0
0
23 Mar 2025
1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities
Kevin Wang
Ishaan Javali
Michał Bortkiewicz
Tomasz Trzciński
Benjamin Eysenbach
SSL
OffRL
72
0
0
19 Mar 2025
SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Zihao Guo
Richard Willis
Tristan Tomilin
Joel Z Leibo
Yali Du
58
0
0
18 Mar 2025
Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs
Nicolas Le Roux
Marc G. Bellemare
Jonathan Lebensold
Arnaud Bergeron
Joshua Greaves
Alex Fréchette
Carolyne Pelletier
Eric Thibodeau-Laufer
Sándor Toth
Sam Work
OffRL
91
2
0
18 Mar 2025
Agents Play Thousands of 3D Video Games
Zhongwen Xu
Xianliang Wang
Siyi Li
Tao Yu
Liang Wang
Qiang Fu
Wei Yang
LM&Ro
52
0
0
17 Mar 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Moritz A. Zanger
Pascal R. van der Vaart
Wendelin Bohmer
M. Spaan
UQCV
BDL
173
0
0
14 Mar 2025
Impoola: The Power of Average Pooling for Image-Based Deep Reinforcement Learning
Raphael Trumpp
Ansgar Schäfftlein
Mirco Theile
Marco Caccamo
39
0
0
07 Mar 2025
Eau De Q-Network: Adaptive Distillation of Neural Networks in Deep Reinforcement Learning
Théo Vincent
Tim Lukas Faust
Yogesh Tripathi
Jan Peters
Carlo D'Eramo
42
0
0
03 Mar 2025
Multi-Agent Reinforcement Learning with Long-Term Performance Objectives for Service Workforce Optimization
Kareem Eissa
Rayal Prasad
Sarith Mohan
Ankur Kapoor
Dorin Comaniciu
V. Singh
44
0
0
03 Mar 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
59
0
0
27 Feb 2025
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
Taiyi Wang
Zhihao Wu
Jianheng Liu
Jianye Hao
Jun Wang
Kun Shao
OffRL
41
13
0
24 Feb 2025
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
G. Klambauer
Razvan Pascanu
Sepp Hochreiter
75
5
0
21 Feb 2025
Comply: Learning Sentences with Complex Weights inspired by Fruit Fly Olfaction
Alexei Figueroa
Justus Westerhoff
Golzar Atefi
Dennis Fast
B. Winter
Felix Alexander Gers
Alexander Loser
Wolfgang Nejdl
57
0
0
03 Feb 2025
Divergence-Augmented Policy Optimization
Qing Wang
Yingru Li
Jiechao Xiong
Tong Zhang
OffRL
47
16
0
28 Jan 2025
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning
Bowen Zheng
Ran Cheng
Kay Chen Tan
42
0
0
25 Jan 2025
Adaptive Data Exploitation in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
201
0
0
22 Jan 2025
Highway Graph to Accelerate Reinforcement Learning
Zidu Yin
Zhen Zhang
Dong Gong
Stefano V. Albrecht
J. Q. Shi
OffRL
39
0
0
08 Jan 2025
PIMAEX: Multi-Agent Exploration through Peer Incentivization
Michael Kolle
Johannes Tochtermann
Julian Schonberger
Gerhard Stenzel
Philipp Altmann
Claudia Linnhoff-Popien
41
0
0
03 Jan 2025
Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets
Akane Tsuboya
Yu Kono
Tatsuji Takahashi
31
0
0
23 Dec 2024
Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Matthew D Riemer
G. Subbaraj
Glen Berseth
Irina Rish
OffRL
77
1
0
18 Dec 2024
When Should We Prefer State-to-Visual DAgger Over Visual Reinforcement Learning?
Tongzhou Mu
Zhaoyang Li
Stanisław Wiktor Strzelecki
Xiu Yuan
Yunchao Yao
Litian Liang
H. Su
OffRL
82
2
0
18 Dec 2024
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
84
0
0
16 Dec 2024
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
Xin Liu
Yaran Chen
Haoran Li
SSL
94
0
0
14 Dec 2024
Bilinear Convolution Decomposition for Causal RL Interpretability
Narmeen Oozeer
Sinem Erisken
Alice Rigg
65
0
0
01 Dec 2024
ReinFog: A DRL Empowered Framework for Resource Management in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
77
0
0
20 Nov 2024
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
26
3
0
17 Nov 2024
Scaling Laws for Pre-training Agents and World Models
Tim Pearce
Tabish Rashid
Dave Bignell
Raluca Georgescu
Sam Devlin
Katja Hofmann
LM&Ro
42
6
0
07 Nov 2024
Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
38
0
0
06 Nov 2024
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Chengrui Qu
Laixi Shi
Kishan Panaganti
Pengcheng You
Adam Wierman
OffRL
OnRL
38
0
0
06 Nov 2024
Sample-Efficient Alignment for LLMs
Zichen Liu
Changyu Chen
Chao Du
Wee Sun Lee
Min-Bin Lin
36
3
0
03 Nov 2024
CALE: Continuous Arcade Learning Environment
Jesse Farebrother
Pablo Samuel Castro
ELM
33
0
0
31 Oct 2024
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
Michael T. Matthews
Michael Beukman
Chris Xiaoxuan Lu
Jakob Foerster
OffRL
AI4CE
36
2
0
30 Oct 2024
Predicting Future Actions of Reinforcement Learning Agents
Stephen Chung
Scott Niekum
David M. Krueger
29
1
0
29 Oct 2024
A Multi-Agent Reinforcement Learning Testbed for Cognitive Radio Applications
Sriniketh Vangaru
Daniel Rosen
Dylan Green
Raphael Rodriguez
Maxwell Wiecek
Amos Johnson
Alyse M. Jones
William C. Headley
37
1
0
28 Oct 2024
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
39
1
0
27 Oct 2024
OGBench: Benchmarking Offline Goal-Conditioned RL
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
50
9
0
26 Oct 2024
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Michael Noukhovitch
Shengyi Huang
Sophie Xhonneux
Arian Hosseini
Rishabh Agarwal
Rameswar Panda
OffRL
82
5
0
23 Oct 2024
Towards Map-Agnostic Policies for Adaptive Informative Path Planning
Julius Ruckin
David Morilla-Cabello
C. Stachniss
Eduardo Montijano
Marija Popović
36
0
0
22 Oct 2024
TF-DDRL: A Transformer-enhanced Distributed DRL Technique for Scheduling IoT Applications in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
OffRL
35
3
0
18 Oct 2024
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Ayush Jain
Norio Kosaka
Xinhu Li
Kyung-Min Kim
Erdem Bıyık
Joseph J. Lim
OffRL
21
0
0
15 Oct 2024