1802.01561
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
5 February 2018
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
Tom Ward
Yotam Doron
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
Papers citing
"IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures"
50 / 981 papers shown
Title
LLM Augmented Hierarchical Agents
Bharat Prakash
Tim Oates
T. Mohsenin
24
4
0
09 Nov 2023
Real-Time Recurrent Reinforcement Learning
Julian Lemmel
Radu Grosu
34
2
0
08 Nov 2023
Handover Protocol Learning for LEO Satellite Networks: Access Delay and Collision Minimization
Ju-Hyung Lee
C. Park
Soohyun Park
A. Molisch
18
8
0
31 Oct 2023
Improving Intrinsic Exploration by Creating Stationary Objectives
Roger Creus Castanyer
Javier Civera
Taihú Pire
OffRL
37
3
0
27 Oct 2023
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
Dexter Neo
Tsuhan Chen
30
1
0
26 Oct 2023
Combining Behaviors with the Successor Features Keyboard
Wilka Carvalho
Andre Saraiva
Angelos Filos
Andrew Kyle Lampinen
Loic Matthey
Richard L. Lewis
Honglak Lee
Satinder Singh
Danilo Jimenez Rezende
Daniel Zoran
18
3
0
24 Oct 2023
μ-DDRL: A QoS-Aware Distributed Deep Reinforcement Learning Technique for Service Offloading in Fog computing Environments
M. Goudarzi
M. A. Rodriguez
Majid Sarvi
Rajkumar Buyya
OffRL
22
1
0
13 Oct 2023
Cross-Episodic Curriculum for Transformer Agents
Lucy Xiaoyang Shi
Yunfan Jiang
Jake Grigsby
Linxi "Jim" Fan
Yuke Zhu
30
5
0
12 Oct 2023
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Yazhe Niu
Yuan Pu
Zhenjie Yang
Xueyan Li
Tong Zhou
Jiyuan Ren
Shuai Hu
Hongsheng Li
Yu Liu
98
12
0
12 Oct 2023
GROOT: Learning to Follow Instructions by Watching Gameplay Videos
Shaofei Cai
Bowei Zhang
Zihao Wang
Xiaojian Ma
Guy Van den Broeck
Yitao Liang
83
26
0
12 Oct 2023
Understanding and Controlling a Maze-Solving Policy Network
Ulisse Mini
Peli Grietzer
Mrinank Sharma
Austin Meek
M. MacDiarmid
Alexander Matt Turner
14
15
0
12 Oct 2023
RoboHive: A Unified Framework for Robot Learning
Vikash Kumar
Rutav Shah
Gaoyue Zhou
Vincent Moens
Vittorio Caggiano
Jay Vakil
Abhishek Gupta
Aravind Rajeswaran
24
22
0
10 Oct 2023
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
T. Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
41
48
0
06 Oct 2023
Small batch deep reinforcement learning
J. Obando-Ceron
Marc G. Bellemare
Pablo Samuel Castro
VLM
34
14
0
05 Oct 2023
Neural architecture impact on identifying temporally extended Reinforcement Learning tasks
Victor Vadakechirayath George
OffRL
13
0
0
04 Oct 2023
Learning Diverse Skills for Local Navigation under Multi-constraint Optimality
Jin Cheng
Marin Vlastelica
Pavel Kolev
Chenhao Li
Georg Martius
43
6
0
03 Oct 2023
Algebras of actions in an agent's representations of the world
Alexander Dean
Eduardo Alonso
Esther Mondragón
35
0
0
02 Oct 2023
Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform
Shengyi Huang
Jiayi Weng
Rujikorn Charakorn
Min Lin
Zhongwen Xu
Santiago Ontañón
25
3
0
29 Sep 2023
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents
Marco Pleines
Matthias Pallasch
Frank Zimmer
Mike Preuss
OffRL
29
0
0
29 Sep 2023
Controlling Continuous Relaxation for Combinatorial Optimization
Yuma Ichikawa
32
4
0
29 Sep 2023
RLLTE: Long-Term Evolution Project of Reinforcement Learning
Tao Lv
Zequn Zhang
Yang Xu
Shihao Luo
Bo Li
Xin Jin
Wenjun Zeng
OffRL
37
1
0
28 Sep 2023
Distill Knowledge in Multi-task Reinforcement Learning with Optimal-Transport Regularization
Bang Giang Le
Viet-Cuong Ta
OT
39
1
0
27 Sep 2023
Diagnosing and exploiting the computational demands of videos games for deep reinforcement learning
L. Govindarajan
Rex G Liu
Drew Linsley
A. Ashok
Max Reuter
M. Frank
Thomas Serre
OffRL
21
0
0
22 Sep 2023
Boosting Studies of Multi-Agent Reinforcement Learning on Google Research Football Environment: the Past, Present, and Future
Yan Song
He Jiang
Haifeng Zhang
Zheng Tian
Weinan Zhang
Jun Wang
OffRL
28
8
0
22 Sep 2023
Hierarchical reinforcement learning with natural language subgoals
Arun Ahuja
Kavya Kopparapu
Rob Fergus
Ishita Dasgupta
26
1
0
20 Sep 2023
RoboAgent: Generalization and Efficiency in Robot Manipulation via Semantic Augmentations and Action Chunking
Homanga Bharadhwaj
Jay Vakil
Mohit Sharma
Abhi Gupta
Shubham Tulsiani
Vikash Kumar
LM&Ro
21
117
0
05 Sep 2023
LoopTune: Optimizing Tensor Computations with Reinforcement Learning
Dejan Grubisic
Bram Wasti
Chris Cummins
John Mellor-Crummey
A. Zlateski
27
0
0
04 Sep 2023
Neurosymbolic Reinforcement Learning and Planning: A Survey
Kamal Acharya
Waleed Raza
Carlos Dourado
Alvaro Velasquez
Houbing Song
NAI
OffRL
32
16
0
02 Sep 2023
D-VAT: End-to-End Visual Active Tracking for Micro Aerial Vehicles
Alberto Dionigi
Simone Felicioni
Mirko Leomanni
G. Costante
15
9
0
31 Aug 2023
Cyclophobic Reinforcement Learning
Stefan Sylvius Wagner
P. Arndt
Jan Robine
Stefan Harmeling
29
1
0
30 Aug 2023
Adversarial Style Transfer for Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
29
4
0
29 Aug 2023
Go Beyond Imagination: Maximizing Episodic Reachability with World Models
Yao Fu
Run Peng
Honglak Lee
24
1
0
25 Aug 2023
Reinforcement Learning Informed Evolutionary Search for Autonomous Systems Testing
D. Humeniuk
Foutse Khomh
G. Antoniol
33
4
0
24 Aug 2023
Reinforced Self-Training (ReST) for Language Modeling
Çağlar Gülçehre
T. Paine
S. Srinivasan
Ksenia Konyushkova
L. Weerts
...
Chenjie Gu
Wolfgang Macherey
Arnaud Doucet
Orhan Firat
Nando de Freitas
OffRL
66
278
0
17 Aug 2023
Scope Loss for Imbalanced Classification and RL Exploration
Hasham Burhani
Xiaolong Shi
Jonathan Jaegerman
Daniel Balicki
24
0
0
08 Aug 2023
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Michaël Mathieu
Sherjil Ozair
Srivatsan Srinivasan
Çağlar Gülçehre
Shangtong Zhang
...
Sergio Gomez Colmenarejo
Aaron van den Oord
Wojciech M. Czarnecki
Nando de Freitas
Oriol Vinyals
OffRL
16
10
0
07 Aug 2023
Bag of Policies for Distributional Deep Exploration
Asen Nachkov
Luchen Li
Giulia Luise
Filippo Valdettaro
Aldo A. Faisal
OffRL
43
0
0
03 Aug 2023
Learning to Model the World with Language
Jessy Lin
Yuqing Du
Olivia Watkins
Danijar Hafner
Pieter Abbeel
Dan Klein
Anca Dragan
LM&Ro
SyDa
44
51
0
31 Jul 2023
Robust Multi-Agent Reinforcement Learning with State Uncertainty
Sihong He
Songyang Han
Sanbao Su
Shuo Han
Shaofeng Zou
Fei Miao
OOD
33
43
0
30 Jul 2023
A new Gradient TD Algorithm with only One Step-size: Convergence Rate Analysis using L-λ Smoothness
Hengshuai Yao
21
2
0
29 Jul 2023
Thinker: Learning to Plan and Act
Stephen Chung
Ivan Anokhin
David M. Krueger
LLMAG
OffRL
LRM
30
5
0
27 Jul 2023
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
Seohong Park
Dibya Ghosh
Benjamin Eysenbach
Sergey Levine
OffRL
30
47
0
22 Jul 2023
Scaling Laws for Imitation Learning in Single-Agent Games
Jens Tuyls
Dhruv Madeka
Kari Torkkola
Dean Phillips Foster
Karthik R. Narasimhan
Sham Kakade
34
4
0
18 Jul 2023
IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on Analyses of Interestingness
Pedro Sequeira
Melinda Gervasio
18
2
0
18 Jul 2023
LuckyMera: a Modular AI Framework for Building Hybrid NetHack Agents
Luigi Quarantiello
Simone Marzeddu
Antonio Guzzi
Vincenzo Lomonaco
29
0
0
17 Jul 2023
"It is currently hodgepodge": Examining AI/ML Practitioners' Challenges during Co-production of Responsible AI Values
R. Varanasi
Nitesh Goyal
37
46
0
14 Jul 2023
A Survey From Distributed Machine Learning to Distributed Deep Learning
Mohammad Dehghani
Zahra Yazdanparast
26
0
0
11 Jul 2023
SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks
Xingyu Lin
John So
Sashwat Mahalingam
Fangchen Liu
Pieter Abbeel
SSL
30
22
0
07 Jul 2023
Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning
Seungyong Moon
Junyoung Yeom
Bumsoo Park
Hyun Oh Song
OffRL
29
3
0
07 Jul 2023
Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning
Qiang He
Dinesh Manocha
Meng Fang
S. Maghsudi
34
4
0
29 Jun 2023