Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.05038
Cited By
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
11 October 2021
Tianwei Ni
Benjamin Eysenbach
Ruslan Salakhutdinov
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs"
50 / 69 papers shown
Title
AssistanceZero: Scalably Solving Assistance Games
Cassidy Laidlaw
Eli Bronstein
Timothy Guo
Dylan Feng
Lukas Berglund
Justin Svegliato
Stuart J. Russell
Anca Dragan
37
1
0
09 Apr 2025
Partially Observable Reinforcement Learning with Memory Traces
Onno Eberhard
Michael Muehlebach
Claire Vernade
OffRL
41
0
0
19 Mar 2025
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
78
1
0
20 Feb 2025
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning
Egor Cherepanov
Nikita Kachaev
A. Kovalev
Aleksandr I. Panov
OffRL
41
0
0
14 Feb 2025
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
26
3
0
17 Nov 2024
Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation
Francisco Giral
Ignacio Gómez
Ricardo Vinuesa
S. L. Clainche
32
2
0
05 Nov 2024
When to Localize? A Risk-Constrained Reinforcement Learning Approach
Chak Lam Shek
Kasra Torshizi
Troi Williams
Pratap Tokekar
39
2
0
05 Nov 2024
Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning
Hung Le
Kien Do
D. Nguyen
Sunil Gupta
Svetha Venkatesh
40
0
0
14 Oct 2024
Metalic: Meta-Learning In-Context with Protein Language Models
Jacob Beck
Shikha Surana
Manus McAuliffe
Oliver Bent
Thomas D. Barrett
Juan Jose Garau Luis
Paul Duckworth
AI4CE
35
0
0
10 Oct 2024
Model-Free versus Model-Based Reinforcement Learning for Fixed-Wing UAV Attitude Control Under Varying Wind Conditions
David Olivares
Pierre Fournier
Pavan Vasishta
Julien Marzat
25
0
0
26 Sep 2024
Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Esraa Elelimy
Adam White
Michael Bowling
Martha White
OffRL
38
2
0
02 Sep 2024
Equivariant Reinforcement Learning under Partial Observability
Hai Nguyen
Andrea Baisero
David M. Klee
Dian Wang
Robert Platt
Christopher Amato
42
14
0
26 Aug 2024
Pessimistic Iterative Planning for Robust POMDPs
Maris F. L. Galesloot
Marnix Suilen
T. D. Simão
Steven Carr
M. Spaan
Ufuk Topcu
Nils Jansen
44
2
0
16 Aug 2024
Graceful task adaptation with a bi-hemispheric RL agent
Grant Nicholas
L. Kuhlmann
Gideon Kowadlo
42
0
0
16 Jul 2024
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
Cameron Allen
Aaron Kirtland
Ruo Yu Tao
Sam Lobel
Daniel Scott
Nicholas Petrocelli
Omer Gottesman
Ronald E. Parr
M. L. Littman
George Konidaris
28
1
0
10 Jul 2024
Intercepting Unauthorized Aerial Robots in Controlled Airspace Using Reinforcement Learning
Francisco Giral
Ignacio Gómez
S. L. Clainche
24
0
0
09 Jul 2024
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
29
3
0
23 Jun 2024
Unifying Interpretability and Explainability for Alzheimer's Disease Progression Prediction
Raja Farrukh Ali
Stephanie Milani
John Woods
Emmanuel Adenij
Ayesha Farooq
Clayton Mansel
Jeffrey Burns
William Hsu
33
0
0
11 Jun 2024
Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning
Jonathan Cook
Chris Xiaoxuan Lu
Edward Hughes
Joel Z Leibo
Jakob N. Foerster
38
4
0
01 Jun 2024
Rethinking Transformers in Solving POMDPs
Chenhao Lu
Ruizhe Shi
Yuyao Liu
Kaizhe Hu
Simon S. Du
Huazhe Xu
AI4CE
35
3
0
27 May 2024
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate
Fan Luo
Zuolin Tu
Zefang Huang
Yang Yu
OffRL
36
0
0
24 May 2024
Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice
Yusheng Jiao
Feng Ling
Sina Heydari
N. Heess
J. Merel
Eva Kanso
31
0
0
19 May 2024
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
Lang Qin
Ziming Wang
Runhao Jiang
Rui Yan
Huajin Tang
36
0
0
24 Apr 2024
Generalizing Multi-Step Inverse Models for Representation Learning to Finite-Memory POMDPs
Lili Wu
Ben Evans
Riashat Islam
Raihan Seraj
Yonathan Efroni
Alex Lamb
52
1
0
22 Apr 2024
Higher Replay Ratio Empowers Sample-Efficient Multi-Agent Reinforcement Learning
Linjie Xu
Zichuan Liu
Alexander Dockhorn
Diego Perez-Liebana
Jinyu Wang
Lei Song
Jiang Bian
48
2
0
15 Apr 2024
MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning
Zohar Rimon
Tom Jurgenson
Orr Krupnik
Gilad Adler
Aviv Tamar
37
8
0
14 Mar 2024
SplAgger: Split Aggregation for Meta-Reinforcement Learning
Jacob Beck
Matthew Jackson
Risto Vuorio
Zheng Xiong
Shimon Whiteson
OffRL
29
2
0
05 Mar 2024
Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist
Hai Nguyen
Tadashi Kozuno
C. C. Beltran-Hernandez
Masashi Hamaya
46
6
0
28 Feb 2024
Learn to Teach: Sample-Efficient Privileged Learning for Humanoid Locomotion over Diverse Terrains
Feiyang Wu
Xavier Nal
Ye Zhao
Anqi Wu
Zhaoyuan Gu
Anqi Wu
Ye Zhao
48
0
0
09 Feb 2024
Do Transformer World Models Give Better Policy Gradients?
Michel Ma
Tianwei Ni
Clement Gehring
P. DÓro
Pierre-Luc Bacon
36
4
0
07 Feb 2024
Dense Reward for Free in Reinforcement Learning from Human Feedback
Alex J. Chan
Hao Sun
Samuel Holt
M. Schaar
18
31
0
01 Feb 2024
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
22
20
0
17 Jan 2024
Sparse Mean Field Load Balancing in Large Localized Queueing Systems
Anam Tahir
Kai Cui
Heinz Koeppl
30
0
0
20 Dec 2023
MPC-Inspired Reinforcement Learning for Verifiable Model-Free Control
Yiwen Lu
Zishuo Li
Yihan Zhou
Na Li
Yilin Mo
25
2
0
08 Dec 2023
Real-Time Recurrent Reinforcement Learning
Julian Lemmel
Radu Grosu
34
2
0
08 Nov 2023
Improving Intrinsic Exploration by Creating Stationary Objectives
Roger Creus Castanyer
Javier Civera
Taihú Pire
OffRL
34
3
0
27 Oct 2023
Large Language Models as Generalizable Policies for Embodied Tasks
Andrew Szot
Max Schwarzer
Harsh Agrawal
Bogdan Mazoure
Walter A. Talbott
Katherine Metcalf
Natalie Mackraz
Devon Hjelm
Alexander Toshev
LM&Ro
37
58
0
26 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
38
10
0
15 Oct 2023
Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples
Hao Sun
Alihan Huyuk
Daniel Jarrett
M. Schaar
OffRL
39
6
0
11 Oct 2023
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
34
21
0
09 Oct 2023
Reinforcement Learning with Fast and Forgetful Memory
Steven D. Morad
Ryan Kortvelesy
Stephan Liwicki
Amanda Prorok
OffRL
24
4
0
06 Oct 2023
Recurrent Hypernetworks are Surprisingly Strong in Meta-RL
Jacob Beck
Risto Vuorio
Zheng Xiong
Shimon Whiteson
45
9
0
26 Sep 2023
ODE-based Recurrent Model-free Reinforcement Learning for POMDPs
Xu Zhao
Duzhen Zhang
Liyuan Han
Tielin Zhang
Bo Xu
37
7
0
25 Sep 2023
A Deep Recurrent-Reinforcement Learning Method for Intelligent AutoScaling of Serverless Functions
Siddharth Agarwal
M. A. Rodriguez
Rajkumar Buyya
22
8
0
11 Aug 2023
PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks
I. Char
J. Schneider
26
4
0
12 Jul 2023
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Tianwei Ni
Michel Ma
Benjamin Eysenbach
Pierre-Luc Bacon
OffRL
26
34
0
07 Jul 2023
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
Idan Shenfeld
Zhang-Wei Hong
Aviv Tamar
Pulkit Agrawal
27
12
0
06 Jul 2023
RL
3
^3
3
: Boosting Meta Reinforcement Learning via RL inside RL
2
^2
2
Abhinav Bhatia
Samer B. Nashed
S. Zilberstein
OffRL
22
0
0
28 Jun 2023
Introspective Action Advising for Interpretable Transfer Learning
Joseph Campbell
Yue (Sophie) Guo
Fiona Xie
Simon Stepputtis
Katia P. Sycara
35
1
0
21 Jun 2023
ContraBAR: Contrastive Bayes-Adaptive Deep RL
Era Choshen
Aviv Tamar
BDL
OffRL
8
7
0
04 Jun 2023
1
2
Next