ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.06277
  4. Cited By
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit
  Partial Observability

Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability

13 July 2021
Dibya Ghosh
Jad Rahme
Aviral Kumar
Amy Zhang
Ryan P. Adams
Sergey Levine
    OffRL
ArXivPDFHTML

Papers citing "Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability"

50 / 67 papers shown
Title
Constant-Memory Strategies in Stochastic Games: Best Responses and Equilibria
Constant-Memory Strategies in Stochastic Games: Best Responses and Equilibria
Fengming Zhu
Fangzhen Lin
29
0
0
11 May 2025
Improving Controller Generalization with Dimensionless Markov Decision Processes
Improving Controller Generalization with Dimensionless Markov Decision Processes
V. Charvet
Sebastian Stein
R. Murray-Smith
34
0
0
14 Apr 2025
I Can Hear You Coming: RF Sensing for Uncooperative Satellite Evasion
I Can Hear You Coming: RF Sensing for Uncooperative Satellite Evasion
Cameron Mehlman
Gregory Falco
46
0
0
04 Apr 2025
CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving
Changxing Liu
Genjia Liu
ziqi wang
Jinchang Yang
Siheng Chen
62
0
0
11 Mar 2025
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
Yuxiao Qu
Matthew Y. R. Yang
Amrith Rajagopal Setlur
Lewis Tunstall
E. Beeching
Ruslan Salakhutdinov
Aviral Kumar
OffRL
62
13
0
10 Mar 2025
Task Aware Dreamer for Task Generalization in Reinforcement Learning
Task Aware Dreamer for Task Generalization in Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Songming Liu
Dong Yan
Jun Zhu
66
3
0
17 Feb 2025
Benchmarking Model Predictive Control and Reinforcement Learning Based Control for Legged Robot Locomotion in MuJoCo Simulation
Benchmarking Model Predictive Control and Reinforcement Learning Based Control for Legged Robot Locomotion in MuJoCo Simulation
Shivayogi Akki
Tan Chen
43
0
0
28 Jan 2025
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Zehan Qi
Xiao-Chang Liu
Iat Long Iong
Hanyu Lai
X. Sun
...
Shuntian Yao
Tianjie Zhang
Wei Xu
J. Tang
Yuxiao Dong
103
14
0
28 Jan 2025
RELIEF: Reinforcement Learning Empowered Graph Feature Prompt Tuning
RELIEF: Reinforcement Learning Empowered Graph Feature Prompt Tuning
Jiapeng Zhu
Zichen Ding
Jianxiang Yu
Jiaqi Tan
Xiang Li
Weining Qian
OffRL
146
2
0
20 Jan 2025
Environment Descriptions for Usability and Generalisation in
  Reinforcement Learning
Environment Descriptions for Usability and Generalisation in Reinforcement Learning
Dennis J. N. J. Soemers
Spyridon Samothrakis
Kurt Driessens
M. Winands
OffRL
82
1
0
22 Dec 2024
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
26
3
0
17 Nov 2024
GSL-PCD: Improving Generalist-Specialist Learning with Point Cloud
  Feature-based Task Partitioning
GSL-PCD: Improving Generalist-Specialist Learning with Point Cloud Feature-based Task Partitioning
Xiu Yuan
38
0
0
11 Nov 2024
UEVAVD: A Dataset for Developing UAV's Eye View Active Object Detection
UEVAVD: A Dataset for Developing UAV's Eye View Active Object Detection
Xinhua Jiang
Tianpeng Liu
Li Liu
Zhen Liu
Yongxiang Liu
19
0
0
07 Nov 2024
Incremental Learning of Retrievable Skills For Efficient Continual Task Adaptation
Incremental Learning of Retrievable Skills For Efficient Continual Task Adaptation
Daehee Lee
Minjong Yoo
Woo Kyung Kim
Wonje Choi
Honguk Woo
CLL
93
3
0
30 Oct 2024
Entity-based Reinforcement Learning for Autonomous Cyber Defence
Entity-based Reinforcement Learning for Autonomous Cyber Defence
Isaac Symes Thompson
Alberto Caron
Chris Hicks
V. Mavroudis
AAML
51
2
0
23 Oct 2024
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual
  Reinforcement Learning
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning
Vindula Jayawardana
Baptiste Freydt
Ao Qu
Cameron Hickert
Zhongxia Yan
Cathy Wu
45
1
0
19 Oct 2024
Urban Computing for Climate and Environmental Justice: Early
  Perspectives From Two Research Initiatives
Urban Computing for Climate and Environmental Justice: Early Perspectives From Two Research Initiatives
Carolina Veiga
Ashish Sharma
Daniel de Oliveira
Marcos Lage
Fabio Miranda
AI4CE
37
0
0
06 Oct 2024
Towards Interactive and Learnable Cooperative Driving Automation: a
  Large Language Model-Driven Decision-Making Framework
Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-Making Framework
Shiyu Fang
Jiaqi Liu
Mingyu Ding
Yiming Cui
Chen Lv
Peng Hang
Jian-jun Sun
27
7
0
19 Sep 2024
Robust Deep Reinforcement Learning for Inverter-based Volt-Var Control
  in Partially Observable Distribution Networks
Robust Deep Reinforcement Learning for Inverter-based Volt-Var Control in Partially Observable Distribution Networks
Qiong Liu
Ye Guo
Tong Xu
OffRL
21
0
0
13 Aug 2024
TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly
TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly
Mengqi Guo
Chen Li
Yuyang Zhao
Gim Hee Lee
ViT
37
0
0
22 Jul 2024
Fourier Controller Networks for Real-Time Decision-Making in Embodied
  Learning
Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning
Hengkai Tan
Songming Liu
Kai Ma
Chengyang Ying
Xingxing Zhang
Hang Su
Jun Zhu
31
2
0
30 May 2024
PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement
  Learning
PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Xuezhou Xu
Hang Su
Xingxing Zhang
Jun Zhu
32
4
0
23 May 2024
Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning
Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning
Hai Zhang
Boyuan Zheng
Anqi Guo
Tianying Ji
Anqi Guo
Junqiao Zhao
Lanqing Li
OffRL
39
0
0
20 May 2024
Do Agents Dream of Electric Sheep?: Improving Generalization in
  Reinforcement Learning through Generative Learning
Do Agents Dream of Electric Sheep?: Improving Generalization in Reinforcement Learning through Generative Learning
Giorgio Franceschelli
Mirco Musolesi
AI4CE
40
0
0
12 Mar 2024
SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent
  Reinforcement Learning Systems
SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent Reinforcement Learning Systems
Oubo Ma
Yuwen Pu
L. Du
Yang Dai
Ruo Wang
Xiaolei Liu
Yingcai Wu
Shouling Ji
AAML
30
3
0
06 Feb 2024
DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised
  Environment Design
DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design
Samuel Garcin
James Doran
Shangmin Guo
Christopher G. Lucas
Stefano V. Albrecht
35
7
0
05 Feb 2024
Multi-Object Navigation in real environments using hybrid policies
Multi-Object Navigation in real environments using hybrid policies
Assem Sadek
G. Bono
Boris Chidlovskii
A. Baskurt
Christian Wolf
47
5
0
24 Jan 2024
The Generalization Gap in Offline Reinforcement Learning
The Generalization Gap in Offline Reinforcement Learning
Ishita Mediratta
Qingfei You
Minqi Jiang
Roberta Raileanu
OffRL
84
10
0
10 Dec 2023
SAGE: Bridging Semantic and Actionable Parts for GEneralizable
  Manipulation of Articulated Objects
SAGE: Bridging Semantic and Actionable Parts for GEneralizable Manipulation of Articulated Objects
Haoran Geng
Songlin Wei
Congyue Deng
Bokui Shen
He-Nan Wang
Leonidas J. Guibas
LM&Ro
39
3
0
03 Dec 2023
Reinforcement Learning with Model Predictive Control for Highway Ramp Metering
Reinforcement Learning with Model Predictive Control for Highway Ramp Metering
Filippo Airaldi
B. de Schutter
Azita Dabiri
40
4
0
15 Nov 2023
Large Language Models for Robotics: A Survey
Large Language Models for Robotics: A Survey
Fanlong Zeng
Wensheng Gan
Yongheng Wang
Ning Liu
Philip S. Yu
LM&Ro
124
125
0
13 Nov 2023
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with
  Multi-Step On-Policy Optimization
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization
Kun Lei
Zhengmao He
Chenhao Lu
Kaizhe Hu
Yang Gao
Huazhe Xu
OffRL
OnRL
49
13
0
06 Nov 2023
Sim-to-Real Transfer of Adaptive Control Parameters for AUV
  Stabilization under Current Disturbance
Sim-to-Real Transfer of Adaptive Control Parameters for AUV Stabilization under Current Disturbance
Thomas Chaffre
J. Wheare
A. Lammas
Paulo E. Santos
G. Chenadec
Karl Sammut
Benoit Clement
18
1
0
17 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
38
10
0
15 Oct 2023
How the level sampling process impacts zero-shot generalisation in deep
  reinforcement learning
How the level sampling process impacts zero-shot generalisation in deep reinforcement learning
Samuel Garcin
James Doran
Shangmin Guo
Christopher G. Lucas
Stefano V. Albrecht
46
0
0
05 Oct 2023
LANCAR: Leveraging Language for Context-Aware Robot Locomotion in
  Unstructured Environments
LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments
Chak Lam Shek
Xiyang Wu
Wesley A. Suttle
Carl E. Busart
Erin Zaroukian
Dinesh Manocha
Pratap Tokekar
Amrit Singh Bedi
LLMAG
46
8
0
30 Sep 2023
On the Importance of Exploration for Generalization in Reinforcement
  Learning
On the Importance of Exploration for Generalization in Reinforcement Learning
Yiding Jiang
J. Zico Kolter
Roberta Raileanu
UQCV
OffRL
30
20
0
08 Jun 2023
Explore to Generalize in Zero-Shot RL
Explore to Generalize in Zero-Shot RL
E. Zisselman
Itai Lavie
Daniel Soudry
Aviv Tamar
18
15
0
05 Jun 2023
Heterogeneous Knowledge for Augmented Modular Reinforcement Learning
Heterogeneous Knowledge for Augmented Modular Reinforcement Learning
Lorenz Wolf
Mirco Musolesi
OffRL
16
0
0
01 Jun 2023
Learning to Extrapolate: A Transductive Approach
Learning to Extrapolate: A Transductive Approach
Aviv Netanyahu
Abhishek Gupta
Max Simchowitz
K. Zhang
Pulkit Agrawal
49
15
0
27 Apr 2023
CROP: Towards Distributional-Shift Robust Reinforcement Learning using
  Compact Reshaped Observation Processing
CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing
Philipp Altmann
Fabian Ritz
Leonard Feuchtinger
Jonas Nusslein
Claudia Linnhoff-Popien
Thomy Phan
OOD
OffRL
19
5
0
26 Apr 2023
PartManip: Learning Cross-Category Generalizable Part Manipulation
  Policy from Point Cloud Observations
PartManip: Learning Cross-Category Generalizable Part Manipulation Policy from Point Cloud Observations
Haoran Geng
Ziming Li
Yiran Geng
Jiayi Chen
Hao Dong
He-Nan Wang
3DPC
39
41
0
29 Mar 2023
POPGym: Benchmarking Partially Observable Reinforcement Learning
POPGym: Benchmarking Partially Observable Reinforcement Learning
Steven D. Morad
Ryan Kortvelesy
Matteo Bettini
Stephan Liwicki
Amanda Prorok
OffRL
14
37
0
03 Mar 2023
Failure-aware Policy Learning for Self-assessable Robotics Tasks
Failure-aware Policy Learning for Self-assessable Robotics Tasks
Kechun Xu
Runjian Chen
Shuqing Zhao
Zizhang Li
Hongxiang Yu
Ci Chen
Yue Wang
R. Xiong
20
1
0
25 Feb 2023
On the Power of Pre-training for Generalization in RL: Provable Benefits
  and Hardness
On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness
Haotian Ye
Xiaoyu Chen
Liwei Wang
S. Du
OffRL
24
6
0
19 Oct 2022
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
11
1
0
18 Oct 2022
The Impact of Task Underspecification in Evaluating Deep Reinforcement
  Learning
The Impact of Task Underspecification in Evaluating Deep Reinforcement Learning
Vindula Jayawardana
Catherine Tang
Sirui Li
Da Suo
Cathy Wu
OffRL
14
13
0
16 Oct 2022
A Comprehensive Survey of Data Augmentation in Visual Reinforcement
  Learning
A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning
Guozheng Ma
Zhen Wang
Zhecheng Yuan
Xueqian Wang
Bo Yuan
Dacheng Tao
OffRL
35
26
0
10 Oct 2022
Some Supervision Required: Incorporating Oracle Policies in
  Reinforcement Learning via Epistemic Uncertainty Metrics
Some Supervision Required: Incorporating Oracle Policies in Reinforcement Learning via Epistemic Uncertainty Metrics
Jun Jet Tai
Jordan Terry
M. Innocente
J. Brusey
N. Horri
19
1
0
22 Aug 2022
A Game-Theoretic Perspective of Generalization in Reinforcement Learning
A Game-Theoretic Perspective of Generalization in Reinforcement Learning
Chang Yang
Ruiyu Wang
Xinrun Wang
Zhen Wang
OffRL
19
3
0
07 Aug 2022
12
Next