Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.13857
Cited By
Learning in POMDPs is Sample-Efficient with Hindsight Observability
31 January 2023
Jonathan Lee
Alekh Agarwal
Christoph Dann
Tong Zhang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning in POMDPs is Sample-Efficient with Hindsight Observability"
18 / 18 papers shown
Title
Statistical Tractability of Off-policy Evaluation of History-dependent Policies in POMDPs
Yuheng Zhang
Nan Jiang
OffRL
61
0
0
03 Mar 2025
SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
Kaixuan Huang
Xudong Guo
Mengdi Wang
40
19
0
30 May 2024
Preparing for Black Swans: The Antifragility Imperative for Machine Learning
Ming Jin
36
2
0
18 May 2024
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Michael Lanier
Ying Xu
Nathan Jacobs
Chongjie Zhang
Yevgeniy Vorobeychik
21
2
0
14 Feb 2024
Learn to Teach: Sample-Efficient Privileged Learning for Humanoid Locomotion over Diverse Terrains
Feiyang Wu
Xavier Nal
Ye Zhao
Anqi Wu
Zhaoyuan Gu
Anqi Wu
Ye Zhao
45
0
0
09 Feb 2024
Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous Environments
Alexander W. Goodall
Francesco Belardinelli
OffRL
30
1
0
01 Feb 2024
Optimizing Heat Alert Issuance with Reinforcement Learning
Ellen M. Considine
Rachel C. Nethery
G. Wellenius
Francesca Dominici
Mauricio Tec
OffRL
29
0
0
21 Dec 2023
Posterior Sampling-based Online Learning for Episodic POMDPs
Dengwang Tang
Dongze Ye
Rahul Jain
A. Nayyar
Pierluigi Nuzzo
OffRL
51
0
0
16 Oct 2023
Prospective Side Information for Latent MDPs
Jeongyeol Kwon
Yonathan Efroni
Shie Mannor
C. Caramanis
28
5
0
11 Oct 2023
Approximate Model-Based Shielding for Safe Reinforcement Learning
Alexander W. Goodall
Francesco Belardinelli
16
0
0
27 Jul 2023
Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight
Jiacheng Guo
Minshuo Chen
Haiquan Wang
Caiming Xiong
Mengdi Wang
Yu Bai
19
5
0
06 Jul 2023
Provably Efficient UCB-type Algorithms For Learning Predictive State Representations
Ruiquan Huang
Yitao Liang
J. Yang
OffRL
24
5
0
01 Jul 2023
Theoretical Hardness and Tractability of POMDPs in RL with Partial Online State Information
Ming Shi
Yingbin Liang
Ness B. Shroff
29
2
0
14 Jun 2023
Efficient Reinforcement Learning with Impaired Observability: Learning to Act with Delayed and Missing State Observations
Minshuo Chen
Jie Meng
Yunru Bai
Yinyu Ye
H. Vincent Poor
Mengdi Wang
31
0
0
02 Jun 2023
Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms
Fan Chen
Yu Bai
Song Mei
53
22
0
29 Sep 2022
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings
Masatoshi Uehara
Ayush Sekhari
Jason D. Lee
Nathan Kallus
Wen Sun
58
6
0
24 Jun 2022
Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency
Lingxiao Wang
Qi Cai
Zhuoran Yang
Zhaoran Wang
51
17
0
26 May 2022
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
338
11,684
0
09 Mar 2017
1