Learning in POMDPs is Sample-Efficient with Hindsight Observability

Learning in POMDPs is Sample-Efficient with Hindsight Observability

31 January 2023

Tong Zhang

Papers citing "Learning in POMDPs is Sample-Efficient with Hindsight Observability"

18 / 18 papers shown

Title
Statistical Tractability of Off-policy Evaluation of History-dependent Policies in POMDPs Yuheng Zhang Nan Jiang OffRL 61 0 0 03 Mar 2025
SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths Kaixuan Huang Xudong Guo Mengdi Wang 40 19 0 30 May 2024
Preparing for Black Swans: The Antifragility Imperative for Machine Learning Ming Jin 36 2 0 18 May 2024
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning Michael Lanier Ying Xu Nathan Jacobs Chongjie Zhang Yevgeniy Vorobeychik 21 2 0 14 Feb 2024
Learn to Teach: Sample-Efficient Privileged Learning for Humanoid Locomotion over Diverse Terrains Feiyang Wu Xavier Nal Ye Zhao Anqi Wu Zhaoyuan Gu Anqi Wu Ye Zhao 45 0 0 09 Feb 2024
Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous Environments Alexander W. Goodall Francesco Belardinelli OffRL 30 1 0 01 Feb 2024
Optimizing Heat Alert Issuance with Reinforcement Learning Ellen M. Considine Rachel C. Nethery G. Wellenius Francesca Dominici Mauricio Tec OffRL 29 0 0 21 Dec 2023
Posterior Sampling-based Online Learning for Episodic POMDPs Dengwang Tang Dongze Ye Rahul Jain A. Nayyar Pierluigi Nuzzo OffRL 51 0 0 16 Oct 2023
Prospective Side Information for Latent MDPs Jeongyeol Kwon Yonathan Efroni Shie Mannor C. Caramanis 28 5 0 11 Oct 2023
Approximate Model-Based Shielding for Safe Reinforcement Learning Alexander W. Goodall Francesco Belardinelli 16 0 0 27 Jul 2023
Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight Jiacheng Guo Minshuo Chen Haiquan Wang Caiming Xiong Mengdi Wang Yu Bai 19 5 0 06 Jul 2023
Provably Efficient UCB-type Algorithms For Learning Predictive State Representations Ruiquan Huang Yitao Liang J. Yang OffRL 24 5 0 01 Jul 2023
Theoretical Hardness and Tractability of POMDPs in RL with Partial Online State Information Ming Shi Yingbin Liang Ness B. Shroff 29 2 0 14 Jun 2023
Efficient Reinforcement Learning with Impaired Observability: Learning to Act with Delayed and Missing State Observations Minshuo Chen Jie Meng Yunru Bai Yinyu Ye H. Vincent Poor Mengdi Wang 31 0 0 02 Jun 2023
Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms Fan Chen Yu Bai Song Mei 53 22 0 29 Sep 2022
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings Masatoshi Uehara Ayush Sekhari Jason D. Lee Nathan Kallus Wen Sun 58 6 0 24 Jun 2022
Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency Lingxiao Wang Qi Cai Zhuoran Yang Zhaoran Wang 51 17 0 26 May 2022
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks Chelsea Finn Pieter Abbeel Sergey Levine OOD 338 11,684 0 09 Mar 2017