ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.13857
  4. Cited By
Learning in POMDPs is Sample-Efficient with Hindsight Observability

Learning in POMDPs is Sample-Efficient with Hindsight Observability

31 January 2023
Jonathan Lee
Alekh Agarwal
Christoph Dann
Tong Zhang
ArXivPDFHTML

Papers citing "Learning in POMDPs is Sample-Efficient with Hindsight Observability"

18 / 18 papers shown
Title
Statistical Tractability of Off-policy Evaluation of History-dependent Policies in POMDPs
Yuheng Zhang
Nan Jiang
OffRL
61
0
0
03 Mar 2025
SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
Kaixuan Huang
Xudong Guo
Mengdi Wang
40
19
0
30 May 2024
Preparing for Black Swans: The Antifragility Imperative for Machine
  Learning
Preparing for Black Swans: The Antifragility Imperative for Machine Learning
Ming Jin
36
2
0
18 May 2024
Learning Interpretable Policies in Hindsight-Observable POMDPs through
  Partially Supervised Reinforcement Learning
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Michael Lanier
Ying Xu
Nathan Jacobs
Chongjie Zhang
Yevgeniy Vorobeychik
21
2
0
14 Feb 2024
Learn to Teach: Sample-Efficient Privileged Learning for Humanoid Locomotion over Diverse Terrains
Learn to Teach: Sample-Efficient Privileged Learning for Humanoid Locomotion over Diverse Terrains
Feiyang Wu
Xavier Nal
Ye Zhao
Anqi Wu
Zhaoyuan Gu
Anqi Wu
Ye Zhao
45
0
0
09 Feb 2024
Leveraging Approximate Model-based Shielding for Probabilistic Safety
  Guarantees in Continuous Environments
Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous Environments
Alexander W. Goodall
Francesco Belardinelli
OffRL
30
1
0
01 Feb 2024
Optimizing Heat Alert Issuance with Reinforcement Learning
Optimizing Heat Alert Issuance with Reinforcement Learning
Ellen M. Considine
Rachel C. Nethery
G. Wellenius
Francesca Dominici
Mauricio Tec
OffRL
29
0
0
21 Dec 2023
Posterior Sampling-based Online Learning for Episodic POMDPs
Posterior Sampling-based Online Learning for Episodic POMDPs
Dengwang Tang
Dongze Ye
Rahul Jain
A. Nayyar
Pierluigi Nuzzo
OffRL
51
0
0
16 Oct 2023
Prospective Side Information for Latent MDPs
Prospective Side Information for Latent MDPs
Jeongyeol Kwon
Yonathan Efroni
Shie Mannor
C. Caramanis
28
5
0
11 Oct 2023
Approximate Model-Based Shielding for Safe Reinforcement Learning
Approximate Model-Based Shielding for Safe Reinforcement Learning
Alexander W. Goodall
Francesco Belardinelli
16
0
0
27 Jul 2023
Sample-Efficient Learning of POMDPs with Multiple Observations In
  Hindsight
Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight
Jiacheng Guo
Minshuo Chen
Haiquan Wang
Caiming Xiong
Mengdi Wang
Yu Bai
19
5
0
06 Jul 2023
Provably Efficient UCB-type Algorithms For Learning Predictive State
  Representations
Provably Efficient UCB-type Algorithms For Learning Predictive State Representations
Ruiquan Huang
Yitao Liang
J. Yang
OffRL
24
5
0
01 Jul 2023
Theoretical Hardness and Tractability of POMDPs in RL with Partial
  Online State Information
Theoretical Hardness and Tractability of POMDPs in RL with Partial Online State Information
Ming Shi
Yingbin Liang
Ness B. Shroff
29
2
0
14 Jun 2023
Efficient Reinforcement Learning with Impaired Observability: Learning
  to Act with Delayed and Missing State Observations
Efficient Reinforcement Learning with Impaired Observability: Learning to Act with Delayed and Missing State Observations
Minshuo Chen
Jie Meng
Yunru Bai
Yinyu Ye
H. Vincent Poor
Mengdi Wang
31
0
0
02 Jun 2023
Partially Observable RL with B-Stability: Unified Structural Condition
  and Sharp Sample-Efficient Algorithms
Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms
Fan Chen
Yu Bai
Song Mei
53
22
0
29 Sep 2022
Computationally Efficient PAC RL in POMDPs with Latent Determinism and
  Conditional Embeddings
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings
Masatoshi Uehara
Ayush Sekhari
Jason D. Lee
Nathan Kallus
Wen Sun
58
6
0
24 Jun 2022
Embed to Control Partially Observed Systems: Representation Learning
  with Provable Sample Efficiency
Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency
Lingxiao Wang
Qi Cai
Zhuoran Yang
Zhaoran Wang
51
17
0
26 May 2022
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
338
11,684
0
09 Mar 2017
1