ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.12661
  4. Cited By
Online Learning for Unknown Partially Observable MDPs

Online Learning for Unknown Partially Observable MDPs

25 February 2021
Mehdi Jafarnia-Jahromi
Rahul Jain
A. Nayyar
ArXivPDFHTML

Papers citing "Online Learning for Unknown Partially Observable MDPs"

18 / 18 papers shown
Title
Efficient Learning of POMDPs with Known Observation Model in
  Average-Reward Setting
Efficient Learning of POMDPs with Known Observation Model in Average-Reward Setting
Alessio Russo
Alberto Maria Metelli
Marcello Restelli
26
0
0
02 Oct 2024
Learning Successor Features with Distributed Hebbian Temporal Memory
Learning Successor Features with Distributed Hebbian Temporal Memory
E. Dzhivelikian
Petr Kuderov
Aleksandr I. Panov
30
0
0
20 Oct 2023
Posterior Sampling-based Online Learning for Episodic POMDPs
Posterior Sampling-based Online Learning for Episodic POMDPs
Dengwang Tang
Dongze Ye
Rahul Jain
A. Nayyar
Pierluigi Nuzzo
OffRL
51
0
0
16 Oct 2023
Provably Efficient Representation Learning with Tractable Planning in
  Low-Rank POMDP
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP
Jiacheng Guo
Zihao Li
Huazheng Wang
Mengdi Wang
Zhuoran Yang
Xuezhou Zhang
32
5
0
21 Jun 2023
Bayesian Learning of Optimal Policies in Markov Decision Processes with
  Countably Infinite State-Space
Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space
Saghar Adler
V. Subramanian
23
2
0
05 Jun 2023
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint
  Violation
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
26
6
0
27 Jan 2023
Partially Observable RL with B-Stability: Unified Structural Condition
  and Sharp Sample-Efficient Algorithms
Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms
Fan Chen
Yu Bai
Song Mei
53
22
0
29 Sep 2022
Deriving time-averaged active inference from control principles
Deriving time-averaged active inference from control principles
Eli Sennesh
J. Theriault
Jan-Willem van de Meent
L. F. Barrett
K. Quigley
AI4TS
AI4CE
24
3
0
22 Aug 2022
Learning in Observable POMDPs, without Computationally Intractable
  Oracles
Learning in Observable POMDPs, without Computationally Intractable Oracles
Noah Golowich
Ankur Moitra
Dhruv Rohatgi
29
26
0
07 Jun 2022
Pessimism in the Face of Confounders: Provably Efficient Offline
  Reinforcement Learning in Partially Observable Markov Decision Processes
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
OffRL
54
22
0
26 May 2022
When Is Partially Observable Reinforcement Learning Not Scary?
When Is Partially Observable Reinforcement Learning Not Scary?
Qinghua Liu
Alan Chung
Csaba Szepesvári
Chi Jin
14
92
0
19 Apr 2022
Common Information based Approximate State Representations in
  Multi-Agent Reinforcement Learning
Common Information based Approximate State Representations in Multi-Agent Reinforcement Learning
Shitao Xiao
V. Subramanian
21
9
0
25 Oct 2021
A Bayesian Learning Algorithm for Unknown Zero-sum Stochastic Games with
  an Arbitrary Opponent
A Bayesian Learning Algorithm for Unknown Zero-sum Stochastic Games with an Arbitrary Opponent
Mehdi Jafarnia-Jahromi
Rahul Jain
A. Nayyar
28
5
0
08 Sep 2021
Sublinear Regret for Learning POMDPs
Sublinear Regret for Learning POMDPs
Yi Xiong
Ningyuan Chen
Xuefeng Gao
Xiang Zhou
21
25
0
08 Jul 2021
Online Learning for Stochastic Shortest Path Model via Posterior
  Sampling
Online Learning for Stochastic Shortest Path Model via Posterior Sampling
Mehdi Jafarnia-Jahromi
Liyu Chen
Rahul Jain
Haipeng Luo
OffRL
66
18
0
09 Jun 2021
Simple Agent, Complex Environment: Efficient Reinforcement Learning with
  Agent States
Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States
Shi Dong
Benjamin Van Roy
Zhengyuan Zhou
18
29
0
10 Feb 2021
Provably Efficient Reinforcement Learning with Linear Function
  Approximation Under Adaptivity Constraints
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Chi Jin
Zhuoran Yang
Zhaoran Wang
OffRL
122
166
0
06 Jan 2021
Model-free Reinforcement Learning in Infinite-horizon Average-reward
  Markov Decision Processes
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
107
99
0
15 Oct 2019
1