ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.06769
19
0

Value Iteration with Guessing for Markov Chains and Markov Decision Processes

10 May 2025
Krishnendu Chatterjee
Mahdi JafariRaviz
Raimundo Saona
Jakub Svoboda
ArXivPDFHTML
Abstract

Two standard models for probabilistic systems are Markov chains (MCs) and Markov decision processes (MDPs). Classic objectives for such probabilistic models for control and planning problems are reachability and stochastic shortest path. The widely studied algorithmic approach for these problems is the Value Iteration (VI) algorithm which iteratively applies local updates called Bellman updates. There are many practical approaches for VI in the literature but they all require exponentially many Bellman updates for MCs in the worst case. A preprocessing step is an algorithm that is discrete, graph-theoretical, and requires linear space. An important open question is whether, after a polynomial-time preprocessing, VI can be achieved with sub-exponentially many Bellman updates. In this work, we present a new approach for VI based on guessing values. Our theoretical contributions are twofold. First, for MCs, we present an almost-linear-time preprocessing algorithm after which, along with guessing values, VI requires only subexponentially many Bellman updates. Second, we present an improved analysis of the speed of convergence of VI for MDPs. Finally, we present a practical algorithm for MDPs based on our new approach. Experimental results show that our approach provides a considerable improvement over existing VI-based approaches on several benchmark examples from the literature.

View on arXiv
@article{chatterjee2025_2505.06769,
  title={ Value Iteration with Guessing for Markov Chains and Markov Decision Processes },
  author={ Krishnendu Chatterjee and Mahdi JafariRaviz and Raimundo Saona and Jakub Svoboda },
  journal={arXiv preprint arXiv:2505.06769},
  year={ 2025 }
}
Comments on this paper