ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.18573
56
0

FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language Models

25 February 2025
Radu Marinescu
D. Bhattacharjya
Junkyu Lee
T. Tchrakian
Javier Carnerero-Cano
Yufang Hou
Elizabeth M. Daly
Alessandra Pascale
    HILM
    LRM
ArXivPDFHTML
Abstract

Large language models (LLMs) have demonstrated vast capabilities on generative tasks in recent years, yet they struggle with guaranteeing the factual correctness of the generated content. This makes these models unreliable in realistic situations where factually accurate responses are expected. In this paper, we propose FactReasoner, a new factuality assessor that relies on probabilistic reasoning to assess the factuality of a long-form generated response. Specifically, FactReasoner decomposes the response into atomic units, retrieves relevant contexts for them from an external knowledge source, and constructs a joint probability distribution over the atoms and contexts using probabilistic encodings of the logical relationships (entailment, contradiction) between the textual utterances corresponding to the atoms and contexts. FactReasoner then computes the posterior probability of whether atomic units in the response are supported by the retrieved contexts. Our experiments on labeled and unlabeled benchmark datasets demonstrate clearly that FactReasoner improves considerably over state-of-the-art prompt-based approaches in terms of both factual precision and recall.

View on arXiv
@article{marinescu2025_2502.18573,
  title={ FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language Models },
  author={ Radu Marinescu and Debarun Bhattacharjya and Junkyu Lee and Tigran Tchrakian and Javier Carnerero Cano and Yufang Hou and Elizabeth Daly and Alessandra Pascale },
  journal={arXiv preprint arXiv:2502.18573},
  year={ 2025 }
}
Comments on this paper