ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.13408
12
0

CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process

19 May 2025
Jinhe Bi
Danqi Yan
Yifan Wang
Wenke Huang
Haokun Chen
Guancheng Wan
Mang Ye
Xun Xiao
Hinrich Schuetze
Volker Tresp
Yunpu Ma
    LRM
ArXivPDFHTML
Abstract

Recent Large Reasoning Models significantly improve the reasoning ability of Large Language Models by learning to reason, exhibiting the promising performance in solving complex tasks. LRMs solve tasks that require complex reasoning by explicitly generating reasoning trajectories together with answers. Nevertheless, judging the quality of such an output answer is not easy because only considering the correctness of the answer is not enough and the soundness of the reasoning trajectory part matters as well. Logically, if the soundness of the reasoning part is poor, even if the answer is correct, the confidence of the derived answer should be low. Existing methods did consider jointly assessing the overall output answer by taking into account the reasoning part, however, their capability is still not satisfactory as the causal relationship of the reasoning to the concluded answer cannot properly reflected. In this paper, inspired by classical mechanics, we present a novel approach towards establishing a CoT-Kinetics energy equation. Specifically, our CoT-Kinetics energy equation formulates the token state transformation process, which is regulated by LRM internal transformer layers, as like a particle kinetics dynamics governed in a mechanical field. Our CoT-Kinetics energy assigns a scalar score to evaluate specifically the soundness of the reasoning phase, telling how confident the derived answer could be given the evaluated reasoning. As such, the LRM's overall output quality can be accurately measured, rather than a coarse judgment (e.g., correct or incorrect) anymore.

View on arXiv
@article{bi2025_2505.13408,
  title={ CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process },
  author={ Jinhe Bi and Danqi Yan and Yifan Wang and Wenke Huang and Haokun Chen and Guancheng Wan and Mang Ye and Xun Xiao and Hinrich Schuetze and Volker Tresp and Yunpu Ma },
  journal={arXiv preprint arXiv:2505.13408},
  year={ 2025 }
}
Comments on this paper