A Statistical Physics of Language Model Reasoning

4 June 2025

Abstract

Transformer LMs show emergent reasoning that resists mechanistic understanding. We offer a statistical physics framework for continuous-time chain-of-thought reasoning dynamics. We model sentence-level hidden state trajectories as a stochastic dynamical system on a lower-dimensional manifold. This drift-diffusion system uses latent regime switching to capture diverse reasoning phases, including misaligned states or failures. Empirical trajectories (8 models, 7 benchmarks) show a rank-40 projection (balancing variance capture and feasibility) explains ~50% variance. We find four latent reasoning regimes. An SLDS model is formulated and validated to capture these features. The framework enables low-cost reasoning simulation, offering tools to study and predict critical transitions like misaligned states or other LM failures.

View on arXiv

@article{carson2025_2506.04374,
  title={ A Statistical Physics of Language Model Reasoning },
  author={ Jack David Carson and Amir Reisizadeh },
  journal={arXiv preprint arXiv:2506.04374},
  year={ 2025 }
}

Comments on this paper