59
0
v1v2 (latest)

Missing Data Imputation by Reducing Mutual Information with Rectified Flows

Main:9 Pages
10 Figures
Bibliography:3 Pages
4 Tables
Appendix:8 Pages
Abstract

This paper introduces a novel iterative method for missing data imputation that sequentially reduces the mutual information between data and their corresponding missing mask. Inspired by GAN-based approaches, which train generators to decrease the predictability of missingness patterns, our method explicitly targets the reduction of mutual information. Specifically, our algorithm iteratively minimizes the KL divergence between the joint distribution of the imputed data and missing mask, and the product of their marginals from the previous iteration. We show that the optimal imputation under this framework corresponds to solving an ODE, whose velocity field minimizes a rectified flow training objective. We further illustrate that some existing imputation techniques can be interpreted as approximate special cases of our mutual-information-reducing framework. Comprehensive experiments on synthetic and real-world datasets validate the efficacy of our proposed approach, demonstrating superior imputation performance.

View on arXiv
@article{yu2025_2505.11749,
  title={ Missing Data Imputation by Reducing Mutual Information with Rectified Flows },
  author={ Jiahao Yu and Qizhen Ying and Leyang Wang and Ziyue Jiang and Song Liu },
  journal={arXiv preprint arXiv:2505.11749},
  year={ 2025 }
}
Comments on this paper

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.