
Domain-Constrained Diffusion Models to Synthesize Tabular Data: A Case Study in Power Systems

Main: 6 pages
7 figures
Bibliography: 2 pages
Appendix: 1 page
Abstract

Growing concerns over privacy, security, and legal barriers are driving demand for synthetic data across domains such as healthcare, finance, and energy. While generative models offer a promising way to overcome these barriers, their utility depends on incorporating domain-specific knowledge. We propose to synthesize data using a guided diffusion model that integrates domain constraints directly into the generative process. We develop the model in the context of power systems, with potential applicability to other domains that involve tabular data. Specifically, we synthesize statistically representative, high-fidelity power flow datasets. To satisfy domain constraints, e.g., Kirchhoff's laws, we introduce gradient-based guidance that steers the sampling trajectory in a feasible direction. Numerical results demonstrate the effectiveness of our approach.
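To make the idea of gradient-based guidance concrete, the following is a minimal sketch and not the authors' implementation. It assumes a toy constraint (nodal power injections of a lossless network must sum to zero) and a hypothetical fixed step size; the function names are illustrative only. Each update moves a sample along the negative gradient of the squared constraint violation. In an actual guided diffusion sampler, a correction of this kind would presumably be interleaved with the denoising steps rather than applied on its own.

```python
def balance_residual(p):
    """Toy constraint violation: injections in a lossless network should sum to zero."""
    return sum(p)


def guidance_gradient(p):
    """Gradient of 0.5 * balance_residual(p)**2 with respect to each entry of p."""
    r = balance_residual(p)
    return [r for _ in p]


def guided_update(p, step_size):
    """One gradient step that reduces the constraint violation."""
    g = guidance_gradient(p)
    return [pi - step_size * gi for pi, gi in zip(p, g)]


def steer(p, step_size=0.1, n_steps=50):
    """Repeat the guided update (hypothetical step size and step count)."""
    for _ in range(n_steps):
        p = guided_update(p, step_size)
    return p


# Hypothetical sample whose injections do not balance (they sum to 1.4).
sample = [1.0, -0.3, 0.5, 0.2]
steered = steer(sample)
```

Because the gradient is the same for every entry, the update shifts all injections by an equal amount: the imbalance shrinks while the differences between entries are preserved. The step size must be small relative to the number of entries for the iteration to converge (here, 0.1 with four entries).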

@article{hoseinpour2025_2506.11281,
  title={Domain-Constrained Diffusion Models to Synthesize Tabular Data: A Case Study in Power Systems},
  author={Milad Hoseinpour and Vladimir Dvorkin},
  journal={arXiv preprint arXiv:2506.11281},
  year={2025}
}