42
0

Wasserstein Flow Matching: Generative modeling over families of distributions

Abstract

Generative modeling typically concerns transporting a single source distribution to a target distribution via simple probability flows. However, in fields like computer graphics and single-cell genomics, samples themselves can be viewed as distributions, where standard flow matching ignores their inherent geometry. We propose Wasserstein flow matching (WFM), which lifts flow matching onto families of distributions using the Wasserstein geometry. Notably, WFM is the first algorithm capable of generating distributions in high dimensions, whether represented analytically (as Gaussians) or empirically (as point-clouds). Our theoretical analysis establishes that Wasserstein geodesics constitute proper conditional flows over the space of distributions, making for a valid FM objective. Our algorithm leverages optimal transport theory and the attention mechanism, demonstrating versatility across computational regimes: exploiting closed-form optimal transport paths for Gaussian families, while using entropic estimates on point-clouds for general distributions. WFM successfully generates both 2D & 3D shapes and high-dimensional cellular microenvironments from spatial transcriptomics data. Code is available atthis https URL.

View on arXiv
@article{haviv2025_2411.00698,
  title={ Wasserstein Flow Matching: Generative modeling over families of distributions },
  author={ Doron Haviv and Aram-Alexandre Pooladian and Dana Peér and Brandon Amos },
  journal={arXiv preprint arXiv:2411.00698},
  year={ 2025 }
}
Comments on this paper