32

New Algorithmic Directions in Optimal Transport and Applications for Product Spaces

Main:1 Pages
Appendix:33 Pages
Abstract

We study optimal transport between two high-dimensional distributions μ,ν\mu,\nu in RnR^n from an algorithmic perspective: given xμx \sim \mu, find a close yνy \sim \nu in poly(n)poly(n) time, where nn is the dimension of x,yx,y. Thus, running time depends on the dimension rather than the full representation size of μ,ν\mu,\nu. Our main result is a general algorithm for transporting any product distribution μ\mu to any ν\nu with cost Δ+δ\Delta + \delta under pp\ell_p^p, where Δ\Delta is the Knothe-Rosenblatt transport cost and δ\delta is a computational error decreasing with runtime. This requires ν\nu to be "sequentially samplable" with bounded average sampling cost, a new but natural notion.We further prove:An algorithmic version of Talagrand's inequality for transporting the standard Gaussian Φn\Phi^n to arbitrary ν\nu under squared Euclidean cost. For ν=Φn\nu = \Phi^n conditioned on a set S\mathcal{S} of measure ε\varepsilon, we construct the sequential sampler in expected time poly(n/ε)poly(n/\varepsilon) using membership oracle access to S\mathcal{S}. This yields an algorithmic transport from Φn\Phi^n to ΦnS\Phi^n|\mathcal{S} in poly(n/ε)poly(n/\varepsilon) time and expected squared distance O(log1/ε)O(\log 1/\varepsilon), optimal for general S\mathcal{S} of measure ε\varepsilon.As corollary, we obtain the first computational concentration result (Etesami et al. SODA 2020) for Gaussian measure under Euclidean distance with dimension-independent transportation cost, resolving an open question of Etesami et al. Specifically, for any S\mathcal{S} of Gaussian measure ε\varepsilon, most Φn\Phi^n samples can be mapped to S\mathcal{S} within distance O(log1/ε)O(\sqrt{\log 1/\varepsilon}) in poly(n/ε)poly(n/\varepsilon) time.

View on arXiv
Comments on this paper