ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.14277
14
0

How many moments does MMD compare?

27 June 2021
Rustem Takhanov
ArXivPDFHTML
Abstract

We present a new way of study of Mercer kernels, by corresponding to a special kernel KKK a pseudo-differential operator p(x,D)p({\mathbf x}, D)p(x,D) such that Fp(x,D)†p(x,D)F−1\mathcal{F} p({\mathbf x}, D)^\dag p({\mathbf x}, D) \mathcal{F}^{-1}Fp(x,D)†p(x,D)F−1 acts on smooth functions in the same way as an integral operator associated with KKK (where F\mathcal{F}F is the Fourier transform). We show that kernels defined by pseudo-differential operators are able to approximate uniformly any continuous Mercer kernel on a compact set. The symbol p(x,y)p({\mathbf x}, {\mathbf y})p(x,y) encapsulates a lot of useful information about the structure of the Maximum Mean Discrepancy distance defined by the kernel KKK. We approximate p(x,y)p({\mathbf x}, {\mathbf y})p(x,y) with the sum of the first rrr terms of the Singular Value Decomposition of ppp, denoted by pr(x,y)p_r({\mathbf x}, {\mathbf y})pr​(x,y). If ordered singular values of the integral operator associated with p(x,y)p({\mathbf x}, {\mathbf y})p(x,y) die down rapidly, the MMD distance defined by the new symbol prp_rpr​ differs from the initial one only slightly. Moreover, the new MMD distance can be interpreted as an aggregated result of comparing rrr local moments of two probability distributions. The latter results holds under the condition that right singular vectors of the integral operator associated with ppp are uniformly bounded. But even if this is not satisfied we can still hold that the Hilbert-Schmidt distance between ppp and prp_rpr​ vanishes. Thus, we report an interesting phenomenon: the MMD distance measures the difference of two probability distributions with respect to a certain number of local moments, r∗r^\astr∗, and this number r∗r^\astr∗ depends on the speed with which singular values of ppp die down.

View on arXiv
Comments on this paper