Metricizing the Euclidean Space towards Desired Distance Relations in Point Clouds

Abstract

Given a set of points in the Euclidean space $\mathbb{R}^\ell$ with $\ell>1$, the pairwise distances between the points are determined by their spatial location and the metric $d$ that we endow $\mathbb{R}^\ell$ with. Hence, the distance $d(\mathbf x,\mathbf y)=\delta$ between two points is fixed by the choice of $\mathbf x$, $\mathbf y$, and $d$. We study the related problem of fixing the value $\delta$ and the points $\mathbf x,\mathbf y$, and ask whether there is a topological metric $d$ that computes the desired distance $\delta$. We demonstrate that this problem is solvable by constructing a metric that simultaneously gives desired pairwise distances between up to $O(\sqrt{\ell})$ many points in $\mathbb{R}^\ell$. We then introduce the notion of an $\varepsilon$-semimetric $\tilde{d}$ to formulate our main result: for all $\varepsilon>0$, for all $m\geq 1$, for any choice of $m$ points $\mathbf y_1,\ldots,\mathbf y_m\in\mathbb{R}^\ell$, and for all chosen sets of values $\{\delta_{ij}\geq 0: 1\leq i<j\leq m\}$, there exists an $\varepsilon$-semimetric $\tilde{d}:\mathbb{R}^\ell\times \mathbb{R}^\ell\to\mathbb{R}$ such that $\tilde{d}(\mathbf y_i,\mathbf y_j)=\delta_{ij}$, i.e., the desired distances are accomplished irrespective of the topology that the Euclidean or other norms would induce. We showcase our results by using them to attack unsupervised learning algorithms, specifically $k$-Means and density-based (DBSCAN) clustering algorithms. These have manifold applications in artificial intelligence, and running them with externally provided distance measures constructed in the way shown here can make clustering algorithms produce results that are pre-determined and hence malleable. This demonstrates that the results of clustering algorithms may not generally be trustworthy unless there is a standardized and fixed prescription to use a specific distance function.
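The effect described in the abstract can be illustrated with a small sketch. It is not the paper's construction: instead of an $\varepsilon$-semimetric defined on all of $\mathbb{R}^\ell$, it uses a hypothetical lookup table `delta` of prescribed distances on four sample points, and a simplified threshold clustering (connected components of the graph linking points at distance at most `eps`) as a stand-in for density-based clustering. All names (`threshold_clusters`, `prescribed`, `target`) are assumptions made for this illustration.

```python
from itertools import combinations


def euclidean(p, q):
    """Ordinary Euclidean distance between two coordinate tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q)) ** 0.5


def threshold_clusters(n, dist, eps):
    """Label indices 0..n-1 by connected component of the graph that
    links i and j whenever dist(i, j) <= eps (a simplified stand-in
    for density-based clustering, not DBSCAN itself)."""
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and dist(i, j) <= eps:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels


# Two visually obvious groups: points 0, 1 near the origin; 2, 3 near (5, 5).
points = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)]

# Hypothetical grouping chosen in advance: {0, 2} and {1, 3}.
target = [0, 1, 0, 1]

# Prescribed pairwise distances delta_ij: small inside a target group,
# large across groups. This is only a table on the sample points.
delta = {
    (i, j): (0.5 if target[i] == target[j] else 10.0)
    for i, j in combinations(range(len(points)), 2)
}


def prescribed(i, j):
    """Externally supplied distance, read from the table above."""
    if i == j:
        return 0.0
    return delta[(min(i, j), max(i, j))]


honest = threshold_clusters(
    len(points), lambda i, j: euclidean(points[i], points[j]), eps=1.0
)
forced = threshold_clusters(len(points), prescribed, eps=1.0)

print("Euclidean distance: ", honest)
print("Prescribed distance:", forced)
```

With the Euclidean distance the clustering follows the spatial layout, while the prescribed distances reproduce the grouping in `target` on the same points. The paper's contribution, as stated in the abstract, is that such prescribed values can be realized by a distance function on the whole space rather than by a table on the sample.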
