GICDM: Mitigating Hubness for Reliable Distance-Based Generative Model Evaluation
Nicolas Salvy
Hugues Talbot
Bertrand Thirion
- EGVM
Main:7 Pages
20 Figures
Bibliography:5 Pages
12 Tables
Appendix:18 Pages
Abstract
Generative model evaluation commonly relies on high-dimensional embedding spaces to compute distances between samples. We show that dataset representations in these spaces are affected by the hubness phenomenon, which distorts nearest neighbor relationships and biases distance-based metrics. Building on the classical Iterative Contextual Dissimilarity Measure (ICDM), we introduce Generative ICDM (GICDM), a method to correct neighborhood estimation for both real and generated data. We introduce a multi-scale extension to improve empirical behavior. Extensive experiments on synthetic and real benchmarks demonstrate that GICDM resolves hubness-induced failures, restores reliable metric behavior, and improves alignment with human judgment.
View on arXivComments on this paper
