ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.07285
14
1

A Scalable Approach to Clustering Embedding Projections

9 April 2025
Donghao Ren
Fred Hohman
Dominik Moritz
ArXivPDFHTML
Abstract

Interactive visualization of embedding projections is a useful technique for understanding data and evaluating machine learning models. Labeling data within these visualizations is critical for interpretation, as labels provide an overview of the projection and guide user navigation. However, most methods for producing labels require clustering the points, which can be computationally expensive as the number of points grows. In this paper, we describe an efficient clustering approach using kernel density estimation in the projected 2D space instead of points. This algorithm can produce high-quality cluster regions from a 2D density map in a few hundred milliseconds, orders of magnitude faster than current approaches. We contribute the design of the algorithm, benchmarks, and applications that demonstrate the utility of the algorithm, including labeling and summarization.

View on arXiv
@article{ren2025_2504.07285,
  title={ A Scalable Approach to Clustering Embedding Projections },
  author={ Donghao Ren and Fred Hohman and Dominik Moritz },
  journal={arXiv preprint arXiv:2504.07285},
  year={ 2025 }
}
Comments on this paper