ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.02487
26
15

Efficient Algorithms for Generating Provably Near-Optimal Cluster Descriptors for Explainability

6 February 2020
Prathyush Sambaturu
Aparna Gupta
Ian Davidson
S. S. Ravi
A. Vullikanti
A. Warren
ArXivPDFHTML
Abstract

Improving the explainability of the results from machine learning methods has become an important research goal. Here, we study the problem of making clusters more interpretable by extending a recent approach of [Davidson et al., NeurIPS 2018] for constructing succinct representations for clusters. Given a set of objects SSS, a partition π\piπ of SSS (into clusters), and a universe TTT of tags such that each element in SSS is associated with a subset of tags, the goal is to find a representative set of tags for each cluster such that those sets are pairwise-disjoint and the total size of all the representatives is minimized. Since this problem is NP-hard in general, we develop approximation algorithms with provable performance guarantees for the problem. We also show applications to explain clusters from datasets, including clusters of genomic sequences that represent different threat levels.

View on arXiv
Comments on this paper