ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.11209
41
0

Clustering Items through Bandit Feedback: Finding the Right Feature out of Many

14 March 2025
Maximilian Graf
Victor Thuot
Nicolas Verzélen
ArXivPDFHTML
Abstract

We study the problem of clustering a set of items based on bandit feedback. Each of the nnn items is characterized by a feature vector, with a possibly large dimension ddd. The items are partitioned into two unknown groups such that items within the same group share the same feature vector. We consider a sequential and adaptive setting in which, at each round, the learner selects one item and one feature, then observes a noisy evaluation of the item's feature. The learner's objective is to recover the correct partition of the items, while keeping the number of observations as small as possible. We provide an algorithm which relies on finding a relevant feature for the clustering task, leveraging the Sequential Halving algorithm. With probability at least 1−δ1-\delta1−δ, we obtain an accurate recovery of the partition and derive an upper bound on the budget required. Furthermore, we derive an instance-dependent lower bound, which is tight in some relevant cases.

View on arXiv
@article{graf2025_2503.11209,
  title={ Clustering Items through Bandit Feedback: Finding the Right Feature out of Many },
  author={ Maximilian Graf and Victor Thuot and Nicolas Verzelen },
  journal={arXiv preprint arXiv:2503.11209},
  year={ 2025 }
}
Comments on this paper