ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.05376
79
8

Optimal Clustering with Noisy Queries via Multi-Armed Bandit

12 July 2022
Jinghui Xia
Zengfeng Huang
ArXivPDFHTML
Abstract

Motivated by many applications, we study clustering with a faulty oracle. In this problem, there are nnn items belonging to kkk unknown clusters, and the algorithm is allowed to ask the oracle whether two items belong to the same cluster or not. However, the answer from the oracle is correct only with probability 12+δ2\frac{1}{2}+\frac{\delta}{2}21​+2δ​. The goal is to recover the hidden clusters with minimum number of noisy queries. Previous works have shown that the problem can be solved with O(nklog⁡nδ2+poly(k,1δ,log⁡n))O(\frac{nk\log n}{\delta^2} + \text{poly}(k,\frac{1}{\delta}, \log n))O(δ2nklogn​+poly(k,δ1​,logn)) queries, while Ω(nkδ2)\Omega(\frac{nk}{\delta^2})Ω(δ2nk​) queries is known to be necessary. So, for any values of kkk and δ\deltaδ, there is still a non-trivial gap between upper and lower bounds. In this work, we obtain the first matching upper and lower bounds for a wide range of parameters. In particular, a new polynomial time algorithm with O(n(k+log⁡n)δ2+poly(k,1δ,log⁡n))O(\frac{n(k+\log n)}{\delta^2} + \text{poly}(k,\frac{1}{\delta}, \log n))O(δ2n(k+logn)​+poly(k,δ1​,logn)) queries is proposed. Moreover, we prove a new lower bound of Ω(nlog⁡nδ2)\Omega(\frac{n\log n}{\delta^2})Ω(δ2nlogn​), which, combined with the existing Ω(nkδ2)\Omega(\frac{nk}{\delta^2})Ω(δ2nk​) bound, matches our upper bound up to an additive poly(k,1δ,log⁡n)\text{poly}(k,\frac{1}{\delta},\log n)poly(k,δ1​,logn) term. To obtain the new results, our main ingredient is an interesting connection between our problem and multi-armed bandit, which might provide useful insights for other similar problems.

View on arXiv
Comments on this paper