ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.02640
8
21

Near Neighbor: Who is the Fairest of Them All?

6 June 2019
Sariel Har-Peled
S. Mahabadi
ArXivPDFHTML
Abstract

\newcommand{\ball}{\mathbb{B}}\newcommand{\dsQ}{{\mathcal{Q}}}\newcommand{\dsS}{{\mathcal{S}}}In this work we study a fair variant of the near neighbor problem. Namely, given a set of nnn points PPP and a parameter rrr, the goal is to preprocess the points, such that given a query point qqq, any point in the rrr-neighborhood of the query, i.e., \ball(q,r)\ball(q,r)\ball(q,r), have the same probability of being reported as the near neighbor. We show that LSH based algorithms can be made fair, without a significant loss in efficiency. Specifically, we show an algorithm that reports a point in the rrr-neighborhood of a query qqq with almost uniform probability. The query time is proportional to O(dns(q.r)\dsQ(n,c))O\bigl( \mathrm{dns}(q.r) \dsQ(n,c) \bigr)O(dns(q.r)\dsQ(n,c)), and its space is O(\dsS(n,c))O(\dsS(n,c))O(\dsS(n,c)), where \dsQ(n,c)\dsQ(n,c)\dsQ(n,c) and \dsS(n,c)\dsS(n,c)\dsS(n,c) are the query time and space of an LSH algorithm for ccc-approximate near neighbor, and dns(q,r)\mathrm{dns}(q,r)dns(q,r) is a function of the local density around qqq. Our approach works more generally for sampling uniformly from a sub-collection of sets of a given collection and can be used in a few other applications. Finally, we run experiments to show performance of our approach on real data.

View on arXiv
Comments on this paper