Multi-class Item Mining under Local Differential Privacy

18 April 2025

Abstract

Item mining, a fundamental task for collecting statistical data from users, has raised increasing privacy concerns. To address these concerns, local differential privacy (LDP) was proposed as a privacy-preserving technique. Existing LDP item mining mechanisms primarily concentrate on global statistics, i.e., those from the entire dataset. Nevertheless, they fall short of user-tailored tasks such as personalized recommendations, whereas classwise statistics can improve task accuracy with fine-grained information. Meanwhile, the introduction of class labels brings new challenges. Label perturbation may result in invalid items for aggregation. To this end, we propose frameworks for multi-class item mining, along with two mechanisms: validity perturbation to reduce the impact of invalid data, and correlated perturbation to preserve the relationship between labels and items. We also apply these optimized methods to two multi-class item mining queries: frequency estimation and top- $k$ item mining. Through theoretical analysis and extensive experiments, we verify the effectiveness and superiority of these methods.

View on arXiv

@article{mao2025_2504.13526,
  title={ Multi-class Item Mining under Local Differential Privacy },
  author={ Yulian Mao and Qingqing Ye and Rong Du and Qi Wang and Kai Huang and Haibo Hu },
  journal={arXiv preprint arXiv:2504.13526},
  year={ 2025 }
}

Comments on this paper