ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.01318
53
0
v1v2 (latest)

Unlearning's Blind Spots: Over-Unlearning and Prototypical Relearning Attack

2 June 2025
SeungBum Ha
Saerom Park
Sung Whan Yoon
    MUCLL
ArXiv (abs)PDFHTML
Main:9 Pages
4 Figures
Bibliography:2 Pages
5 Tables
Appendix:4 Pages
Abstract

Machine unlearning (MU) aims to expunge a designated forget set from a trained model without costly retraining, yet the existing techniques overlook two critical blind spots: "over-unlearning" that deteriorates retained data near the forget set, and post-hoc "relearning" attacks that aim to resurrect the forgotten knowledge. We first derive the over-unlearning metric OU@{\epsilon}, which represents the collateral damage to the nearby region of the forget set, where the over-unlearning mainly appears. Next, we expose an unforeseen relearning threat on MU, i.e., the Prototypical Relearning Attack, which exploits the per-class prototype of the forget class with just a few samples, and easily restores the pre-unlearning performance. To counter both blind spots, we introduce Spotter, a plug-and-play objective that combines (i) a masked knowledge-distillation penalty on the nearby region of forget set to suppress OU@{\epsilon}, and (ii) an intra-class dispersion loss that scatters forget-class embeddings, neutralizing prototypical relearning attacks. On CIFAR-10, as one of validations, Spotter reduces OU@{\epsilon}by below the 0.05X of the baseline, drives forget accuracy to 0%, preserves accuracy of the retain set within 1% of difference with the original, and denies the prototype-attack by keeping the forget set accuracy within <1%, without accessing retained data. It confirms that Spotter is a practical remedy of the unlearning's blind spots.

View on arXiv
@article{ha2025_2506.01318,
  title={ Unlearning's Blind Spots: Over-Unlearning and Prototypical Relearning Attack },
  author={ SeungBum Ha and Saerom Park and Sung Whan Yoon },
  journal={arXiv preprint arXiv:2506.01318},
  year={ 2025 }
}
Comments on this paper