Towards Certification of Uncertainty Calibration under Adversarial Attacks

22 May 2024
Cornelius Emde
Francesco Pinto
Thomas Lukasiewicz
Philip H. S. Torr
Adel Bibi
Abstract

Since neural classifiers are known to be sensitive to adversarial perturbations that alter their accuracy, certification methods have been developed to provide provable guarantees on the insensitivity of their predictions to such perturbations. Furthermore, in safety-critical applications, the frequentist interpretation of the confidence of a classifier (also known as model calibration) can be of utmost importance. This property can be measured via the Brier score or the expected calibration error. We show that attacks can significantly harm calibration, and thus propose certified calibration as worst-case bounds on calibration under adversarial perturbations. Specifically, we produce analytic bounds for the Brier score and approximate bounds on the expected calibration error via the solution of a mixed-integer program. Finally, we propose novel calibration attacks and demonstrate how they can improve model calibration through adversarial calibration training.
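The two calibration metrics named in the abstract are standard. As a point of reference only (this is not the paper's certification procedure), a minimal sketch of how the Brier score and a binned expected calibration error are commonly computed is given below; the bin count and equal-width binning scheme are illustrative assumptions.

# Minimal sketch (not from the paper): multi-class Brier score and a
# binned expected calibration error (ECE). Binning choices are assumptions.
import numpy as np

def brier_score(probs: np.ndarray, labels: np.ndarray) -> float:
    # Mean squared error between predicted probabilities and one-hot labels.
    n, k = probs.shape
    onehot = np.eye(k)[labels]
    return float(np.mean(np.sum((probs - onehot) ** 2, axis=1)))

def expected_calibration_error(probs: np.ndarray, labels: np.ndarray, n_bins: int = 10) -> float:
    # Weighted average gap between mean confidence and accuracy per confidence bin.
    confidences = probs.max(axis=1)
    predictions = probs.argmax(axis=1)
    accuracies = (predictions == labels).astype(float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            ece += mask.mean() * abs(confidences[mask].mean() - accuracies[mask].mean())
    return float(ece)

# Usage example: three samples, two classes.
probs = np.array([[0.9, 0.1], [0.6, 0.4], [0.2, 0.8]])
labels = np.array([0, 1, 1])
print(brier_score(probs, labels), expected_calibration_error(probs, labels))

The paper's contribution is to bound the worst case of such metrics under adversarial perturbations (analytically for the Brier score, via a mixed-integer program for the ECE); the sketch above only shows the unperturbed metrics themselves.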

@article{emde2025_2405.13922,
  title={Towards Certification of Uncertainty Calibration under Adversarial Attacks},
  author={Cornelius Emde and Francesco Pinto and Thomas Lukasiewicz and Philip H. S. Torr and Adel Bibi},
  journal={arXiv preprint arXiv:2405.13922},
  year={2025}
}