Investigating the Relationship Between Debiasing and Artifact Removal using Saliency Maps

28 February 2025

Abstract

The widespread adoption of machine learning systems has raised critical concerns about fairness and bias, making mitigating harmful biases essential for AI development. In this paper, we investigate the relationship between debiasing and removing artifacts in neural networks for computer vision tasks. First, we introduce a set of novel XAI-based metrics that analyze saliency maps to assess shifts in a model's decision-making process. Then, we demonstrate that successful debiasing methods systematically redirect model focus away from protected attributes. Finally, we show that techniques originally developed for artifact removal can be effectively repurposed for improving fairness. These findings provide evidence for the existence of a bidirectional connection between ensuring fairness and removing artifacts corresponding to protected attributes.

View on arXiv

@article{sztukiewicz2025_2503.00234,
  title={ Investigating the Relationship Between Debiasing and Artifact Removal using Saliency Maps },
  author={ Lukasz Sztukiewicz and Ignacy Stępka and Michał Wiliński and Jerzy Stefanowski },
  journal={arXiv preprint arXiv:2503.00234},
  year={ 2025 }
}

Comments on this paper