Subgroups Matter for Robust Bias Mitigation

27 May 2025

A. Alloula

Main: 9 pages, Bibliography: 4 pages, Appendix: 9 pages, 14 figures, 7 tables

Abstract

Despite the constant development of new bias mitigation methods for machine learning, no method consistently succeeds, and a fundamental question remains unanswered: when and why do bias mitigation techniques fail? In this paper, we hypothesise that a key factor may be the often-overlooked but crucial step shared by many bias mitigation methods: the definition of subgroups. To investigate this, we conduct a comprehensive evaluation of state-of-the-art bias mitigation methods across multiple vision and language classification tasks, systematically varying subgroup definitions, including coarse, fine-grained, intersectional, and noisy subgroups. Our results reveal that subgroup choice significantly impacts performance, with certain groupings paradoxically leading to worse outcomes than no mitigation at all. Our findings suggest that observing a disparity between a set of subgroups is not a sufficient reason to use those subgroups for mitigation. Through theoretical analysis, we explain these phenomena and uncover a counter-intuitive insight: in some cases, improving fairness with respect to a particular set of subgroups is best achieved by using a different set of subgroups for mitigation. Our work highlights the importance of careful subgroup definition in bias mitigation and suggests it as an alternative lever for improving the robustness and fairness of machine learning models.

@article{alloula2025_2505.21363,
  title={Subgroups Matter for Robust Bias Mitigation},
  author={Anissa Alloula and Charles Jones and Ben Glocker and Bartłomiej W. Papież},
  journal={arXiv preprint arXiv:2505.21363},
  year={2025}
}
