Fair Representation Learning with Controllable High Confidence Guarantees via Adversarial Inference

23 October 2025

Yuhong Luo

Austin Hoag

Xintong Wang

Philip S Thomas

Przemyslaw A. Grabowicz

FaML

ArXiv (abs)PDF HTML Github

Main:10 Pages

14 Figures

Bibliography:5 Pages

2 Tables

Appendix:23 Pages

Abstract

Representation learning is increasingly applied to generate representations that generalize well across multiple downstream tasks. Ensuring fairness guarantees in representation learning is crucial to prevent unfairness toward specific demographic groups in downstream tasks. In this work, we formally introduce the task of learning representations that achieve high-confidence fairness. We aim to guarantee that demographic disparity in every downstream prediction remains bounded by a *user-defined* error threshold $\epsilon$ , with *controllable* high probability. To this end, we propose the ***F**air **R**epresentation learning with high-confidence **G**uarantees (FRG)* framework, which provides these high-confidence fairness guarantees by leveraging an optimized adversarial model. We empirically evaluate FRG on three real-world datasets, comparing its performance to six state-of-the-art fair representation learning methods. Our results demonstrate that FRG consistently bounds unfairness across a range of downstream models and tasks.

View on arXiv

Comments on this paper