30
0

Recover Experimental Data with Selection Bias using Counterfactual Logic

Main:9 Pages
12 Figures
Bibliography:2 Pages
5 Tables
Appendix:10 Pages
Abstract

Selection bias, arising from the systematic inclusion or exclusion of certain samples, poses a significant challenge to the validity of causal inference. While Bareinboim et al. introduced methods for recovering unbiased observational and interventional distributions from biased data using partial external information, the complexity of the backdoor adjustment and the method's strong reliance on observational data limit its applicability in many practical settings. In this paper, we formally discover the recoverability of P(Yx)P(Y^*_{x^*}) under selection bias with experimental data. By explicitly constructing counterfactual worlds via Structural Causal Models (SCMs), we analyze how selection mechanisms in the observational world propagate to the counterfactual domain. We derive a complete set of graphical and theoretical criteria to determine that the experimental distribution remain unaffected by selection bias. Furthermore, we propose principled methods for leveraging partially unbiased observational data to recover P(Yx)P(Y^*_{x^*}) from biased experimental datasets. Simulation studies replicating realistic research scenarios demonstrate the practical utility of our approach, offering concrete guidance for mitigating selection bias in applied causal inference.

View on arXiv
@article{he2025_2506.00335,
  title={ Recover Experimental Data with Selection Bias using Counterfactual Logic },
  author={ Jingyang He and Shuai Wang and Ang Li },
  journal={arXiv preprint arXiv:2506.00335},
  year={ 2025 }
}
Comments on this paper