Counterfactual image generation aims to simulate realistic visual outcomes under specific causal interventions. Diffusion models have recently emerged as a powerful tool for this task, combining DDIM inversion with conditional generation via classifier-free guidance (CFG). However, standard CFG applies a single global weight across all conditioning variables, which can lead to poor identity preservation and spurious attribute changes, a phenomenon known as attribute amplification. To address this, we propose Decoupled Classifier-Free Guidance (DCFG), a flexible and model-agnostic framework that introduces group-wise conditioning control. DCFG builds on an attribute-split embedding strategy that disentangles semantic inputs, enabling selective guidance on user-defined attribute groups. For counterfactual generation, we partition attributes into intervened and invariant sets based on a causal graph and apply distinct guidance to each. Experiments on CelebA-HQ, MIMIC-CXR, and EMBED show that DCFG improves intervention fidelity, mitigates unintended changes, and enhances reversibility, enabling more faithful and interpretable counterfactual image generation.
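To make the group-wise guidance idea concrete, below is a minimal PyTorch sketch assuming a per-group decomposition of the form eps_hat = eps(x_t, null) + sum_i w_i * (eps(x_t, c_i) - eps(x_t, null)), where each attribute group c_i is conditioned in isolation while the other groups are held at their null embeddings. The function names, toy denoiser, and embedding shapes are illustrative assumptions, not the authors' implementation.

```python
import torch

def dcfg_eps(eps_model, x_t, t, group_embs, null_embs, weights):
    """Decoupled CFG: one guidance weight per attribute group.

    group_embs: list of conditioning embeddings, one per attribute group
    null_embs:  matching list of "null" (unconditional) embeddings
    weights:    per-group guidance scales, e.g. a high weight for the
                intervened set and a weight near 1 for the invariant set
    """
    # Fully unconditional prediction: every group set to its null token.
    eps_uncond = eps_model(x_t, t, null_embs)

    eps = eps_uncond.clone()
    for i, w_i in enumerate(weights):
        # Condition only on group i; keep all other groups at null.
        embs_i = list(null_embs)
        embs_i[i] = group_embs[i]
        eps_cond_i = eps_model(x_t, t, embs_i)
        # Add this group's guidance direction with its own weight.
        eps = eps + w_i * (eps_cond_i - eps_uncond)
    return eps

if __name__ == "__main__":
    # Hypothetical setup: two groups (intervened vs. invariant attributes).
    x_t = torch.randn(1, 3, 64, 64)
    t = torch.tensor([500])
    group_embs = [torch.randn(1, 16), torch.randn(1, 16)]
    null_embs = [torch.zeros(1, 16), torch.zeros(1, 16)]

    # Stand-in denoiser; a real one would be a conditional U-Net.
    def toy_eps_model(x, t, embs):
        return x * 0.1 + sum(e.sum() for e in embs) * 1e-3

    eps = dcfg_eps(toy_eps_model, x_t, t, group_embs, null_embs,
                   weights=[3.0, 1.0])  # strong guidance on intervened group
    print(eps.shape)  # torch.Size([1, 3, 64, 64])
```

Setting all weights to a single shared value recovers standard CFG, so the decoupled form strictly generalizes it; the choice of weights per group is what lets the intervened attributes be steered strongly while the invariant ones are left close to the identity-preserving unconditional prediction.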
@article{xia2025_2506.14399,
  title   = {Decoupled Classifier-Free Guidance for Counterfactual Diffusion Models},
  author  = {Tian Xia and Fabio De Sousa Ribeiro and Rajat R Rasal and Avinash Kori and Raghav Mehta and Ben Glocker},
  journal = {arXiv preprint arXiv:2506.14399},
  year    = {2025}
}