42
38

On Computing Probabilistic Explanations for Decision Trees

Abstract

Formal XAI (explainable AI) is a growing area that focuses on computing explanations with mathematical guarantees for the decisions made by ML models. Inside formal XAI, one of the most studied cases is that of explaining the choices taken by decision trees, as they are traditionally deemed as one of the most interpretable classes of models. Recent work has focused on studying the computation of "sufficient reasons", a kind of explanation in which given a decision tree TT and an instance xx, one explains the decision T(x)T(x) by providing a subset yy of the features of xx such that for any other instance zz compatible with yy, it holds that T(z)=T(x)T(z) = T(x), intuitively meaning that the features in yy are already enough to fully justify the classification of xx by TT. It has been argued, however, that sufficient reasons constitute a restrictive notion of explanation, and thus the community has started to study their probabilistic counterpart, in which one requires that the probability of T(z)=T(x)T(z) = T(x) must be at least some value δ(0,1]\delta \in (0, 1], where zz is a random instance that is compatible with yy. Our paper settles the computational complexity of δ\delta-sufficient-reasons over decision trees, showing that both (1) finding δ\delta-sufficient-reasons that are minimal in size, and (2) finding δ\delta-sufficient-reasons that are minimal inclusion-wise, do not admit polynomial-time algorithms (unless P=NP). This is in stark contrast with the deterministic case (δ=1\delta = 1) where inclusion-wise minimal sufficient-reasons are easy to compute. By doing this, we answer two open problems originally raised by Izza et al. On the positive side, we identify structural restrictions of decision trees that make the problem tractable, and show how SAT solvers might be able to tackle these problems in practical settings.

View on arXiv
Comments on this paper