270

Nonparametric testing of conditional independence by means of the partial copula

Abstract

We propose a new method to test conditional independence of two real random variables YY and ZZ conditionally on an arbitrary third random variable XX. %with F..F_{.|.} representing conditional distribution functions, The partial copula is introduced, defined as the joint distribution of U=FYX(YX)U=F_{Y|X}(Y|X) and V=FZX(ZX)V=F_{Z|X}(Z|X). We call this transformation of (Y,Z)(Y,Z) into (U,V)(U,V) the partial copula transform. It is easy to show that if YY and ZZ are continuous for any given value of XX, then Y\indZXY\ind Z|X implies U\indVU\ind V. Conditional independence can then be tested by (i) applying the partial copula transform to the data points and (ii) applying a test of ordinary independence to the transformed data. In practice, FYXF_{Y|X} and FZXF_{Z|X} will need to be estimated, which can be done by, e.g., standard kernel methods. We show that under easily satisfied conditions, and for a very large class of test statistics for independence which includes the covariance, Kendall's tau, and Hoeffding's test statistic, the effect of this estimation vanishes asymptotically. Thus, for large samples, the estimation can be ignored and we have a simple method which can be used to apply a wide range of tests of independence, including ones with consistency for arbitrary alternatives, to test for conditional independence. A simulation study indicates good small sample performance. Advantages of the partial copula approach compared to competitors seem to be simplicity and generality.

View on arXiv
Comments on this paper