Testing probability distributions using conditional samples

We study a new framework for property testing of probability distributions, by considering distribution testing algorithms that have access to a conditional sampling oracle.* This is an oracle that takes as input a subset of the domain of the unknown probability distribution and returns a draw from the conditional probability distribution restricted to . This new model allows considerable flexibility in the design of distribution testing algorithms; in particular, testing algorithms in this model can be adaptive. We study a wide range of natural distribution testing problems in this new framework and some of its variants, giving both upper and lower bounds on query complexity. These problems include testing whether is the uniform distribution ; testing whether for an explicitly provided ; testing whether two unknown distributions and are equivalent; and estimating the variation distance between and the uniform distribution. At a high level our main finding is that the new "conditional sampling" framework we consider is a powerful one: while all the problems mentioned above have sample complexity in the standard model (and in some cases the complexity must be almost linear in ), we give -query algorithms (and in some cases -query algorithms independent of ) for all these problems in our conditional sampling setting. *Independently from our work, Chakraborty et al. also considered this framework. We discuss their work in Subsection [1.4].
View on arXiv