24
8

On Distribution Testing in the Conditional Sampling Model

Abstract

Recently, there has been significant work studying distribution testing under the Conditional Sampling model. In this model, a query specifies a subset SS of the domain, and the output received is a sample drawn from the distribution conditioned on being in SS. In this paper, we improve query complexity bounds for several classic distribution testing problems in this model. First, we prove that tolerant uniformity testing in the conditional sampling model can be solved using O~(ε2)\tilde{O}(\varepsilon^{-2}) queries, which is optimal and improves upon the O~(ε20)\tilde{O}(\varepsilon^{-20})-query algorithm of Canonne et al. [CRS15]. This bound even holds under a restricted version of the conditional sampling model called the Pair Conditional Sampling model. Next, we prove that tolerant identity testing in the conditional sampling model can be solved in O~(ε4)\tilde{O}(\varepsilon^{-4}) queries, which is the first known bound independent of the support size of the distribution for this problem. Next, we use our algorithm for tolerant uniformity testing to get an O~(ε4)\tilde{O}(\varepsilon^{-4})-query algorithm for monotonicity testing in the conditional sampling model, improving on the O~(ε22)\tilde{O}(\varepsilon^{-22})-query algorithm of Canonne [Can15]. Finally, we study (non-tolerant) identity testing under the pair conditional sampling model, and provide a tight bound of Θ~(logNε2)\tilde{\Theta}(\sqrt{\log N} \cdot \varepsilon^{-2}) for the query complexity, where the domain of the distribution has size NN. This improves upon both the known upper and lower bounds in [CRS15].

View on arXiv
Comments on this paper