17
11

Unlinked monotone regression

Abstract

We consider so-called univariate unlinked (sometimes ``decoupled,'' or ``shuffled'') regression when the unknown regression curve is monotone. In standard monotone regression, one observes a pair (X,Y)(X,Y) where a response YY is linked to a covariate XX through the model Y=m0(X)+ϵY= m_0(X) + \epsilon, with m0m_0 the (unknown) monotone regression function and ϵ\epsilon the unobserved error (assumed to be independent of XX). In the unlinked regression setting one gets only to observe a vector of realizations from both the response YY and from the covariate XX where now Y=dm0(X)+ϵY \stackrel{d}{=} m_0(X) + \epsilon. There is no (observed) pairing of XX and YY. Despite this, it is actually still possible to derive a consistent non-parametric estimator of m0m_0 under the assumption of monotonicity of m0m_0 and knowledge of the distribution of the noise ϵ\epsilon. In this paper, we establish an upper bound on the rate of convergence of such an estimator under minimal assumption on the distribution of the covariate XX. We discuss extensions to the case in which the distribution of the noise is unknown. We develop a second order algorithm for its computation, and we demonstrate its use on synthetic data. Finally, we apply our method (in a fully data driven way, without knowledge of the error distribution) on longitudinal data from the US Consumer Expenditure Survey.

View on arXiv
Comments on this paper