21
47

Linear Regression with an Unknown Permutation: Statistical and Computational Limits

Abstract

Consider a noisy linear observation model with an unknown permutation, based on observing y=ΠAx+wy = \Pi^* A x^* + w, where xRdx^* \in \mathbb{R}^d is an unknown vector, Π\Pi^* is an unknown n×nn \times n permutation matrix, and wRnw \in \mathbb{R}^n is additive Gaussian noise. We analyze the problem of permutation recovery in a random design setting in which the entries of the matrix AA are drawn i.i.d. from a standard Gaussian distribution, and establish sharp conditions on the SNR, sample size nn, and dimension dd under which Π\Pi^* is exactly and approximately recoverable. On the computational front, we show that the maximum likelihood estimate of Π\Pi^* is NP-hard to compute, while also providing a polynomial time algorithm when d=1d =1.

View on arXiv
Comments on this paper