15
7

Error Bound of Empirical 2\ell_2 Risk Minimization for Noisy Standard and Generalized Phase Retrieval Problems

Abstract

In this paper, we study the estimation performance of empirical 2\ell_2 risk minimization (ERM) in noisy (standard) phase retrieval (NPR) given by yk=αkx02+ηky_k = |\alpha_k^*x_0|^2+\eta_k, or noisy generalized phase retrieval (NGPR) formulated as yk=x0Akx0+ηky_k = x_0^*A_kx_0 + \eta_k, where x0Kdx_0\in\mathbb{K}^d is the desired signal, nn is the sample size, η=(η1,...,ηn)\eta= (\eta_1,...,\eta_n)^\top is the noise vector. We establish new error bounds under different noise patterns, and our proofs are valid for both K=R\mathbb{K}=\mathbb{R} and K=C\mathbb{K}=\mathbb{C}. In NPR under arbitrary noise vector η\eta, we derive a new error bound O(ηdn+1ηn)O\big(\|\eta\|_\infty\sqrt{\frac{d}{n}} + \frac{|\mathbf{1}^\top\eta|}{n}\big), which is tighter than the currently known one O(ηn)O\big(\frac{\|\eta\|}{\sqrt{n}}\big) in many cases. In NGPR, we show O(ηdn)O\big(\|\eta\|\frac{\sqrt{d}}{n}\big) for arbitrary η\eta. In both problems, the bounds for arbitrary noise immediately give rise to O~(dn)\tilde{O}(\sqrt{\frac{d}{n}}) for sub-Gaussian or sub-exponential random noise, with some conventional but inessential assumptions (e.g., independent or zero-mean condition) removed or weakened. In addition, we make a first attempt to ERM under heavy-tailed random noise assumed to have bounded ll-th moment. To achieve a trade-off between bias and variance, we truncate the responses and propose a corresponding robust ERM estimator, which is shown to possess the guarantee O~([dn]11/l)\tilde{O}\big(\big[\sqrt{\frac{d}{n}}\big]^{1-1/l}\big) in both NPR, NGPR. All the error bounds straightforwardly extend to the more general problems of rank-rr matrix recovery, and these results deliver a conclusion that the full-rank frame {Ak}k=1n\{A_k\}_{k=1}^n in NGPR is more robust to biased noise than the rank-1 frame {αkαk}k=1n\{\alpha_k\alpha_k^*\}_{k=1}^n in NPR. Extensive experimental results are presented to illustrate our theoretical findings.

View on arXiv
Comments on this paper