Classification with High-Dimensional Sparse Samples
- VLM

The task of the binary classification problem is to determine which of two distributions has generated a length- test sequence. The two distributions are unknown; however two training sequences of length , one from each distribution, are observed. The distributions share an alphabet of size , which is significantly larger than and . How does affect the probability of classification error? We characterize the achievable error rate in a high-dimensional setting in which all tend to infinity and . The results are: * There exists an asymptotically consistent classifier if and only if . * The best achievable probability of classification error decays as with (shown by achievability and converse results). * A weighted coincidence-based classifier has a non-zero generalized error exponent . * The -norm based classifier has a zero generalized error exponent.
View on arXiv