All Papers
Title |
|---|
Title |
|---|

This paper investigates what can be inferred about an arbitrary continuous probability distribution from a finite sample of observations drawn from it. The central finding is that the sorted sample points partition the real line into segments, each carrying an expected probability mass of exactly . This non-parametric result, which follows from fundamental properties of order statistics, holds regardless of the underlying distribution's shape. This equal-probability partition yields a discrete entropy of bits, which quantifies the information gained from the sample and contrasts with Shannon's results for continuous variables. I compare this partition-based framework to the conventional ECDF and discuss its implications for robust non-parametric inference, particularly in density and tail estimation.
View on arXiv