Balancing Gaussian vectors in high dimension

Motivated by problems in controlled experiments, we study the discrepancy of random matrices with continuous entries where the number of columns is much larger than the number of rows . Our first result shows that if , a matrix with i.i.d. standard Gaussian entries has discrepancy with high probability. This provides sharp guarantees for Gaussian discrepancy in a regime that had not been considered before in the existing literature. Our results also apply to a more general family of random matrices with continuous i.i.d. entries, assuming that . The proof is non-constructive and is an application of the second moment method. Our second result is algorithmic and applies to random matrices whose entries are i.i.d. and have a Lipschitz density. We present a randomized polynomial-time algorithm that achieves discrepancy with high probability, provided that . In the one-dimensional case, this matches the best known algorithmic guarantees due to Karmarkar--Karp. For higher dimensions , this establishes the first efficient algorithm achieving discrepancy smaller than .
View on arXiv