205

Distributed Mean Estimation with Optimal Error Bounds

Abstract

Motivated by applications to distributed optimization and machine learning, we consider the distributed mean estimation problem, in which nn nodes are each assigned a multi-dimensional input vector, and must cooperate to estimate the mean of the input vectors, while minimizing communication. In this paper, we provide the first tight bounds for this problem, in terms of the trade-off between the amount of communication between nodes and the variance of the node estimates relative to the true value of the mean.

View on arXiv
Comments on this paper