Sparse Matrix Multiplication in the Low-Bandwidth Model

We study matrix multiplication in the low-bandwidth model: There are computers, and we need to compute the product of two matrices. Initially computer knows row of each input matrix. In one communication round each computer can send and receive one -bit message. Eventually computer has to output row of the product matrix. We seek to understand the complexity of this problem in the uniformly sparse case: each row and column of each input matrix has at most non-zeros and in the product matrix we only need to know the values of at most elements in each row or column. This is exactly the setting that we have, e.g., when we apply matrix multiplication for triangle detection in graphs of maximum degree . We focus on the supported setting: the structure of the matrices is known in advance; only the numerical values of nonzero elements are unknown. There is a trivial algorithm that solves the problem in rounds, but for a large , better algorithms are known to exist; in the moderately dense regime the problem can be solved in communication rounds, and for very large , the dominant solution is the fast matrix multiplication algorithm using communication rounds (for matrix multiplication over fields and rings supporting fast matrix multiplication). In this work we show that it is possible to overcome quadratic barrier for all values of : we present an algorithm that solves the problem in rounds for fields and rings supporting fast matrix multiplication and rounds for semirings, independent of .
View on arXiv