Private Information Retrieval from Storage Constrained Databases -- Coded Caching meets PIR

Private information retrieval (PIR) allows a user to retrieve a desired message out of possible messages from databases without revealing the identity of the desired message. Majority of existing works on PIR assume the presence of replicated databases, each storing all the messages. In this work, we consider the problem of PIR from storage constrained databases. Each database has a storage capacity of bits, where is the number of messages, is the size of each message in bits, and is the normalized storage. In the storage constrained PIR problem, there are two key design questions: a) how to store content across each database under storage constraints; and b) construction of schemes that allow efficient PIR through storage constrained databases. The main contribution of this work is a general achievable scheme for PIR from storage constrained databases for any value of storage. In particular, for any , with normalized storage , where the parameter can take integer values , we show that our proposed PIR scheme achieves a download cost of . The extreme case when (i.e., ) corresponds to the setting of replicated databases with full storage. For this extremal setting, our scheme recovers the information-theoretically optimal download cost characterized by Sun and Jafar as . For the other extreme, when (i.e., ), the proposed scheme achieves a download cost of . The interesting aspect of the result is that for intermediate values of storage, i.e., , the proposed scheme can strictly outperform memory-sharing between extreme values of storage.
View on arXiv