In cryptography, a private information retrieval (PIR) protocolis a protocol that allows a user to retrieve an item from a server in possession of a database without revealing which item is retrieved. PIR is a weaker version of 1-out-of-n oblivious transfer, where it is also required that the user should not get information about other database items.
One trivial, but very inefficient way to achieve PIR is for the server to send an entire copy of the database to the user. In fact, this is the only possible protocol (in the classical or the quantum setting[1]) that gives the user information theoretic privacy for their query in a single-server setting.[2] There are two ways to address this problem: one is to make the server computationally bounded and the other is to assume that there are multiple non-cooperating servers, each having a copy of the database.
The problem was introduced in 1995 by Chor, Goldreich, Kushilevitz and Sudan[2] in the information-theoretic setting and in 1997 by Kushilevitz and Ostrovsky in the computational setting.[3] Since then, very efficient solutions have been discovered. Single database (computationally private) PIR can be achieved with constant (amortized) communication and k-database (information theoretic) PIR can be done with nO( log logkklogk) communication.
Advances in computational PIR
The first single-database computational PIR scheme to achieve communication complexity less than nnwas created in 1997 by Kushilevitz and Ostrovsky[3] and achieved communication complexity of nϵnϵ for any ϵϵ, where n is the number of bits in the database. The security of their scheme was based on the well-studied Quadratic residuosity problem. In 1999, Christian Cachin, Silvio Micali and Markus Stadler[4] achieved poly-logarithmic communication complexity. The security of their system is based on the Phi-hiding assumption. In 2004, Helger Lipmaa [5] achieved log-squared communication complexity
As shown by Ostrovsky and Skeith,[9] the schemes by Kushilevitz and Ostrovsky[3] and Lipmaa[5] use similar ideas based onhomomorphic encryption. The Kushilevitz and Ostrovsky protocol is based on theGoldwasser–Micali cryptosystem while the protocol by Lipmaa is based on the Damgård–Jurik crypto system.
Advances in information theoretic PIR
Achieving information theoretic security requires the assumption that there are multiple non-cooperating servers, each having a copy of the database. Without this assumption, any information-theoretically secure PIR protocol requires an amount of communication that is at least the size of the database n. Multi-server PIR protocols tolerant of non-responsive or malicious/colluding servers are calledrobust or Byzantine robust respectively. These issues were first considered by Beimel and Stahl (2002). An ℓ-server system that can operate where only k of the servers respond, ν of the servers respond incorrectly, and which can withstand up tot colluding servers without revealing the client’s query is called “t-private ν-Byzantine robustk-out-of-ℓ PIR” [DGH 2012]. In 2012, C. Devet, I. Goldberg, and N. Heninger (DGH 2012) proposed an optimally robust scheme that is Byzantine-robust to ν < k − t − 1 {\displaystyle \nu
Relation to other cryptographic primitives
One-way functions are necessary, but not known to be sufficient, for nontrivial (i.e., with sublinear communication) single database computationally private information retrieval. In fact, such a protocol was proved by Giovanni Di Crescenzo, Tal Malkin and Rafail Ostrovsky to imply oblivious transfer (see below).[11]
Oblivious transfer, also called symmetric PIR, is PIR with the additional restriction that the user may not learn any item other than the one she requested. It is termed symmetric because both the user and the database have a privacy requirement.
Collision-resistant cryptographic hash functions are implied by any one-round computational PIR scheme, as shown by Ishai, Kushilevitz and Ostrovsky.[12]
PIR variations
The basic motivation for Private Information Retrieval is a family of two-party protocols in which one of the parties (thesender) owns a database, and the other part (the receiver) wants to query it with certain privacy restrictions and warranties. So, as a result of the protocol, if thereceiver wants the i-th value in the database he must learn thei-th entry, but the sender must learn nothing about i. In a general PIR protocol, a computationally unboundedsender can learn nothing about i so privacy is theoretically preserved. Since the PIR problem was posed, different approaches to its solution have been pursued and some variations were proposed.
A CPIR (Computationally Private Information Retrieval) protocol is similar to a PIR protocol: thereceiver retrieves an element chosen by him from sender’s database, so that thesender obtains no knowledge about which element was transferred.[7] The only difference is that privacy is safeguarded against a polynomially bounded sender.[13]
Ref
wiki