We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.IT

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Information Theory

Title: Harnessing Correlations in Distributed Erasure-Coded Key-Value Stores

Abstract: Motivated by applications of distributed storage systems to key-value stores, the multi-version coding problem was formulated to efficiently store frequently updated data in asynchronous decentralized storage systems. Inspired by consistency requirements in distributed systems, the main goal in the multi-version coding problem is to ensure that the latest possible version of the data is decodable, even if the data updates have not reached some servers in the system. In this paper, we study the storage cost of ensuring consistency for the case where the data versions are correlated, in contrast to previous work where data versions were treated as being independent. We provide multi-version code constructions that show that the storage cost can be significantly smaller than the previous constructions depending on the degree of correlation, despite the asynchrony and the decentralized nature. Our achievability results are based on Reed-Solomon codes and random binning. Through an information-theoretic converse, we show that our multi-version codes are nearly-optimal, within a factor of $2$, in certain interesting regimes.
Subjects: Information Theory (cs.IT)
Journal reference: IEEE Transactions on Communications 2019
Cite as: arXiv:1708.06042 [cs.IT]
  (or arXiv:1708.06042v2 [cs.IT] for this version)

Submission history

From: Ramy Ali [view email]
[v1] Mon, 21 Aug 2017 01:02:01 GMT (210kb,D)
[v2] Sat, 9 Mar 2019 19:06:00 GMT (673kb,D)

Link back to: arXiv, form interface, contact.