Title: Dv2v: A Dynamic Variable-to-Variable Compressor

Abstract: We present Dv2v, a new dynamic (one-pass) variable-to-variable compressor. Variable-to-variable compression combines a modeler that gathers variable-length input symbols with a variable-length statistical coder that assigns shorter codewords to the more frequent symbols. Dv2v processes the input text word-wise to gather variable-length symbols that are either terminals (new words) or non-terminals (subsequences of words seen earlier in the input text). These symbols are stored in a vocabulary that is kept sorted by frequency, so they can be easily encoded with dense codes. Dv2v permits real-time transmission of data, i.e., compression and transmission can begin as soon as data become available. Our experiments show that Dv2v improves on the compression ratios of v2vDC, the state-of-the-art semi-static variable-to-variable compressor, and almost reaches those of p7zip. It also delivers competitive compression and decompression performance.
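Illustration: the idea of coding a frequency-sorted vocabulary with dense codes can be sketched as follows. This is a toy under stated assumptions, not the Dv2v implementation. It uses End-Tagged Dense Code as one well-known dense code (the abstract does not say which variant Dv2v uses), it handles only terminals (single words) and omits non-terminals entirely, and the names etdc_encode, etdc_decode, and compress_words are hypothetical.

```python
def etdc_encode(rank: int) -> bytes:
    """End-Tagged Dense Code: the last byte is >= 128, earlier bytes are < 128."""
    out = [128 + rank % 128]          # final (tagged) byte
    rank //= 128
    while rank > 0:
        rank -= 1                     # keeps the code dense: no codeword is wasted
        out.append(rank % 128)
        rank //= 128
    return bytes(reversed(out))


def etdc_decode(code: bytes) -> int:
    """Inverse of etdc_encode for a single complete codeword."""
    rank = 0
    for b in code[:-1]:
        rank = (rank + b + 1) * 128
    return rank + code[-1] - 128


def compress_words(words):
    """One-pass toy model: returns a list of (codeword, literal_or_None).

    A known word is sent as the codeword of its current rank. A new word is
    sent as an escape (the rank just past the vocabulary) plus the word itself.
    Assumption: ties in frequency keep insertion order (stable sort).
    """
    vocab = []    # words ordered by decreasing frequency
    freq = {}
    out = []
    for w in words:
        if w in freq:
            out.append((etdc_encode(vocab.index(w)), None))
        else:
            out.append((etdc_encode(len(vocab)), w))
            vocab.append(w)
            freq[w] = 0
        freq[w] += 1
        # Toy re-sort; a real dynamic coder would update ranks incrementally.
        vocab.sort(key=lambda s: -freq[s])
    return out


if __name__ == "__main__":
    for codeword, literal in compress_words("a b a a b c".split()):
        print(codeword.hex(), literal)
```

Because the vocabulary is ordered by frequency, the most frequent words receive the lowest ranks and therefore the one-byte codewords. A decoder that applies the same update rule after each symbol stays synchronized with the encoder, which is what makes one-pass operation possible in this sketch.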
Comments: This research has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Actions H2020-MSCA-RISE-2015 BIRDS GA No. 690941
Subjects: Data Structures and Algorithms (cs.DS)
Journal reference: Dv2v: A Dynamic Variable-to-Variable Compressor. In 2019 Data Compression Conference (DCC) (pp. 83-92). IEEE
DOI: 10.1109/DCC.2019.00016
Cite as: arXiv:1911.04202 [cs.DS]
  (or arXiv:1911.04202v1 [cs.DS] for this version)

Submission history

From: Adrián Gómez-Brandón
[v1] Mon, 11 Nov 2019 12:05:37 GMT (532kb)
