Title: Computing Matching Statistics on Repetitive Texts

Authors: Younan Gao
Abstract: Computing the matching statistics of a string $P[1..m]$ with respect to a text $T[1..n]$ is a fundamental problem with applications in genome sequence comparison. In this paper, we study the problem of computing matching statistics on highly repetitive texts. We design three different data structures that are similar to LZ-compressed indexes. The space cost of each can be expressed in terms of $\gamma$, the size of the smallest string attractor [STOC'2018], and $\delta$, a better measure of repetitiveness [LATIN'2020].
Comments: Full version of a DCC 2022 paper
Subjects: Data Structures and Algorithms (cs.DS)
Cite as: arXiv:2111.00376 [cs.DS]
  (or arXiv:2111.00376v2 [cs.DS] for this version)

Submission history

From: Younan Gao
[v1] Sun, 31 Oct 2021 01:42:27 GMT (196kb)
[v2] Thu, 13 Jan 2022 16:35:30 GMT (201kb)