Computing Matching Statistics on Repetitive Texts

Gao, Younan

Full-text links:

Download:

Current browse context:

cs.DS

< prev | next >

new | recent | 2111

Change to browse by:

Computer Science > Data Structures and Algorithms

Title: Computing Matching Statistics on Repetitive Texts

Authors: Younan Gao

(Submitted on 31 Oct 2021 (v1), last revised 13 Jan 2022 (this version, v2))

Abstract: Computing the {\em matching statistics} of a string $P[1..m]$ with respect to a text $T[1..n]$ is a fundamental problem which has application to genome sequence comparison. In this paper, we study the problem of computing the matching statistics upon highly repetitive texts. We design three different data structures that are similar to LZ-compressed indexes. The space costs of all of them can be measured by $\gamma$, the size of the smallest string attractor [STOC'2018] and $\delta$, a better measure of repetitiveness [LATIN'2020].

Comments:	full version of a DCC 2022 paper
Subjects:	Data Structures and Algorithms (cs.DS)
Cite as:	arXiv:2111.00376 [cs.DS]
	(or arXiv:2111.00376v2 [cs.DS] for this version)

Submission history

From: Younan Gao [view email]
[v1] Sun, 31 Oct 2021 01:42:27 GMT (196kb)
[v2] Thu, 13 Jan 2022 16:35:30 GMT (201kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2111.00376v2

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Data Structures and Algorithms

Title: Computing Matching Statistics on Repetitive Texts

Submission history