Computer Science > Data Structures and Algorithms
Title: Fast detection of specific fragments against a set of sequences
(Submitted on 6 Aug 2022 (v1), last revised 6 Apr 2023 (this version, v2))
Abstract: We design alignment-free techniques for comparing a sequence or word, called a target, against a set of words, called a reference. A target-specific factor of a target $T$ against a reference $R$ is a factor $w$ of a word in $T$ that is not a factor of any word of $R$ and such that every proper factor of $w$ is a factor of a word of $R$. We first address the computation of the set of target-specific factors of a target $T$ against a reference $R$, where $T$ and $R$ are finite sets of sequences. The result is the construction of an automaton accepting the set of all target-specific factors under consideration. The construction algorithm runs in time linear in the size of $T\cup R$. The second result is an algorithm that computes all occurrences, in a single sequence $T$, of its target-specific factors against a reference $R$. The algorithm runs in real time on the target sequence, independently of the number of occurrences of target-specific factors.
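To make the definition above concrete, here is a minimal brute-force sketch in Python. It is not the automaton-based, linear-time construction the abstract describes; it simply enumerates factors and applies the definition directly, so it is only suitable for small inputs. The function names (`factors`, `target_specific_factors`, `occurrences`) are hypothetical, and the sketch assumes the reference $R$ is non-empty, so that the empty word counts as a factor of a word of $R$.

```python
def factors(words):
    """Return the set of all non-empty factors (substrings) of the given words."""
    result = set()
    for word in words:
        for i in range(len(word)):
            for j in range(i + 1, len(word) + 1):
                result.add(word[i:j])
    return result


def target_specific_factors(target, reference):
    """Brute-force set of target-specific factors of `target` against `reference`.

    Both arguments are iterables of strings. Assumption: `reference` is
    non-empty, so the empty word is treated as a factor of the reference.
    """
    ref_factors = factors(reference)
    ref_factors.add("")  # empty word (assumes a non-empty reference)
    specific = set()
    for w in factors(target):
        if w in ref_factors:
            continue  # w occurs in the reference, so it is not specific
        # The factors of the reference form a factor-closed set, so every
        # proper factor of w is in it exactly when the two longest proper
        # factors, w[1:] and w[:-1], are.
        if w[1:] in ref_factors and w[:-1] in ref_factors:
            specific.add(w)
    return specific


def occurrences(sequence, reference):
    """List (start, factor) pairs for every occurrence in `sequence`
    of one of its target-specific factors against `reference`."""
    found = []
    for w in target_specific_factors([sequence], reference):
        start = sequence.find(w)
        while start != -1:
            found.append((start, w))
            start = sequence.find(w, start + 1)  # allow overlapping matches
    return sorted(found)


if __name__ == "__main__":
    print(target_specific_factors(["abaab"], ["abab", "bb"]))
    print(occurrences("abaab", ["abab", "bb"]))
```

With target `abaab` and reference `{abab, bb}`, the word `aa` does not occur in the reference while its only non-empty proper factor `a` does, so `aa` is target-specific. Longer factors such as `baa` are excluded because they contain `aa`, which is itself absent from the reference.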
Submission history
From: Marie-Pierre Béal
[v1] Sat, 6 Aug 2022 06:32:36 GMT (17kb)
[v2] Thu, 6 Apr 2023 11:40:20 GMT (27kb)