Title: Multi de Bruijn Sequences

Authors: Glenn Tesler
Abstract: We generalize the notion of a de Bruijn sequence to a "multi de Bruijn sequence": a cyclic or linear sequence that contains every k-mer over an alphabet of size q exactly m times. For example, over the binary alphabet {0,1}, the cyclic sequence (00010111) and the linear sequence 000101110 each contain two instances of each 2-mer 00, 01, 10, 11. We derive formulas for the number of such sequences. The formulas and derivation generalize classical de Bruijn sequences (the case m=1). We also determine the number of multisets of aperiodic cyclic sequences containing every k-mer exactly m times; for example, the pair of cyclic sequences (00011)(011) contains two instances of each 2-mer listed above. This uses an extension of the Burrows-Wheeler Transform due to Mantaci et al., and generalizes a result by Higgins for the case m=1.
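
The examples in the abstract can be checked mechanically. The following is a minimal Python sketch, not taken from the paper: the function names `kmer_counts` and `is_multi_de_bruijn` are hypothetical, and sequences are assumed to be given as strings over the alphabet. It counts k-mers by brute force and tests the defining property (every k-mer occurs exactly m times); it does not implement the counting formulas the paper derives.

```python
from collections import Counter
from itertools import product


def kmer_counts(seq, k, cyclic=False):
    """Count the occurrences of each k-mer in seq, read linearly or cyclically."""
    n = len(seq)
    if cyclic:
        # A cyclic sequence has one window starting at every position,
        # with indices wrapping around modulo n.
        return Counter(
            "".join(seq[(i + j) % n] for j in range(k)) for i in range(n)
        )
    # A linear sequence has n - k + 1 windows (none if n < k).
    return Counter(seq[i:i + k] for i in range(n - k + 1))


def is_multi_de_bruijn(seqs, k, m, alphabet="01", cyclic=False):
    """Return True if the sequences in seqs together contain every k-mer
    over alphabet exactly m times (m >= 1)."""
    total = Counter()
    for s in seqs:
        total += kmer_counts(s, k, cyclic)
    expected = Counter({"".join(w): m for w in product(alphabet, repeat=k)})
    return total == expected


# The three examples from the abstract, with k = 2 and m = 2.
print(is_multi_de_bruijn(["00010111"], 2, 2, cyclic=True))      # cyclic sequence
print(is_multi_de_bruijn(["000101110"], 2, 2))                  # linear sequence
print(is_multi_de_bruijn(["00011", "011"], 2, 2, cyclic=True))  # multiset of cycles
```

Each of the three calls prints `True`. Note the lengths: the cyclic example has m·q^k = 8 symbols, while the linear one needs k − 1 = 1 extra symbol (9 in total) to supply the same number of windows.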
Comments: 29 pages, 2 figures
Subjects: Combinatorics (math.CO)
MSC classes: 68R15, 05C30 (Primary) 05C38, 05C45, 05C81, 68P30 (Secondary)
Journal reference: Journal of Combinatorics, 2017, 8(3):439-474
DOI: 10.4310/JOC.2017.v8.n3.a3
Cite as: arXiv:1708.03654 [math.CO]
  (or arXiv:1708.03654v1 [math.CO] for this version)

Submission history

From: Glenn Tesler
[v1] Fri, 11 Aug 2017 18:27:44 GMT (103kb,D)