
Title: Matching strings in encoded sequences

Abstract: We investigate the longest common substring problem for encoded sequences and its asymptotic behaviour. The main result is a strong law of large numbers for a re-scaled version of this quantity, which presents an explicit relation with the Rényi entropy of the source. We apply this result to the zero-inflated contamination model and the stochastic scrabble. In the case of dynamical systems, this problem is equivalent to the shortest distance between two observed orbits and its limiting relationship with the correlation dimension of the pushforward measure. An extension to the shortest distance between orbits for random dynamical systems is also provided.
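
As a rough illustration of the quantities named in the abstract, the following sketch computes the length of the longest common substring of two sequences, applies a symbol-wise encoding, and evaluates the Rényi entropy of order 2 of a probability vector. The function names, the dynamic-programming approach, and the parity encoding in the usage note are assumptions made for illustration only; they are not taken from the paper, and the choice of order 2 is likewise an assumption about which Rényi entropy is relevant.

```python
import math


def longest_common_substring_length(x, y):
    """Length of the longest contiguous block that occurs in both x and y.

    Standard O(len(x) * len(y)) dynamic programme: after processing x[i-1],
    prev[j] is the length of the longest common suffix of x[:i] and y[:j].
    """
    best = 0
    prev = [0] * (len(y) + 1)
    for i in range(1, len(x) + 1):
        curr = [0] * (len(y) + 1)
        for j in range(1, len(y) + 1):
            if x[i - 1] == y[j - 1]:
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def encode(seq, f):
    """Apply a (hypothetical) symbol-wise encoder f to every symbol of seq."""
    return [f(s) for s in seq]


def renyi_entropy_order2(probs):
    """Rényi entropy of order 2, in nats: -log(sum of squared probabilities)."""
    return -math.log(sum(p * p for p in probs))
```

For example, with the hypothetical encoder `lambda s: s % 2`, the sequences `[1, 2, 3, 4]` and `[5, 2, 3, 8]` share the block `[2, 3]` before encoding, while their encodings `[1, 0, 1, 0]` and `[1, 0, 1, 0]` coincide entirely. A non-injective encoding can only preserve or lengthen common substrings, since equal blocks stay equal after encoding.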
Subjects: Probability (math.PR); Data Structures and Algorithms (cs.DS); Information Theory (cs.IT); Dynamical Systems (math.DS)
Cite as: arXiv:1903.09625 [math.PR]
  (or arXiv:1903.09625v2 [math.PR] for this version)

Submission history

From: Rodrigo Lambert
[v1] Fri, 22 Mar 2019 17:46:15 GMT (32kb)
[v2] Tue, 10 Dec 2019 17:58:58 GMT (24kb)
