We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.IT

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Information Theory

Title: Clustering Time Series and the Surprising Robustness of HMMs

Abstract: Suppose that you are given a time series where consecutive samples are believed to come from a probabilistic source, and that the source changes from time to time. Your objective is to learn the distribution of each source and to cluster the samples according to the source that generated them. A standard approach to this problem is to model the data as a hidden Markov model (HMM). However, due to the Markov property and stationarity of HMMs, simple examples can be given where this approach yields poor results for the clustering. We propose a more general, non-stationary model of the data, where the only restriction is that the sources can not change too often. Even though the model governing the sources may not be Markovian, we show that that a maximum likelihood HMM estimator can still be used. Specifically, we show that a maximum-likelihood HMM estimator produces the correct second moment of the data, and the results can be extended to higher moments. In contrast to the existing consistency and misspecification results involving maximum likelihood for HMMs, our approach yields bounds for finite sample sizes.
Subjects: Information Theory (cs.IT); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1605.02531 [cs.IT]
  (or arXiv:1605.02531v1 [cs.IT] for this version)

Submission history

From: Mark Kozdoba [view email]
[v1] Mon, 9 May 2016 11:24:19 GMT (81kb)
[v2] Wed, 14 Sep 2016 13:49:37 GMT (26kb)

Link back to: arXiv, form interface, contact.