Current browse context:
stat
Change to browse by:
References & Citations
Computer Science > Information Theory
Title: Clustering Time Series and the Surprising Robustness of HMMs
(Submitted on 9 May 2016 (v1), last revised 14 Sep 2016 (this version, v2))
Abstract: Suppose that we are given a time series where consecutive samples are believed to come from a probabilistic source, that the source changes from time to time and that the total number of sources is fixed. Our objective is to estimate the distributions of the sources. A standard approach to this problem is to model the data as a hidden Markov model (HMM). However, since the data often lacks the Markov or the stationarity properties of an HMM, one can ask whether this approach is still suitable or perhaps another approach is required. In this paper we show that a maximum likelihood HMM estimator can be used to approximate the source distributions in a much larger class of models than HMMs. Specifically, we propose a natural and fairly general non-stationary model of the data, where the only restriction is that the sources do not change too often. Our main result shows that for this model, a maximum-likelihood HMM estimator produces the correct second moment of the data, and the results can be extended to higher moments.
Submission history
From: Mark Kozdoba [view email][v1] Mon, 9 May 2016 11:24:19 GMT (81kb)
[v2] Wed, 14 Sep 2016 13:49:37 GMT (26kb)
Link back to: arXiv, form interface, contact.