Current browse context:
cs.LG
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: Learning Overcomplete HMMs
(Submitted on 7 Nov 2017 (v1), last revised 28 Jun 2018 (this version, v2))
Abstract: We study the problem of learning overcomplete HMMs---those that have many hidden states but a small output alphabet. Despite having significant practical importance, such HMMs are poorly understood with no known positive or negative results for efficient learning. In this paper, we present several new results---both positive and negative---which help define the boundaries between the tractable and intractable settings. Specifically, we show positive results for a large subclass of HMMs whose transition matrices are sparse, well-conditioned, and have small probability mass on short cycles. On the other hand, we show that learning is impossible given only a polynomial number of samples for HMMs with a small output alphabet and whose transition matrices are random regular graphs with large degree. We also discuss these results in the context of learning HMMs which can capture long-term dependencies.
Submission history
From: Vatsal Sharan [view email][v1] Tue, 7 Nov 2017 06:55:03 GMT (1909kb,D)
[v2] Thu, 28 Jun 2018 01:49:33 GMT (1230kb,D)
Link back to: arXiv, form interface, contact.