Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data

Hsu, Wei-Ning; Zhang, Yu; Glass, James

(Submitted on 22 Sep 2017)

Abstract: We present a factorized hierarchical variational autoencoder, which learns disentangled and interpretable representations from sequential data without supervision. Specifically, we exploit the multi-scale nature of information in sequential data by formulating it explicitly within a factorized hierarchical graphical model that imposes sequence-dependent priors and sequence-independent priors to different sets of latent variables. The model is evaluated on two speech corpora to demonstrate, qualitatively, its ability to transform speakers or linguistic content by manipulating different sets of latent variables; and quantitatively, its ability to outperform an i-vector baseline for speaker verification and reduce the word error rate by as much as 35% in mismatched train/test scenarios for automatic speech recognition tasks.

Comments:	Accepted to NIPS 2017
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
Cite as:	arXiv:1709.07902 [cs.LG]
	(or arXiv:1709.07902v1 [cs.LG] for this version)

Submission history

From: Wei-Ning Hsu
[v1] Fri, 22 Sep 2017 18:36:50 GMT (7928kb,D)
