Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting

Sun, Ming; Raju, Anirudh; Tucker, George; Panchapagesan, Sankaran; Fu, Gengshen; Mandal, Arindam; Matsoukas, Spyros; Strom, Nikko; Vitaladevuni, Shiv

doi:10.1109/SLT.2016.7846306

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 1705

Computer Science > Computation and Language

Title: Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting

Authors: Ming Sun, Anirudh Raju, George Tucker, Sankaran Panchapagesan, Gengshen Fu, Arindam Mandal, Spyros Matsoukas, Nikko Strom, Shiv Vitaladevuni

(Submitted on 5 May 2017)

Abstract: We propose a max-pooling based loss function for training Long Short-Term Memory (LSTM) networks for small-footprint keyword spotting (KWS), with low CPU, memory, and latency requirements. The max-pooling loss training can be further guided by initializing with a cross-entropy loss trained network. A posterior smoothing based evaluation approach is employed to measure keyword spotting performance. Our experimental results show that LSTM models trained using cross-entropy loss or max-pooling loss outperform a cross-entropy loss trained baseline feed-forward Deep Neural Network (DNN). In addition, max-pooling loss trained LSTM with randomly initialized network performs better compared to cross-entropy loss trained LSTM. Finally, the max-pooling loss trained LSTM initialized with a cross-entropy pre-trained network shows the best performance, which yields $67.6\%$ relative reduction compared to baseline feed-forward DNN in Area Under the Curve (AUC) measure.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
Journal reference:	Spoken Language Technology Workshop (SLT), 2016 IEEE (pp. 474-480). IEEE
DOI:	10.1109/SLT.2016.7846306
Cite as:	arXiv:1705.02411 [cs.CL]
	(or arXiv:1705.02411v1 [cs.CL] for this version)

Submission history

From: Anirudh Raju [view email]
[v1] Fri, 5 May 2017 22:36:04 GMT (241kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1705.02411

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting

Submission history