Resource-efficient DNNs for Keyword Spotting using Neural Architecture Search and Quantization

Peter, David; Roth, Wolfgang; Pernkopf, Franz

Full-text links:

Download:

Current browse context:

eess.AS

< prev | next >

new | recent | 2012

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Resource-efficient DNNs for Keyword Spotting using Neural Architecture Search and Quantization

Authors: David Peter, Wolfgang Roth, Franz Pernkopf

(Submitted on 18 Dec 2020)

Abstract: This paper introduces neural architecture search (NAS) for the automatic discovery of small models for keyword spotting (KWS) in limited resource environments. We employ a differentiable NAS approach to optimize the structure of convolutional neural networks (CNNs) to maximize the classification accuracy while minimizing the number of operations per inference. Using NAS only, we were able to obtain a highly efficient model with 95.4% accuracy on the Google speech commands dataset with 494.8 kB of memory usage and 19.6 million operations. Additionally, weight quantization is used to reduce the memory consumption even further. We show that weight quantization to low bit-widths (e.g. 1 bit) can be used without substantial loss in accuracy. By increasing the number of input features from 10 MFCC to 20 MFCC we were able to increase the accuracy to 96.3% at 340.1 kB of memory usage and 27.1 million operations.

Subjects:	Audio and Speech Processing (eess.AS); Machine Learning (cs.LG)
Cite as:	arXiv:2012.10138 [eess.AS]
	(or arXiv:2012.10138v1 [eess.AS] for this version)

Submission history

From: David Peter [view email]
[v1] Fri, 18 Dec 2020 09:53:55 GMT (522kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2012.10138

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Resource-efficient DNNs for Keyword Spotting using Neural Architecture Search and Quantization

Submission history