Data augmentation using generative networks to identify dementia

Mirheidari, Bahman; Pan, Yilin; Blackburn, Daniel; O'Malley, Ronan; Walker, Traci; Venneri, Annalena; Reuber, Markus; Christensen, Heidi

Electrical Engineering and Systems Science > Audio and Speech Processing

(Submitted on 13 Apr 2020)

Abstract: Data limitation is one of the most common issues in training machine learning classifiers for medical applications. Due to ethical concerns and data privacy, the number of people that can be recruited to such experiments is generally smaller than the number of participants contributing to non-healthcare datasets. Recent research has shown that generative models can be an effective approach to data augmentation, which can ultimately help to train more robust classifiers in sparse data domains. A number of studies have shown that this data augmentation technique works for image and audio datasets. In this paper, we investigate the application of a similar approach to different types of speech and audio-based features extracted from interactions recorded with our automatic dementia detection system. Using two generative models, we show how the generated synthesized samples can improve the performance of a DNN-based classifier. The variational autoencoder increased the F-score of a four-way classifier distinguishing the typical patient groups seen in memory clinics from 58% to around 74%, an absolute improvement of about 16 percentage points.
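
The augmentation loop described in the abstract (fit a generative model to the scarce training features, sample synthetic feature vectors from it, and add them to the training set) can be illustrated with a minimal sketch. This is not the authors' method: the abstract names a variational autoencoder as one of its two generative models, whereas the sketch below swaps in a much simpler per-class diagonal Gaussian so that it runs with the Python standard library alone. The function names, the integer class labels, and the feature layout are all hypothetical.

```python
import random
import statistics


def fit_class_gaussians(features, labels):
    """Estimate a per-class mean and standard deviation for each feature dimension.

    ASSUMPTION: a diagonal Gaussian per class stands in for the generative
    model here; the paper itself uses neural generative models instead.
    """
    by_class = {}
    for x, y in zip(features, labels):
        by_class.setdefault(y, []).append(x)

    params = {}
    for y, rows in by_class.items():
        columns = list(zip(*rows))  # transpose: one tuple per feature dimension
        means = [statistics.fmean(c) for c in columns]
        stds = [statistics.pstdev(c) for c in columns]
        params[y] = (means, stds)
    return params


def sample_synthetic(params, n_per_class, rng):
    """Draw n_per_class synthetic feature vectors from each class's Gaussian."""
    synth_x, synth_y = [], []
    for y, (means, stds) in params.items():
        for _ in range(n_per_class):
            synth_x.append([rng.gauss(m, s) for m, s in zip(means, stds)])
            synth_y.append(y)
    return synth_x, synth_y


def augment(features, labels, n_per_class, seed=0):
    """Return the original training set followed by the synthetic samples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    params = fit_class_gaussians(features, labels)
    synth_x, synth_y = sample_synthetic(params, n_per_class, rng)
    return features + synth_x, labels + synth_y
```

Calling `augment(X, y, n_per_class=3)` on a list of feature vectors `X` with labels `y` returns the original samples followed by three synthetic samples per class; the enlarged set would then be passed to whatever classifier is being trained. Only the training split should be augmented, so that evaluation still reflects real recordings.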

Subjects:	Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
Cite as:	arXiv:2004.05989 [eess.AS]
	(or arXiv:2004.05989v1 [eess.AS] for this version)

Submission history

From: Bahman Mirheidari
[v1] Mon, 13 Apr 2020 15:05:24 GMT (438kb,D)
