We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess.AS

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Speech Paralinguistic Approach for Detecting Dementia Using Gated Convolutional Neural Network

Abstract: We propose a non-invasive and cost-effective method to automatically detect dementia by utilizing solely speech audio data. We extract paralinguistic features for a short speech segment and use Gated Convolutional Neural Networks (GCNN) to classify it into dementia or healthy. We evaluate our method on the Pitt Corpus and on our own dataset, the PROMPT Database. Our method yields the accuracy of 73.1% on the Pitt Corpus using an average of 114 seconds of speech data. In the PROMPT Database, our method yields the accuracy of 74.7% using 4 seconds of speech data and it improves to 80.8% when we use all the patient's speech data. Furthermore, we evaluate our method on a three-class classification problem in which we included the Mild Cognitive Impairment (MCI) class and achieved the accuracy of 60.6% with 40 seconds of speech data.
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Quantitative Methods (q-bio.QM)
DOI: 10.1587/transinf.2020EDP7196
Cite as: arXiv:2004.07992 [eess.AS]
  (or arXiv:2004.07992v3 [eess.AS] for this version)

Submission history

From: Mariana Rodrigues Makiuchi [view email]
[v1] Thu, 16 Apr 2020 23:26:43 GMT (233kb,D)
[v2] Wed, 23 Sep 2020 05:30:57 GMT (0kb,I)
[v3] Tue, 6 Oct 2020 13:00:27 GMT (166kb,D)

Link back to: arXiv, form interface, contact.