Title: Deep Learning-based automated classification of Chinese Speech Sound Disorders

Abstract: This article describes a computer-based system that analyzes acoustic data to assist in the diagnosis and classification of children's speech sound disorders (SSDs). The analysis concentrated on identifying and categorizing four distinct types of Chinese SSDs. The study collected and generated a speech corpus containing 2540 stopping, backing, final consonant deletion process (FCDP), and affrication samples from 90 children aged 3 to 6 years with normal or pathological articulatory features. Each recording was accompanied by a detailed diagnostic annotation by two speech-language pathologists (SLPs). The speech samples were classified using three well-established neural network models for image classification. The feature maps were created from three sets of Mel-frequency cepstral coefficient (MFCC) parameters extracted from the speech sounds and aggregated into a three-dimensional data structure that serves as the model input. We applied six data augmentation techniques to enlarge the available dataset while avoiding overfitting. The experiments examine the usability of four different categories of Chinese phrases and characters. Experiments with different data subsets demonstrate the system's ability to accurately detect the analyzed pronunciation disorders. The best multi-class classification using a single Chinese phrase achieves an accuracy of 74.4 percent.
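
As a rough illustration of the aggregation step mentioned in the abstract, the sketch below stacks three two-dimensional feature matrices into one three-channel array of the kind an image classifier accepts. The abstract does not say which three MFCC parameter sets were used, what their dimensions were, or how differing lengths were handled, so the shapes, the zero-padding strategy, the random placeholder data, and the names `fit_frames` and `stack_feature_maps` are all assumptions made for this example and not the authors' implementation.

```python
import numpy as np


def fit_frames(feature: np.ndarray, n_frames: int) -> np.ndarray:
    """Crop or zero-pad a (n_coeffs, frames) matrix along time to n_frames."""
    n_coeffs, frames = feature.shape
    if frames >= n_frames:
        return feature[:, :n_frames]
    padded = np.zeros((n_coeffs, n_frames), dtype=feature.dtype)
    padded[:, :frames] = feature
    return padded


def stack_feature_maps(maps, n_frames: int) -> np.ndarray:
    """Stack three (n_coeffs, frames) matrices into (n_coeffs, n_frames, 3)."""
    if len(maps) != 3:
        raise ValueError("expected exactly three feature maps")
    n_coeffs = maps[0].shape[0]
    if any(m.shape[0] != n_coeffs for m in maps):
        raise ValueError("all feature maps need the same number of coefficients")
    # Each map becomes one channel, analogous to the R, G, B planes of an image.
    return np.stack([fit_frames(m, n_frames) for m in maps], axis=-1)


# Placeholder data standing in for three MFCC sets (assumed: 40 coefficients,
# differing frame counts). Real features would come from an audio front end.
rng = np.random.default_rng(0)
mfcc_sets = [
    rng.standard_normal((40, t)).astype(np.float32) for t in (95, 128, 140)
]

model_input = stack_feature_maps(mfcc_sets, n_frames=128)
print(model_input.shape)
```

Putting the three sets on the last axis mirrors the channel layout of a color image, which is one plausible way to feed such features to networks designed for image classification.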
Comments: Children 2022
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Journal reference: Children 2022, 9, 996
DOI: 10.3390/children9070996
Cite as: arXiv:2205.11748 [cs.SD]
  (or arXiv:2205.11748v4 [cs.SD] for this version)

Submission history

From: Yao Ming Kuo
[v1] Tue, 24 May 2022 03:23:22 GMT (4294kb,D)
[v2] Thu, 23 Jun 2022 10:23:03 GMT (4157kb,D)
[v3] Sat, 25 Jun 2022 02:03:04 GMT (3995kb,D)
[v4] Wed, 6 Jul 2022 09:24:46 GMT (4084kb,D)
