On Neural Phone Recognition of Mixed-Source ECoG Signals

Abdelaziz, Ahmed Hussen; Chang, Shuo-Yiin; Morgan, Nelson; Edwards, Erik; Kolossa, Dorothea; Ellis, Dan; Moses, David A.; Chang, Edward F.

Electrical Engineering and Systems Science > Audio and Speech Processing

(Submitted on 12 Dec 2019)

Abstract: The emerging field of neural speech recognition (NSR) using electrocorticography has recently attracted remarkable research interest for studying how human brains recognize speech in quiet and noisy surroundings. In this study, we demonstrate the utility of NSR systems to objectively prove the ability of human beings to attend to a single speech source while suppressing the interfering signals in a simulated cocktail party scenario. The experimental results show that the relative degradation of the NSR system performance when tested in a mixed-source scenario is significantly lower than that of automatic speech recognition (ASR). In this paper, we have significantly enhanced the performance of our recently published framework by using manual alignments for initialization instead of the flat start technique. We have also improved the NSR system performance by accounting for the possible transcription mismatch between the acoustic and neural signals.

Comments:	5 pages, showing algorithms, results and references from our collaboration during a 2017 postdoc stay of the first author
Subjects:	Audio and Speech Processing (eess.AS); Neural and Evolutionary Computing (cs.NE); Sound (cs.SD); Neurons and Cognition (q-bio.NC)
Cite as:	arXiv:1912.05869 [eess.AS]
	(or arXiv:1912.05869v1 [eess.AS] for this version)

Submission history

From: Dorothea Kolossa
[v1] Thu, 12 Dec 2019 10:37:22 GMT (5764kb,D)
