Current browse context:
cs.SD
Change to browse by:
References & Citations
Computer Science > Sound
Title: Synthesizing Speech from Intracranial Depth Electrodes using an Encoder-Decoder Framework
(Submitted on 2 Nov 2021 (this version), latest version 5 Jul 2022 (v2))
Abstract: Speech Neuroprostheses have the potential to enable communication for people with dysarthria or anarthria. Recent advances have demonstrated high-quality text decoding and speech synthesis from electrocorticographic grids placed on the cortical surface. Here, we investigate a less invasive measurement modality, namely stereotactic EEG (sEEG) that provides sparse sampling from multiple brain regions, including subcortical regions. To evaluate whether sEEG can also be used to synthesize high-quality audio from neural recordings, we employ a recurrent encoder-decoder framework based on modern deep learning methods. We demonstrate that high-quality speech can be reconstructed from these minimally invasive recordings, despite a limited amount of training data. Finally, we utilize variational feature dropout to successfully identify the most informative electrode contacts.
Submission history
From: Jonas Kohler [view email][v1] Tue, 2 Nov 2021 09:43:21 GMT (13973kb,D)
[v2] Tue, 5 Jul 2022 11:18:11 GMT (14482kb,D)
Link back to: arXiv, form interface, contact.