Current browse context:
eess.AS
Change to browse by:
References & Citations
Electrical Engineering and Systems Science > Audio and Speech Processing
Title: Semantic Communications for Speech Signals
(Submitted on 9 Dec 2020 (v1), last revised 7 Sep 2021 (this version, v2))
Abstract: We consider a semantic communication system for speech signals, named DeepSC-S. Motivated by the breakthroughs in deep learning (DL), we make an effort to recover the transmitted speech signals in the semantic communication systems, which minimizes the error at the semantic level rather than the bit level or symbol level as in the traditional communication systems. Particularly, based on an attention mechanism employing squeeze-and-excitation (SE) networks, we design the transceiver as an end-to-end (E2E) system, which learns and extracts the essential speech information. Furthermore, in order to facilitate the proposed DeepSC-S to work well on dynamic practical communication scenarios, we find a model yielding good performance when coping with various channel environments without retraining process. The simulation results demonstrate that our proposed DeepSC-S is more robust to channel variations and outperforms the traditional communication systems, especially in the low signal-to-noise (SNR) regime.
Submission history
From: Zhenzi Weng [view email][v1] Wed, 9 Dec 2020 23:43:59 GMT (1116kb)
[v2] Tue, 7 Sep 2021 20:18:51 GMT (1002kb)
Link back to: arXiv, form interface, contact.