Semantic Communications for Speech Signals

Weng, Zhenzi; Qin, Zhijin; Li, Geoffrey Ye

Full-text links:

Download:

Current browse context:

eess.AS

< prev | next >

new | recent | 2012

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Semantic Communications for Speech Signals

Authors: Zhenzi Weng, Zhijin Qin, Geoffrey Ye Li

(Submitted on 9 Dec 2020 (v1), last revised 7 Sep 2021 (this version, v2))

Abstract: We consider a semantic communication system for speech signals, named DeepSC-S. Motivated by the breakthroughs in deep learning (DL), we make an effort to recover the transmitted speech signals in the semantic communication systems, which minimizes the error at the semantic level rather than the bit level or symbol level as in the traditional communication systems. Particularly, based on an attention mechanism employing squeeze-and-excitation (SE) networks, we design the transceiver as an end-to-end (E2E) system, which learns and extracts the essential speech information. Furthermore, in order to facilitate the proposed DeepSC-S to work well on dynamic practical communication scenarios, we find a model yielding good performance when coping with various channel environments without retraining process. The simulation results demonstrate that our proposed DeepSC-S is more robust to channel variations and outperforms the traditional communication systems, especially in the low signal-to-noise (SNR) regime.

Comments:	6 pages. arXiv admin note: text overlap with arXiv:2107.11190
Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)
Cite as:	arXiv:2012.05369 [eess.AS]
	(or arXiv:2012.05369v2 [eess.AS] for this version)

Submission history

From: Zhenzi Weng [view email]
[v1] Wed, 9 Dec 2020 23:43:59 GMT (1116kb)
[v2] Tue, 7 Sep 2021 20:18:51 GMT (1002kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2012.05369

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Semantic Communications for Speech Signals

Submission history