Speech Denoising Using Only Single Noisy Audio Samples

Wu, Jiasong; Li, Qingchun; Kong, Youyong; Yang, Guanyu; Senhadji, Lotfi; Shu, Huazhong

(Submitted on 30 Oct 2021 (v1), revised 31 May 2022 (this version, v2), latest version 19 Jan 2023 (v4))

Abstract: In this paper, we propose a novel Single Noisy Audio De-noising Framework (SNA-DF) for speech denoising using only single noisy audio samples, which removes the need to construct either noisy-clean training pairs or multiple independent noisy audio samples. The proposed SNA-DF contains two modules: a training audio pair generation module and an audio denoising module. The first module applies a random audio sub-sampler to single noisy audio samples to generate training audio pairs. The sub-sampled training audio pairs are then fed into the audio denoising module, which employs a deep complex U-Net incorporating a complex two-stage transformer (cTSTM) to extract both magnitude and phase information, taking full advantage of the complex features of single noisy audio samples. Experimental results show that the proposed SNA-DF not only eliminates the heavy dependence on clean targets of traditional audio denoising methods, but also outperforms methods that use multiple noisy audio samples.
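The abstract does not spell out how the random audio sub-sampler works. The following is a minimal sketch of one plausible design, not the authors' implementation. It assumes the waveform is cut into non-overlapping blocks and that two distinct samples are drawn at random from each block, one for each member of the training pair. The function name `random_subsample_pair` and the parameters `block` and `seed` are hypothetical.

```python
import random
from typing import List, Optional, Sequence, Tuple


def random_subsample_pair(
    noisy: Sequence[float], block: int = 2, seed: Optional[int] = None
) -> Tuple[List[float], List[float]]:
    """Split one noisy waveform into two sub-sampled waveforms.

    Assumption (not taken from the abstract): the waveform is cut into
    non-overlapping blocks of `block` samples, and two different positions
    are drawn at random inside each block. One sample goes to the first
    output and the other goes to the second output.
    """
    if block < 2:
        raise ValueError("block must be at least 2")
    rng = random.Random(seed)
    first: List[float] = []
    second: List[float] = []
    # Ignore a trailing partial block so both outputs have equal length.
    usable = len(noisy) - len(noisy) % block
    for start in range(0, usable, block):
        i, j = rng.sample(range(block), 2)  # two distinct offsets in the block
        first.append(noisy[start + i])
        second.append(noisy[start + j])
    return first, second


# Toy usage: a 9-sample "waveform" yields two 4-sample sub-signals.
noisy_wave = [float(n) for n in range(9)]
pair_a, pair_b = random_subsample_pair(noisy_wave, block=2, seed=0)
```

Under this assumption, both outputs come from the same noisy recording and have the same length, each `1/block` of the input, so one could serve as the network input and the other as the training target without any clean reference.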

Comments:	5 pages, 2 figures
Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as:	arXiv:2111.00242 [eess.AS]
	(or arXiv:2111.00242v2 [eess.AS] for this version)

Submission history

From: Jiasong Wu
[v1] Sat, 30 Oct 2021 13:00:23 GMT (263kb)
[v2] Tue, 31 May 2022 08:17:37 GMT (263kb)
[v3] Tue, 14 Jun 2022 12:52:00 GMT (600kb)
[v4] Thu, 19 Jan 2023 13:17:18 GMT (952kb)
