We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess.AS

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Speech Denoising Using Only Single Noisy Audio Samples

Abstract: In this paper, we propose a novel Single Noisy Audio De-noising Framework (SNA-DF) for speech denoising using only single noisy audio samples, which overcomes the limi-tation of constructing either noisy-clean training pairs or multiple independent noisy audio samples. The proposed SNA-DF contains two modules: training audio pairs gener-ated module and audio denoising module. The first module adopts a random audio sub-sampler on single noisy audio samples for the generation of training audio pairs. The sub-sampled training audio pairs are then fed into the audio denoising module, which employs a deep complex U-Net incorporating a complex two-stage transformer (cTSTM) to extract both magnitude and phase information for taking full advantage of the complex features of single noisy au-dios. Experimental results show that the proposed SNA-DF not only eliminates the high dependence on clean targets of traditional audio denoising methods, but also outperforms the methods using multiple noisy audio samples.
Comments: 5 pages, 2 figures
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as: arXiv:2111.00242 [eess.AS]
  (or arXiv:2111.00242v2 [eess.AS] for this version)

Submission history

From: Jiasong Wu [view email]
[v1] Sat, 30 Oct 2021 13:00:23 GMT (263kb)
[v2] Tue, 31 May 2022 08:17:37 GMT (263kb)
[v3] Tue, 14 Jun 2022 12:52:00 GMT (600kb)
[v4] Thu, 19 Jan 2023 13:17:18 GMT (952kb)

Link back to: arXiv, form interface, contact.