Current browse context:
eess.AS
Change to browse by:
References & Citations
Electrical Engineering and Systems Science > Audio and Speech Processing
Title: Self-Supervised Speech Denoising Using Only Noisy Audio Signals
(Submitted on 30 Oct 2021 (v1), last revised 14 Jun 2022 (this version, v3))
Abstract: In traditional speech denoising tasks, clean audio signals are often used as the training target, but absolute clean signals are collected from expensive recording equipment or studios with strict environment. To address this issue, we propose an end-to-end self-supervised speech denoising training scheme using only noisy audio signals named On-ly-Noisy Training (ONT), overcoming the limitation of clean speech collection without any extra training condi-tions. The proposed ONT constructs training pairs only from each single noisy audio, and it contains two modules: training audio pairs generated module and speech de-noising module. The first module adopts a random audio sub-sampler on each noisy audio to generate training pairs. The sub-sampled pairs are then fed into a novel com-plex-valued speech denoising module. Experimental results show that the proposed method not only eliminates the high dependence on clean targets of traditional audio denoising tasks, but also achieves on-par or better performance than other training strategies. Source code was released in this https URL
Submission history
From: Jiasong Wu [view email][v1] Sat, 30 Oct 2021 13:00:23 GMT (263kb)
[v2] Tue, 31 May 2022 08:17:37 GMT (263kb)
[v3] Tue, 14 Jun 2022 12:52:00 GMT (600kb)
Link back to: arXiv, form interface, contact.