Computer Science > Sound
Title: Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training
(Submitted on 29 Oct 2020 (v1), last revised 22 Aug 2021 (this version, v3))
Abstract: Speech separation has been well developed with the very successful permutation invariant training (PIT) approach. However, the frequent label assignment switching that occurs during PIT training remains a problem when faster convergence and better achievable performance are desired. In this paper, we propose self-supervised pre-training to stabilize the label assignment when training the speech separation model. Experiments over several types of self-supervised approaches, several typical speech separation models, and two different datasets showed that substantial improvements are achievable if a proper self-supervised approach is chosen.
Submission history
From: Sung-Feng Huang
[v1] Thu, 29 Oct 2020 06:07:01 GMT (1105kb,D)
[v2] Tue, 8 Jun 2021 15:31:15 GMT (721kb,D)
[v3] Sun, 22 Aug 2021 06:26:39 GMT (721kb,D)