J-Net: Randomly weighted U-Net for audio source separation

Chen, Bo-Wen; Hsu, Yen-Min; Lee, Hung-Yi

(Submitted on 29 Nov 2019)

Abstract: Several results in the computer vision literature have shown the potential of randomly weighted neural networks. Such networks perform fairly well as feature extractors for discriminative tasks, and their performance correlates positively with that of their fully trained counterparts. Motivated by these findings, we pose two questions: (1) what is the value of randomly weighted networks in difficult generative audio tasks such as audio source separation, and (2) does this positive correlation still hold between large random networks and their trained counterparts? In this paper, we demonstrate that the positive correlation still exists. Based on this finding, different architecture designs or tricks can be tried out without training the whole model. We also find a surprising result: in Wave-U-Net, fixing the decoder (up-sample path) to random weights yields better performance than leaving the encoder (down-sample path) untrained, and the result is almost comparable to the fully trained model.
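The idea of fixing one path to random weights while training the other can be illustrated with a deliberately tiny sketch. This is not the authors' Wave-U-Net or their training setup: the "encoder" and "decoder" here are single scalar weights, the target mapping y = 2x is made up for the example, and the function name `train_encoder_only` is hypothetical. It only shows the mechanics of drawing the decoder at random, never updating it, and letting gradient descent on the encoder compensate.

```python
import random


def train_encoder_only(steps=2000, lr=0.05, seed=0):
    """Toy model y_hat = dec * (enc * x); only `enc` receives gradient updates.

    Assumption: scalar weights stand in for the down-sample (encoder) and
    up-sample (decoder) paths. This is an illustration, not the paper's model.
    """
    rng = random.Random(seed)
    enc = rng.uniform(-1.0, 1.0)  # trainable "encoder" weight
    dec = rng.uniform(0.5, 1.5)   # random "decoder" weight, frozen after init

    # Hypothetical regression data: inputs in [-1, 1], targets y = 2x.
    data = [(k / 10.0, 2.0 * k / 10.0) for k in range(-10, 11)]

    for _ in range(steps):
        grad_enc = 0.0
        for x, y in data:
            err = dec * enc * x - y
            grad_enc += 2.0 * err * dec * x  # d(err^2)/d(enc)
        enc -= lr * grad_enc / len(data)     # `dec` is never updated

    loss = sum((dec * enc * x - y) ** 2 for x, y in data) / len(data)
    return enc, dec, loss


if __name__ == "__main__":
    enc, dec, loss = train_encoder_only()
    print(f"enc={enc:.4f} dec={dec:.4f} enc*dec={enc * dec:.4f} loss={loss:.2e}")
```

In this toy setting the trained encoder simply absorbs whatever scale the random decoder imposes, so the product of the two weights approaches the target slope. Whether a comparable effect holds for a deep separation network is an empirical question that the paper addresses; the sketch makes no claim about it.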

Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:1911.12926 [cs.SD]
	(or arXiv:1911.12926v1 [cs.SD] for this version)

Submission history

From: Yen-Min Hsu
[v1] Fri, 29 Nov 2019 02:24:05 GMT
