Upsampling layers for music source separation

Pons, Jordi; Serrà, Joan; Pascual, Santiago; Cengarle, Giulio; Arteaga, Daniel; Scaini, Davide

Full-text links:

Download:

Current browse context:

cs.SD

< prev | next >

new | recent | 2111

Computer Science > Sound

Title: Upsampling layers for music source separation

Authors: Jordi Pons, Joan Serrà, Santiago Pascual, Giulio Cengarle, Daniel Arteaga, Davide Scaini

(Submitted on 23 Nov 2021)

Abstract: Upsampling artifacts are caused by problematic upsampling layers and due to spectral replicas that emerge while upsampling. Also, depending on the used upsampling layer, such artifacts can either be tonal artifacts (additive high-frequency noise) or filtering artifacts (substractive, attenuating some bands). In this work we investigate the practical implications of having upsampling artifacts in the resulting audio, by studying how different artifacts interact and assessing their impact on the models' performance. To that end, we benchmark a large set of upsampling layers for music source separation: different transposed and subpixel convolution setups, different interpolation upsamplers (including two novel layers based on stretch and sinc interpolation), and different wavelet-based upsamplers (including a novel learnable wavelet layer). Our results show that filtering artifacts, associated with interpolation upsamplers, are perceptually preferrable, even if they tend to achieve worse objective scores.

Comments:	Demo page: this http URL
Subjects:	Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2111.11773 [cs.SD]
	(or arXiv:2111.11773v1 [cs.SD] for this version)

Submission history

From: Jordi Pons [view email]
[v1] Tue, 23 Nov 2021 10:36:28 GMT (3757kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2111.11773v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Sound

Title: Upsampling layers for music source separation

Submission history