And what if two musical versions don't share melody, harmony, rhythm, or lyrics ?

Abrassart, Mathilde; Doras, Guillaume

Full-text links:

Download:

Current browse context:

cs.SD

< prev | next >

new | recent | 2210

Computer Science > Sound

Title: And what if two musical versions don't share melody, harmony, rhythm, or lyrics ?

Authors: Mathilde Abrassart, Guillaume Doras

(Submitted on 3 Oct 2022)

Abstract: Version identification (VI) has seen substantial progress over the past few years. On the one hand, the introduction of the metric learning paradigm has favored the emergence of scalable yet accurate VI systems. On the other hand, using features focusing on specific aspects of musical pieces, such as melody, harmony, or lyrics, yielded interpretable and promising performances. In this work, we build upon these recent advances and propose a metric learning-based system systematically leveraging four dimensions commonly admitted to convey musical similarity between versions: melodic line, harmonic structure, rhythmic patterns, and lyrics. We describe our deliberately simple model architecture, and we show in particular that an approximated representation of the lyrics is an efficient proxy to discriminate between versions and non-versions. We then describe how these features complement each other and yield new state-of-the-art performances on two publicly available datasets. We finally suggest that a VI system using a combination of melodic, harmonic, rhythmic and lyrics features could theoretically reach the optimal performances obtainable on these datasets.

Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2210.01256 [cs.SD]
	(or arXiv:2210.01256v1 [cs.SD] for this version)

Submission history

From: Mathilde Abrassart [view email]
[v1] Mon, 3 Oct 2022 22:33:14 GMT (4571kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2210.01256

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Sound

Title: And what if two musical versions don't share melody, harmony, rhythm, or lyrics ?

Submission history