vocadito: A dataset of solo vocals with $f_0$, note, and lyric annotations

Bittner, Rachel M.; Pasalo, Katherine; Bosch, Juan José; Meseguer-Brocal, Gabriel; Rubinstein, David

Full-text links:

Download:

Current browse context:

cs.SD

< prev | next >

new | recent | 2110

Computer Science > Sound

Title: vocadito: A dataset of solo vocals with $f_0$, note, and lyric annotations

Authors: Rachel M. Bittner, Katherine Pasalo, Juan José Bosch, Gabriel Meseguer-Brocal, David Rubinstein

(Submitted on 11 Oct 2021 (v1), last revised 29 Oct 2021 (this version, v2))

Abstract: To compliment the existing set of datasets, we present a small dataset entitled vocadito, consisting of 40 short excerpts of monophonic singing, sung in 7 different languages by singers with varying of levels of training, and recorded on a variety of devices. We provide several types of annotations, including $f_0$, lyrics, and two different note annotations. All annotations were created by musicians. We provide an analysis of the differences between the two note annotations, and see that the agreement level is low, which has implications for evaluating vocal note estimation algorithms. We also analyze the relation between the $f_0$ and note annotations, and show that quantizing $f_0$ values in frequency does not provide a reasonable note estimate, reinforcing the difficulty of the note estimation task for singing voice. Finally, we provide baseline results from recent algorithms on vocadito for note and $f_0$ transcription. Vocadito is made freely available for public use.

Subjects:	Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2110.05580 [cs.SD]
	(or arXiv:2110.05580v2 [cs.SD] for this version)

Submission history

From: Gabriel Meseguer-Brocal [view email]
[v1] Mon, 11 Oct 2021 19:42:55 GMT (709kb,D)
[v2] Fri, 29 Oct 2021 14:27:37 GMT (710kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2110.05580

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Sound

Title: vocadito: A dataset of solo vocals with $f_0$, note, and lyric annotations

Submission history