SingSong: Generating musical accompaniments from singing

Donahue, Chris; Caillon, Antoine; Roberts, Adam; Manilow, Ethan; Esling, Philippe; Agostinelli, Andrea; Verzetti, Mauro; Simon, Ian; Pietquin, Olivier; Zeghidour, Neil; Engel, Jesse

Full-text links:

Download:

Current browse context:

cs.AI

< prev | next >

new | recent | 2301

Computer Science > Sound

Title: SingSong: Generating musical accompaniments from singing

Authors: Chris Donahue, Antoine Caillon, Adam Roberts, Ethan Manilow, Philippe Esling, Andrea Agostinelli, Mauro Verzetti, Ian Simon, Olivier Pietquin, Neil Zeghidour, Jesse Engel

(Submitted on 30 Jan 2023)

Abstract: We present SingSong, a system that generates instrumental music to accompany input vocals, potentially offering musicians and non-musicians alike an intuitive new way to create music featuring their own voice. To accomplish this, we build on recent developments in musical source separation and audio generation. Specifically, we apply a state-of-the-art source separation algorithm to a large corpus of music audio to produce aligned pairs of vocals and instrumental sources. Then, we adapt AudioLM (Borsos et al., 2022) -- a state-of-the-art approach for unconditional audio generation -- to be suitable for conditional "audio-to-audio" generation tasks, and train it on the source-separated (vocal, instrumental) pairs. In a pairwise comparison with the same vocal inputs, listeners expressed a significant preference for instrumentals generated by SingSong compared to those from a strong retrieval baseline.
Sound examples at this https URL

Subjects:	Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2301.12662 [cs.SD]
	(or arXiv:2301.12662v1 [cs.SD] for this version)

Submission history

From: Chris Donahue [view email]
[v1] Mon, 30 Jan 2023 04:53:23 GMT (286kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2301.12662

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Sound

Title: SingSong: Generating musical accompaniments from singing

Submission history