VaPar Synth -- A Variational Parametric Model for Audio Synthesis

Subramani, Krishna; Rao, Preeti; D'Hooge, Alexandre

doi:10.1109/ICASSP40776.2020.9054181

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2004

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: VaPar Synth -- A Variational Parametric Model for Audio Synthesis

Authors: Krishna Subramani, Preeti Rao, Alexandre D'Hooge

(Submitted on 30 Mar 2020)

Abstract: With the advent of data-driven statistical modeling and abundant computing power, researchers are turning increasingly to deep learning for audio synthesis. These methods try to model audio signals directly in the time or frequency domain. In the interest of more flexible control over the generated sound, it could be more useful to work with a parametric representation of the signal which corresponds more directly to the musical attributes such as pitch, dynamics and timbre. We present VaPar Synth - a Variational Parametric Synthesizer which utilizes a conditional variational autoencoder (CVAE) trained on a suitable parametric representation. We demonstrate our proposed model's capabilities via the reconstruction and generation of instrumental tones with flexible control over their pitch.

Comments:	this https URL , Accepted in ICASSP 2020
Subjects:	Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
DOI:	10.1109/ICASSP40776.2020.9054181
Cite as:	arXiv:2004.00001 [eess.AS]
	(or arXiv:2004.00001v1 [eess.AS] for this version)

Submission history

From: Krishna Subramani [view email]
[v1] Mon, 30 Mar 2020 16:05:47 GMT (172kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2004.00001

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: VaPar Synth -- A Variational Parametric Model for Audio Synthesis

Submission history