SchrödingeRNN: Generative Modeling of Raw Audio as a Continuously Observed Quantum State

Uranga, Beñat Mencia; Lamacraft, Austen

Title: SchrödingeRNN: Generative Modeling of Raw Audio as a Continuously Observed Quantum State

Authors: Beñat Mencia Uranga, Austen Lamacraft

(Submitted on 26 Nov 2019)

Abstract: We introduce SchrödingeRNN, a quantum-inspired generative model for raw audio. Audio data is wave-like and is sampled from a continuous signal. Although generative modelling of raw audio has made great strides lately, relational inductive biases relevant to these two characteristics are mostly absent from the models explored to date. Quantum mechanics is a natural source of probabilistic models of wave behaviour. Our model takes the form of a stochastic Schrödinger equation describing the continuous-time measurement of a quantum system, and is equivalent to the continuous Matrix Product State (cMPS) representation of wavefunctions in one-dimensional many-body systems. This constitutes a deep autoregressive architecture in which the system's state is a latent representation of the past observations. We test our model on synthetic data sets of stationary and non-stationary signals. This is the first time cMPS have been used in machine learning.
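
As a rough illustration of what "a stochastic Schrödinger equation describing the continuous-time measurement of a quantum system" can look like in practice, the sketch below runs an Euler-Maruyama discretisation of a generic diffusive measurement equation and returns the measurement increments as a generated signal. It is not taken from the paper: the function name `sample_record`, the operators `H` and `L`, the two-level example and the step size `dt` are all assumptions chosen for illustration, and the paper's actual parameterisation and training procedure may differ.

```python
import numpy as np


def sample_record(H, L, psi0, n_steps, dt, rng):
    """Sample a measurement record from a diffusive stochastic Schrodinger equation.

    Hypothetical helper (not from the paper). H is a Hermitian matrix, L is the
    measured operator, psi0 is the initial state vector. Returns the signal
    increments dy and the final normalised state.
    """
    psi = psi0 / np.linalg.norm(psi0)
    # Non-Hermitian drift generator: -i H - (1/2) L^dagger L
    drift = -1j * H - 0.5 * (L.conj().T @ L)
    # Observable whose expectation value sets the mean of each increment
    x_op = L + L.conj().T
    dy = np.empty(n_steps)
    for t in range(n_steps):
        # The mean of the next increment depends only on the current state,
        # which summarises all past observations (the autoregressive structure).
        mean = np.real(np.vdot(psi, x_op @ psi))
        dy[t] = mean * dt + np.sqrt(dt) * rng.standard_normal()
        # Update the state conditioned on the sampled increment, then renormalise.
        psi = psi + (drift @ psi) * dt + (L @ psi) * dy[t]
        psi = psi / np.linalg.norm(psi)
    return dy, psi


# Illustrative two-level system; all values below are arbitrary assumptions.
rng = np.random.default_rng(0)
H = np.array([[1.0, 0.0], [0.0, -1.0]], dtype=complex)
L = 0.5 * np.array([[0.0, 1.0], [0.0, 0.0]], dtype=complex)
psi0 = np.array([1.0, 1.0], dtype=complex)
dy, psi = sample_record(H, L, psi0, n_steps=1000, dt=1e-3, rng=rng)
signal = np.cumsum(dy)  # integrated record, one sample per time step
```

In this sketch the state vector plays the role of the latent representation mentioned in the abstract: each new increment is drawn from a distribution determined by the current state, and the state is then updated with the value just drawn.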

Comments:	32 pages, 20 figures, under review for MSML 2020
Subjects:	Sound (cs.SD); Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:1911.11879 [cs.SD]
	(or arXiv:1911.11879v1 [cs.SD] for this version)

Submission history

From: Beñat Mencia Uranga
[v1] Tue, 26 Nov 2019 23:33:46 GMT (558kb,D)
