We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Sound

Title: SchrödingeRNN: Generative Modeling of Raw Audio as a Continuously Observed Quantum State

Abstract: We introduce Schr\"odingeRNN, a quantum inspired generative model for raw audio. Audio data is wave-like and is sampled from a continuous signal. Although generative modelling of raw audio has made great strides lately, relational inductive biases relevant to these two characteristics are mostly absent from models explored to date. Quantum Mechanics is a natural source of probabilistic models of wave behaviour. Our model takes the form of a stochastic Schr\"odinger equation describing the continuous time measurement of a quantum system, and is equivalent to the continuous Matrix Product State (cMPS) representation of wavefunctions in one dimensional many-body systems. This constitutes a deep autoregressive architecture in which the systems state is a latent representation of the past observations. We test our model on synthetic data sets of stationary and non-stationary signals. This is the first time cMPS are used in machine learning.
Comments: 32 pages, 20 figures, under review for MSML 2020
Subjects: Sound (cs.SD); Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as: arXiv:1911.11879 [cs.SD]
  (or arXiv:1911.11879v1 [cs.SD] for this version)

Submission history

From: Beñat Mencia Uranga [view email]
[v1] Tue, 26 Nov 2019 23:33:46 GMT (558kb,D)

Link back to: arXiv, form interface, contact.