Title: The Scattering Transform Network with Generalized Morse Wavelets and Its Application to Music Genre Classification

Abstract: We propose using the Generalized Morse Wavelets (GMWs) instead of the commonly used Morlet (or Gabor) wavelets in the Scattering Transform Network (STN) for signal classification problems; we call the resulting network the GMW-STN. The GMWs form a parameterized family of truly analytic wavelets, whereas the Morlet wavelets are only approximately analytic. The analyticity of the underlying wavelet filters in the STN is particularly important for nonstationary oscillatory signals such as music signals because it improves the interpretability of the STN representations by providing multiscale amplitude and phase (and consequently frequency) information about the input signals. We demonstrate the superiority of the GMW-STN over the conventional STN in music genre classification using the so-called GTZAN database. Moreover, we show that increasing the number of layers of the GMW-STN to three improves its performance over the typical two-layer STN.
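The analyticity claim in the abstract can be checked numerically. The following is a minimal sketch, not the authors' implementation. It assumes the standard frequency-domain definition of the GMW from the wavelet literature, a_{beta,gamma} * omega^beta * exp(-omega^gamma) for omega > 0 and zero otherwise, and a Morlet wavelet with the usual zero-mean correction. The function names and the parameter values (beta = gamma = 3, omega0 = 5) are illustrative assumptions, not values taken from the paper.

```python
import numpy as np


def gmw_freq(omega, beta=3.0, gamma=3.0):
    """Frequency-domain generalized Morse wavelet, normalized so its peak is 2.

    beta and gamma are illustrative defaults, not values from the paper.
    """
    omega = np.asarray(omega, dtype=float)
    out = np.zeros_like(omega)  # exactly zero for omega <= 0 (analytic)
    pos = omega > 0
    a = 2.0 * (np.e * gamma / beta) ** (beta / gamma)
    out[pos] = a * omega[pos] ** beta * np.exp(-omega[pos] ** gamma)
    return out


def morlet_freq(omega, omega0=5.0):
    """Frequency-domain Morlet wavelet with zero-mean correction (unnormalized).

    omega0 is an illustrative center frequency, not a value from the paper.
    """
    omega = np.asarray(omega, dtype=float)
    gaussian = np.exp(-0.5 * (omega - omega0) ** 2)
    correction = np.exp(-0.5 * (omega ** 2 + omega0 ** 2))
    return gaussian - correction


omega = np.linspace(-10.0, 10.0, 2001)
negative = omega < 0

# Largest filter magnitude on the negative-frequency axis for each wavelet.
gmw_leak = np.max(np.abs(gmw_freq(omega)[negative]))
morlet_leak = np.max(np.abs(morlet_freq(omega)[negative]))

print("GMW max magnitude at negative frequencies:   ", gmw_leak)
print("Morlet max magnitude at negative frequencies:", morlet_leak)
```

Under these assumptions the GMW filter vanishes identically on negative frequencies, while the Morlet filter keeps a small nonzero tail there. That tail grows as omega0 is lowered, which is the sense in which the Morlet wavelet is only approximately analytic.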
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG)
MSC classes: 68T10, 94A12, 65T60
ACM classes: I.5.4
Cite as: arXiv:2206.07857 [eess.AS]
  (or arXiv:2206.07857v1 [eess.AS] for this version)

Submission history

From: Naoki Saito
[v1] Thu, 16 Jun 2022 00:30:09 GMT (2827kb,D)
