We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Learning Features of Music from Scratch

Abstract: This paper introduces a new large-scale music dataset, MusicNet, to serve as a source of supervision and evaluation of machine learning methods for music research. MusicNet consists of hundreds of freely-licensed classical music recordings by 10 composers, written for 11 instruments, together with instrument/note annotations resulting in over 1 million temporal labels on 34 hours of chamber music performances under various studio and microphone conditions.
The paper defines a multi-label classification task to predict notes in musical recordings, along with an evaluation protocol, and benchmarks several machine learning architectures for this task: i) learning from spectrogram features; ii) end-to-end learning with a neural net; iii) end-to-end learning with a convolutional neural net. These experiments show that end-to-end models trained for note prediction learn frequency selective filters as a low-level representation of audio.
Comments: 14 pages; camera-ready version; updated experiments and related works; additional MIR metrics (Appendix C)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Sound (cs.SD)
Cite as: arXiv:1611.09827 [stat.ML]
  (or arXiv:1611.09827v2 [stat.ML] for this version)

Submission history

From: John Thickstun [view email]
[v1] Tue, 29 Nov 2016 20:26:00 GMT (2373kb,D)
[v2] Thu, 6 Apr 2017 01:13:41 GMT (2373kb,D)

Link back to: arXiv, form interface, contact.