We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess.AS

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Learning to Denoise Historical Music

Abstract: We propose an audio-to-audio neural network model that learns to denoise old music recordings. Our model internally converts its input into a time-frequency representation by means of a short-time Fourier transform (STFT), and processes the resulting complex spectrogram using a convolutional neural network. The network is trained with both reconstruction and adversarial objectives on a synthetic noisy music dataset, which is created by mixing clean music with real noise samples extracted from quiet segments of old recordings. We evaluate our method quantitatively on held-out test examples of the synthetic dataset, and qualitatively by human rating on samples of actual historical recordings. Our results show that the proposed method is effective in removing noise, while preserving the quality and details of the original music.
Comments: ISMIR 2020
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG)
Cite as: arXiv:2008.02027 [eess.AS]
  (or arXiv:2008.02027v2 [eess.AS] for this version)

Submission history

From: Yunpeng Li [view email]
[v1] Wed, 5 Aug 2020 10:05:44 GMT (202kb,D)
[v2] Thu, 16 Jun 2022 11:18:28 GMT (207kb,D)

Link back to: arXiv, form interface, contact.