We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess.AS

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Autonomous In-Situ Soundscape Augmentation via Joint Selection of Masker and Gain

Abstract: The selection of maskers and playback gain levels in a soundscape augmentation system is crucial to its effectiveness in improving the overall acoustic comfort of a given environment. Traditionally, the selection of appropriate maskers and gain levels has been informed by expert opinion, which may not representative of the target population, or by listening tests, which can be time-consuming and labour-intensive. Furthermore, the resulting static choices of masker and gain are often inflexible to the dynamic nature of real-world soundscapes. In this work, we utilized a deep learning model to perform joint selection of the optimal masker and its gain level for a given soundscape. The proposed model was designed with highly modular building blocks, allowing for an optimized inference process that can quickly search through a large number of masker and gain combinations. In addition, we introduced the use of feature-domain soundscape augmentation conditioned on the digital gain level, eliminating the computationally expensive waveform-domain mixing process during inference time, as well as the tedious pre-calibration process required for new maskers. The proposed system was validated on a large-scale dataset of subjective responses to augmented soundscapes with more than 440 participants, ensuring the ability of the model to predict combined effect of the masker and its gain level on the perceptual pleasantness level.
Comments: Accepted to IEEE Signal Processing Letters. (c) 2022 IEEE
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
Journal reference: IEEE Signal Processing Letters, Vol. 29, pp. 1749 - 1753, 2022
DOI: 10.1109/LSP.2022.3194419
Cite as: arXiv:2204.13883 [eess.AS]
  (or arXiv:2204.13883v2 [eess.AS] for this version)

Submission history

From: Karn N Watcharasupat [view email]
[v1] Fri, 29 Apr 2022 04:59:56 GMT (2246kb,D)
[v2] Sat, 23 Jul 2022 13:45:19 GMT (2263kb,D)

Link back to: arXiv, form interface, contact.