Autonomous In-Situ Soundscape Augmentation via Joint Selection of Masker and Gain

Watcharasupat, Karn N.; Ooi, Kenneth; Lam, Bhan; Wong, Trevor; Ong, Zhen-Ting; Gan, Woon-Seng

doi:10.1109/LSP.2022.3194419

Full-text links:

Download:

Current browse context:

eess.AS

< prev | next >

new | recent | 2204

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Autonomous In-Situ Soundscape Augmentation via Joint Selection of Masker and Gain

Authors: Karn N. Watcharasupat, Kenneth Ooi, Bhan Lam, Trevor Wong, Zhen-Ting Ong, Woon-Seng Gan

(Submitted on 29 Apr 2022 (v1), last revised 23 Jul 2022 (this version, v2))

Abstract: The selection of maskers and playback gain levels in a soundscape augmentation system is crucial to its effectiveness in improving the overall acoustic comfort of a given environment. Traditionally, the selection of appropriate maskers and gain levels has been informed by expert opinion, which may not representative of the target population, or by listening tests, which can be time-consuming and labour-intensive. Furthermore, the resulting static choices of masker and gain are often inflexible to the dynamic nature of real-world soundscapes. In this work, we utilized a deep learning model to perform joint selection of the optimal masker and its gain level for a given soundscape. The proposed model was designed with highly modular building blocks, allowing for an optimized inference process that can quickly search through a large number of masker and gain combinations. In addition, we introduced the use of feature-domain soundscape augmentation conditioned on the digital gain level, eliminating the computationally expensive waveform-domain mixing process during inference time, as well as the tedious pre-calibration process required for new maskers. The proposed system was validated on a large-scale dataset of subjective responses to augmented soundscapes with more than 440 participants, ensuring the ability of the model to predict combined effect of the masker and its gain level on the perceptual pleasantness level.

Comments:	Accepted to IEEE Signal Processing Letters. (c) 2022 IEEE
Subjects:	Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
Journal reference:	IEEE Signal Processing Letters, Vol. 29, pp. 1749 - 1753, 2022
DOI:	10.1109/LSP.2022.3194419
Cite as:	arXiv:2204.13883 [eess.AS]
	(or arXiv:2204.13883v2 [eess.AS] for this version)

Submission history

From: Karn N Watcharasupat [view email]
[v1] Fri, 29 Apr 2022 04:59:56 GMT (2246kb,D)
[v2] Sat, 23 Jul 2022 13:45:19 GMT (2263kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2204.13883

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Autonomous In-Situ Soundscape Augmentation via Joint Selection of Masker and Gain

Submission history