Current browse context:
eess.AS
Change to browse by:
References & Citations
Electrical Engineering and Systems Science > Audio and Speech Processing
Title: Contrastive Representation Learning for Acoustic Parameter Estimation
(Submitted on 22 Feb 2023 (v1), last revised 13 Mar 2023 (this version, v2))
Abstract: A study is presented in which a contrastive learning approach is used to extract low-dimensional representations of the acoustic environment from single-channel, reverberant speech signals. Convolution of room impulse responses (RIRs) with anechoic source signals is leveraged as a data augmentation technique that offers considerable flexibility in the design of the upstream task. We evaluate the embeddings across three different downstream tasks, which include the regression of acoustic parameters reverberation time RT60 and clarity index C50, and the classification into small and large rooms. We demonstrate that the learned representations generalize well to unseen data and perform similarly to a fully-supervised baseline.
Submission history
From: Philipp Götz [view email][v1] Wed, 22 Feb 2023 08:37:43 GMT (276kb,D)
[v2] Mon, 13 Mar 2023 07:25:47 GMT (277kb,D)
Link back to: arXiv, form interface, contact.