We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess.AS

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Contrastive Representation Learning for Acoustic Parameter Estimation

Abstract: A study is presented in which a contrastive learning approach is used to extract low-dimensional representations of the acoustic environment from single-channel, reverberant speech signals. Convolution of room impulse responses (RIRs) with anechoic source signals is leveraged as a data augmentation technique that offers considerable flexibility in the design of the upstream task. We evaluate the embeddings across three different downstream tasks, which include the regression of acoustic parameters reverberation time RT60 and clarity index C50, and the classification into small and large rooms. We demonstrate that the learned representations generalize well to unseen data and perform similarly to a fully-supervised baseline.
Comments: Accepted for ICASSP 2023, Camera-ready version
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as: arXiv:2302.11205 [eess.AS]
  (or arXiv:2302.11205v2 [eess.AS] for this version)

Submission history

From: Philipp Götz [view email]
[v1] Wed, 22 Feb 2023 08:37:43 GMT (276kb,D)
[v2] Mon, 13 Mar 2023 07:25:47 GMT (277kb,D)

Link back to: arXiv, form interface, contact.