We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Learning Transferable Features for Speech Emotion Recognition

Abstract: Emotion recognition from speech is one of the key steps towards emotional intelligence in advanced human-machine interaction. Identifying emotions in human speech requires learning features that are robust and discriminative across diverse domains that differ in terms of language, spontaneity of speech, recording conditions, and types of emotions. This corresponds to a learning scenario in which the joint distributions of features and labels may change substantially across domains. In this paper, we propose a deep architecture that jointly exploits a convolutional network for extracting domain-shared features and a long short-term memory network for classifying emotions using domain-specific features. We use transferable features to enable model adaptation from multiple source domains, given the sparseness of speech emotion data and the fact that target domains are short of labeled data. A comprehensive cross-corpora experiment with diverse speech emotion domains reveals that transferable features provide gains ranging from 4.3% to 18.4% in speech emotion recognition. We evaluate several domain adaptation approaches, and we perform an ablation study to understand which source domains add the most to the overall recognition effectiveness for a given target domain.
Comments: ACM-MM'17, October 23-27, 2017
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
MSC classes: I.2.6
ACM classes: I.2.6
Journal reference: Proceedings of the on Thematic Workshops of ACM Multimedia 2017. ACM, 2017. Pages 529-536
DOI: 10.1145/3126686.3126735
Cite as: arXiv:1912.11547 [eess.AS]
  (or arXiv:1912.11547v1 [eess.AS] for this version)

Submission history

From: Alison Marczewski [view email]
[v1] Mon, 23 Dec 2019 18:06:08 GMT (305kb,D)

Link back to: arXiv, form interface, contact.