We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess.AS

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: ProsoBeast Prosody Annotation Tool

Abstract: The labelling of speech corpora is a laborious and time-consuming process. The ProsoBeast Annotation Tool seeks to ease and accelerate this process by providing an interactive 2D representation of the prosodic landscape of the data, in which contours are distributed based on their similarity. This interactive map allows the user to inspect and label the utterances. The tool integrates several state-of-the-art methods for dimensionality reduction and feature embedding, including variational autoencoders. The user can use these to find a good representation for their data. In addition, as most of these methods are stochastic, each can be used to generate an unlimited number of different prosodic maps. The web app then allows the user to seamlessly switch between these alternative representations in the annotation process. Experiments with a sample prosodically rich dataset have shown that the tool manages to find good representations of varied data and is helpful both for annotation and label correction. The tool is released as free software for use by the community.
Comments: Accepted at Interspeech 2021
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG)
Cite as: arXiv:2104.02397 [eess.AS]
  (or arXiv:2104.02397v2 [eess.AS] for this version)

Submission history

From: Branislav Gerazov [view email]
[v1] Tue, 6 Apr 2021 10:04:48 GMT (557kb,D)
[v2] Tue, 15 Jun 2021 07:40:36 GMT (553kb,D)

Link back to: arXiv, form interface, contact.