We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess.AS

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Automatic Organisation, Segmentation, and Filtering of User-Generated Audio Content

Abstract: Using solely the information retrieved by audio fingerprinting techniques, we propose methods to treat a possibly large dataset of user-generated audio content, that (1) enable the grouping of several audio files that contain a common audio excerpt (i.e., are relative to the same event), and (2) give information about how those files are correlated in terms of time and quality inside each event. Furthermore, we use supervised learning to detect incorrect matches that may arise from the audio fingerprinting algorithm itself, whilst ensuring our model learns with previous predictions. All the presented methods were further validated by user-generated recordings of several different concerts manually crawled from YouTube.
Comments: MMSP 2017 - IEEE 19th International Workshop on Multimedia Signal Processing
Subjects: Audio and Speech Processing (eess.AS); Information Retrieval (cs.IR); Multimedia (cs.MM); Sound (cs.SD)
Cite as: arXiv:1708.05302 [eess.AS]
  (or arXiv:1708.05302v1 [eess.AS] for this version)

Submission history

From: Gonçalo Mordido [view email]
[v1] Thu, 17 Aug 2017 14:19:17 GMT (216kb,D)

Link back to: arXiv, form interface, contact.