Title: Audio Event Detection using Weakly Labeled Data

Abstract: Acoustic event detection is essential for content analysis and description of multimedia recordings. Most of the current literature on the topic learns detectors through fully supervised techniques that require strongly labeled data. However, the labels available for online multimedia data are generally weak and do not provide sufficient detail for such methods to be employed. In this paper we propose a framework, based on Multiple Instance Learning (MIL), for learning acoustic event detectors using only weakly labeled data. We first show that audio event detection using weakly labeled data can be formulated as an MIL problem. We then suggest two methods for solving the multiple-instance learning problem, one based on neural networks and the other on support vector machines. The proposed methods can help remove the time-consuming and expensive process of manually annotating data to facilitate fully supervised learning. Our proposed framework can not only detect events in a recording but also provide the temporal locations of those events. This is notable because this information is never available in the first place for weakly labeled data.
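The MIL formulation mentioned in the abstract can be illustrated with a minimal sketch. It treats a recording as a "bag" of short segments ("instances") and assumes the standard MIL rule that a bag is positive if at least one of its instances is positive, so the bag score is taken as the maximum instance score. The abstract does not state the exact pooling rule or segment length used in the paper, so the max rule, the function names (`make_bag`, `bag_score`, `localize`), the threshold, and the example scores are all illustrative assumptions, not the authors' implementation.

```python
from typing import List, Sequence, Tuple


def make_bag(frames: Sequence[float], segment_len: int) -> List[List[float]]:
    """Split a recording (here a flat list of frame values) into
    fixed-length segments. Each segment is one MIL instance."""
    return [list(frames[i:i + segment_len])
            for i in range(0, len(frames), segment_len)]


def bag_score(instance_scores: Sequence[float]) -> float:
    """Standard MIL assumption: a bag is as positive as its most
    positive instance, so pool instance scores with max."""
    return max(instance_scores)


def localize(instance_scores: Sequence[float],
             segment_seconds: float,
             threshold: float = 0.5) -> List[Tuple[float, float]]:
    """Return (start, end) times of segments whose score reaches the
    threshold. This is how instance-level scores yield temporal
    locations even though only recording-level labels were given."""
    return [(i * segment_seconds, (i + 1) * segment_seconds)
            for i, score in enumerate(instance_scores)
            if score >= threshold]


# Hypothetical per-segment scores from some instance-level classifier
# (for example a neural network or an SVM); the values are made up.
scores = [0.1, 0.8, 0.2]
recording_score = bag_score(scores)                   # recording-level detection
event_times = localize(scores, segment_seconds=1.0)   # segment-level localization
```

During training only the recording-level label is compared against the pooled bag score, while at test time the per-segment scores can be inspected directly, which is what makes temporal localization possible under this assumption.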
Comments: updated version on arXiv
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
Cite as: arXiv:1605.02401 [cs.SD]
  (or arXiv:1605.02401v2 [cs.SD] for this version)

Submission history

From: Anurag Kumar
[v1] Mon, 9 May 2016 02:17:12 GMT (255kb)
[v2] Thu, 9 Jun 2016 03:33:13 GMT (255kb)
[v3] Wed, 6 Jul 2016 05:46:56 GMT (256kb)