Title: Learning Deep Models from Synthetic Data for Extracting Dolphin Whistle Contours

Abstract: We present a learning-based method for extracting whistles of toothed whales (Odontoceti) in hydrophone recordings. Our method represents audio signals as time-frequency spectrograms and decomposes each spectrogram into a set of time-frequency patches. A deep neural network learns archetypical patterns (e.g., crossings, frequency-modulated sweeps) from the spectrogram patches and predicts time-frequency peaks that are associated with whistles. We also developed a comprehensive method to synthesize training samples from background environments and train the network with minimal human annotation effort. We applied the proposed learn-from-synthesis method to a subset of the public Detection, Classification, Localization, and Density Estimation (DCLDE) 2011 workshop data to extract whistle confidence maps, which we then processed with an existing contour extractor to produce whistle annotations. The F1-score of our best synthesis method was 0.158 greater than that of our baseline whistle extraction algorithm (~25% improvement) when applied to common dolphin (Delphinus spp.) and bottlenose dolphin (Tursiops truncatus) whistles.
Comments: Invited paper for International Joint Conference on Neural Networks
Subjects: Quantitative Methods (q-bio.QM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Journal reference: in Intl. Joint Conf. Neural Net. (Glasgow, Scotland, July 19-24), pp. 10 (2020)
Report number: IJCNN paper 6435539
Cite as: arXiv:2005.08894 [q-bio.QM]
  (or arXiv:2005.08894v1 [q-bio.QM] for this version)
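
The spectrogram-and-patch representation described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, frame length, hop size, and patch dimensions are hypothetical choices made for illustration, and a naive DFT stands in for the FFT routine a real pipeline would use so that the example needs only the standard library.

```python
import cmath
import math


def spectrogram(signal, frame_len=16, hop=8):
    """Magnitude spectrogram via a naive DFT (illustrative, not optimized).

    Returns a list of rows indexed [frequency_bin][frame].
    """
    # Periodic Hann window to reduce spectral leakage.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    n_bins = frame_len // 2 + 1  # non-negative frequencies only
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        mags = [
            abs(sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                    for n in range(frame_len)))
            for k in range(n_bins)
        ]
        frames.append(mags)
    # Transpose so that rows are frequency bins and columns are frames.
    return [list(row) for row in zip(*frames)]


def extract_patches(spec, patch_f, patch_t):
    """Split a spectrogram into non-overlapping time-frequency patches.

    Rows and columns that do not fill a whole patch are dropped.
    Returns a list of ((freq_origin, time_origin), patch) pairs.
    """
    n_f, n_t = len(spec), len(spec[0])
    patches = []
    for f0 in range(0, n_f - patch_f + 1, patch_f):
        for t0 in range(0, n_t - patch_t + 1, patch_t):
            patch = [row[t0:t0 + patch_t] for row in spec[f0:f0 + patch_f]]
            patches.append(((f0, t0), patch))
    return patches


# Hypothetical input: a pure tone centered on DFT bin 4 (a stand-in for audio).
tone = [math.sin(2 * math.pi * 4 * n / 16) for n in range(264)]
spec = spectrogram(tone)                 # 9 frequency bins x 32 frames
patches = extract_patches(spec, 4, 8)    # 4-bin x 8-frame patches
```

Each patch keeps its time-frequency origin, so per-patch predictions from a network could be placed back onto the full spectrogram grid to form a confidence map. Whether the authors use overlapping or non-overlapping patches is not stated in the abstract; non-overlapping tiling is assumed here for simplicity.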

Submission history

From: Marie Roch
[v1] Mon, 18 May 2020 17:09:34 GMT (3158kb)