We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess.AS

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Learnable Acoustic Frontends in Bird Activity Detection

Abstract: Autonomous recording units and passive acoustic monitoring present minimally intrusive methods of collecting bioacoustics data. Combining this data with species agnostic bird activity detection systems enables the monitoring of activity levels of bird populations. Unfortunately, variability in ambient noise levels and subject distance contribute to difficulties in accurately detecting bird activity in recordings. The choice of acoustic frontend directly affects the impact these issues have on system performance. In this paper, we benchmark traditional fixed-parameter acoustic frontends against the new generation of learnable frontends on a wide-ranging bird audio detection task using data from the DCASE2018 BAD Challenge. We observe that Per-Channel Energy Normalization is the best overall performer, achieving an accuracy of 89.9%, and that in general learnable frontends significantly outperform traditional methods. We also identify challenges in learning filterbanks for bird audio.
Comments: Submitted and presented at IWAENC, September 2022, 5 Pages, 1 Figure, 3 Tables
Subjects: Audio and Speech Processing (eess.AS)
Cite as: arXiv:2210.00889 [eess.AS]
  (or arXiv:2210.00889v1 [eess.AS] for this version)

Submission history

From: Mark Anderson [view email]
[v1] Mon, 3 Oct 2022 12:54:26 GMT (2012kb,D)

Link back to: arXiv, form interface, contact.