Skip navigation
Title: 

Long-distance Detection of Bioacoustic Events with Per-channel Energy Normalization

Authors: Lostanlen, Vincent
Palmer, Kaitlin
Knight, Elly
Clark, Christopher
Klinck, Holger
Farnsworth, Andrew
Wong, Tina
Cramer, Aurora
Bello, Juan
Date Issued: Oct-2019
Citation: V. Lostanlen, K. Palmer, E. Knight, C. Clark, H. Klinck, A. Farnsworth, T. Wong, A. Cramer & J. Bello, "Long-distance Detection of Bioacoustic Events with Per-channel Energy Normalization", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 144–148, New York University, NY, USA, Oct. 2019
Abstract: This paper proposes to perform unsupervised detection of bioacoustic events by pooling the magnitudes of spectrogram frames after per-channel energy normalization (PCEN). Although PCEN was originally developed for speech recognition, it also has beneficial effects in enhancing animal vocalizations, despite the presence of atmospheric absorption and intermittent noise. We prove that PCEN generalizes logarithm-based spectral flux, yet with a tunable time scale for background noise estimation. In comparison with pointwise logarithm, PCEN reduces false alarm rate by 50x in the near field and 5x in the far field, both on avian and marine bioacoustic datasets. Such improvements come at moderate computational cost and require no human intervention, thus heralding a promising future for PCEN in bioacoustics.
First Page: 144
Last Page: 148
DOI: https://doi.org/10.33682/ts6e-sn53
Type: Article
Appears in Collections:Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

Files in This Item:
File SizeFormat 
DCASE2019Workshop_Lostanlen_60.pdf3.58 MBAdobe PDFView/Open


Items in FDA are protected by copyright, with all rights reserved, unless otherwise indicated.