Skip navigation
Full metadata record
DC FieldValueLanguage
dc.contributor.authorCordourier, Hector
dc.contributor.authorLopez Meyer, Paulo
dc.contributor.authorHuang, Jonathan
dc.contributor.authorDel Hoyo Ontiveros, Juan
dc.contributor.authorLu, Hong
dc.date.accessioned2019-10-24T01:50:14Z-
dc.date.available2019-10-24T01:50:14Z-
dc.date.issued2019-10
dc.identifier.citationH. Cordourier, P. Meyer, J. Huang, J. Ontiveros & H. Lu, "GCC-PHAT Cross-Correlation Audio Features for Simultaneous Sound Event Localization and Detection (SELD) on Multiple Rooms", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 55–58, New York University, NY, USA, Oct. 2019en
dc.identifier.urihttp://hdl.handle.net/2451/60727-
dc.description.abstractIn this work, we show a simultaneous sound event localization and detection (SELD) system, with enhanced acoustic features, in which we propose using the well-known Generalized Cross Correlation (GCC) PATH algorithm, to augment the magnitude and phase regular Fourier spectra features at each frame. GCC-PHAT has already been used for some time to calculate the Time Difference of Arrival (TDOA) in simultaneous audio signals, in moderately reverberant environments, using classic signal processing techniques, and can assist audio source localization in current deep learning machines. The neural net architecture we used is a Convolutional Recurrent Neural Network (CRNN), and is tested using the sound database prepared for the Task 3 of the 2019 DCASE Challenge. In the challenge results, our proposed system was able to achieve 20.8° of direction of arrival error, 85.6\% frame recall, 86.5\% F-score and 0.22 error rate detection in evaluation samples.en
dc.rightsCopyright The Authors, 2019en
dc.titleGCC-PHAT Cross-Correlation Audio Features for Simultaneous Sound Event Localization and Detection (SELD) on Multiple Roomsen
dc.typeArticleen
dc.identifier.DOIhttps://doi.org/10.33682/3re4-nd65
dc.description.firstPage55
dc.description.lastPage58
Appears in Collections:Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

Files in This Item:
File SizeFormat 
DCASE2019Workshop_CordourierMaruri_59.pdf1.14 MBAdobe PDFView/Open


Items in FDA are protected by copyright, with all rights reserved, unless otherwise indicated.