Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Cordourier, Hector | |
dc.contributor.author | Lopez Meyer, Paulo | |
dc.contributor.author | Huang, Jonathan | |
dc.contributor.author | Del Hoyo Ontiveros, Juan | |
dc.contributor.author | Lu, Hong | |
dc.date.accessioned | 2019-10-24T01:50:14Z | - |
dc.date.available | 2019-10-24T01:50:14Z | - |
dc.date.issued | 2019-10 | |
dc.identifier.citation | H. Cordourier, P. Meyer, J. Huang, J. Ontiveros & H. Lu, "GCC-PHAT Cross-Correlation Audio Features for Simultaneous Sound Event Localization and Detection (SELD) on Multiple Rooms", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 55–58, New York University, NY, USA, Oct. 2019 | en |
dc.identifier.uri | http://hdl.handle.net/2451/60727 | - |
dc.description.abstract | In this work, we show a simultaneous sound event localization and detection (SELD) system, with enhanced acoustic features, in which we propose using the well-known Generalized Cross Correlation (GCC) PATH algorithm, to augment the magnitude and phase regular Fourier spectra features at each frame. GCC-PHAT has already been used for some time to calculate the Time Difference of Arrival (TDOA) in simultaneous audio signals, in moderately reverberant environments, using classic signal processing techniques, and can assist audio source localization in current deep learning machines. The neural net architecture we used is a Convolutional Recurrent Neural Network (CRNN), and is tested using the sound database prepared for the Task 3 of the 2019 DCASE Challenge. In the challenge results, our proposed system was able to achieve 20.8° of direction of arrival error, 85.6\% frame recall, 86.5\% F-score and 0.22 error rate detection in evaluation samples. | en |
dc.rights | Copyright The Authors, 2019 | en |
dc.title | GCC-PHAT Cross-Correlation Audio Features for Simultaneous Sound Event Localization and Detection (SELD) on Multiple Rooms | en |
dc.type | Article | en |
dc.identifier.DOI | https://doi.org/10.33682/3re4-nd65 | |
dc.description.firstPage | 55 | |
dc.description.lastPage | 58 | |
Appears in Collections: | Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019) |
Files in This Item:
File | Size | Format | |
---|---|---|---|
DCASE2019Workshop_CordourierMaruri_59.pdf | 1.14 MB | Adobe PDF | View/Open |
Items in FDA are protected by copyright, with all rights reserved, unless otherwise indicated.