Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Cao, Yin | |
dc.contributor.author | Kong, Qiuqiang | |
dc.contributor.author | Iqbal, Turab | |
dc.contributor.author | An, Fengyan | |
dc.contributor.author | Wang, Wenwu | |
dc.contributor.author | Plumbley, Mark | |
dc.date.accessioned | 2019-10-24T01:50:25Z | - |
dc.date.available | 2019-10-24T01:50:25Z | - |
dc.date.issued | 2019-10 | |
dc.identifier.citation | Y. Cao, Q. Kong, T. Iqbal, F. An, W. Wang & M. Plumbley, "Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 30–34, New York University, NY, USA, Oct. 2019 | en |
dc.identifier.uri | http://hdl.handle.net/2451/60775 | - |
dc.description.abstract | Sound event detection (SED) and localization refer to recognizing sound events and estimating their spatial and temporal locations. Using neural networks has become the prevailing method for SED. In the area of sound localization, which is usually performed by estimating the direction of arrival (DOA), learning-based methods have recently been developed. In this paper, it is experimentally shown that the trained SED model is able to contribute to the direction of arrival estimation (DOAE). However, joint training of SED and DOAE degrades the performance of both. Based on these results, a two-stage polyphonic sound event detection and localization method is proposed. The method learns SED first, after which the learned feature layers are transferred for DOAE. It then uses the SED ground truth as a mask to train DOAE. The proposed method is evaluated on the DCASE 2019 Task 3 dataset, which contains different overlapping sound events in different environments. Experimental results show that the proposed method is able to improve the performance of both SED and DOAE, and also performs significantly better than the baseline method. | en |
dc.rights | Distributed under the terms of the Creative Commons Attribution 4.0 International (CC-BY) license. | en |
dc.title | Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy | en |
dc.type | Article | en |
dc.identifier.DOI | https://doi.org/10.33682/4jhy-bj81 | |
dc.description.firstPage | 30 | |
dc.description.lastPage | 34 | |
Appears in Collections: | Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019) |
Files in This Item:
File | Size | Format |
---|---|---|
DCASE2019Workshop_Cao_34.pdf | 1.38 MB | Adobe PDF |