Title: | Sound Event Detection and Direction of Arrival Estimation using Residual Net and Recurrent Neural Networks |
Authors: | Ranjan, Rishabh Jayabalan, Sathish Nguyen, Thi Ngoc Tho Gan, Woon Seng |
Date Issued: | Oct-2019 |
Citation: | R. Ranjan, S. Jayabalan, T. Nguyen & W. Gan, "Sound Event Detection and Direction of Arrival Estimation using Residual Net and Recurrent Neural Networks", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 214–218, New York University, NY, USA, Oct. 2019 |
Abstract: | This paper presents deep learning approach for sound events detection and localization, which is also a part of detection and classification of acoustic scenes and events (DCASE) challenge 2019 Task 3. Deep residual nets originally used for image classification are adapted and combined with recurrent neural networks (RNN) to estimate the onset-offset of sound events, sound events class, and their direction in a reverberant environment. Additionally, data augmentation and post processing techniques are applied to generalize and improve the system performance on unseen data. Using our best model on validation dataset, sound events detection achieves F1-score of 0.89 and error rate of 0.18, whereas sound source localization task achieves angular error of 8° and 90% frame recall. |
First Page: | 214 |
Last Page: | 218 |
DOI: | https://doi.org/10.33682/93dp-f064 |
Type: | Article |
Appears in Collections: | Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019) |
Files in This Item:
File | Size | Format | |
---|---|---|---|
DCASE2019Workshop_Ranjan_40.pdf | 671.76 kB | Adobe PDF | View/Open |
Items in FDA are protected by copyright, with all rights reserved, unless otherwise indicated.