Skip navigation
Title: 

Sound Event Detection and Direction of Arrival Estimation using Residual Net and Recurrent Neural Networks

Authors: Ranjan, Rishabh
Jayabalan, Sathish
Nguyen, Thi Ngoc Tho
Gan, Woon Seng
Date Issued: Oct-2019
Citation: R. Ranjan, S. Jayabalan, T. Nguyen & W. Gan, "Sound Event Detection and Direction of Arrival Estimation using Residual Net and Recurrent Neural Networks", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 214–218, New York University, NY, USA, Oct. 2019
Abstract: This paper presents deep learning approach for sound events detection and localization, which is also a part of detection and classification of acoustic scenes and events (DCASE) challenge 2019 Task 3. Deep residual nets originally used for image classification are adapted and combined with recurrent neural networks (RNN) to estimate the onset-offset of sound events, sound events class, and their direction in a reverberant environment. Additionally, data augmentation and post processing techniques are applied to generalize and improve the system performance on unseen data. Using our best model on validation dataset, sound events detection achieves F1-score of 0.89 and error rate of 0.18, whereas sound source localization task achieves angular error of 8° and 90% frame recall.
First Page: 214
Last Page: 218
DOI: https://doi.org/10.33682/93dp-f064
Type: Article
Appears in Collections:Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

Files in This Item:
File SizeFormat 
DCASE2019Workshop_Ranjan_40.pdf671.76 kBAdobe PDFView/Open


Items in FDA are protected by copyright, with all rights reserved, unless otherwise indicated.