Title: | Acoustic Scene Classification from Binaural Signals using Convolutional Neural Networks |
Authors: | Mars, Rohith Pratik, Pranay Nagisetty, Srikanth Lim, Chongsoon |
Date Issued: | Oct-2019 |
Citation: | R. Mars, P. Pratik, S. Nagisetty & C. Lim, "Acoustic Scene Classification from Binaural Signals using Convolutional Neural Networks", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 149–153, New York University, NY, USA, Oct. 2019 |
Abstract: | In this paper, we present the details of our proposed framework and solution for the DCASE 2019 Task 1A - Acoustic Scene Classification challenge. We describe the audio pre-processing, feature extraction steps and the time-frequency (TF) representations employed for acoustic scene classification using binaural recordings. We propose two distinct and light-weight architectures of convolutional neural networks (CNNs) for processing the extracted audio features and classification. The performance of both these architectures are compared in terms of classification accuracy as well as model complexity. Using an ensemble of the predictions from the subset of models based on the above CNNs, we achieved an average classification accuracy of 79.35% on the test split of the development dataset for this task. In the Kaggle’s private leaderboard, oursolution was ranked 4th with a system score of 83.16% — an improvement of ≈ 20% over the baseline system. |
First Page: | 149 |
Last Page: | 153 |
DOI: | https://doi.org/10.33682/6c9z-gd15 |
Type: | Article |
Appears in Collections: | Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019) |
Files in This Item:
File | Size | Format | |
---|---|---|---|
DCASE2019Workshop_Mars_73.pdf | 440.17 kB | Adobe PDF | View/Open |
Items in FDA are protected by copyright, with all rights reserved, unless otherwise indicated.