Acoustic Scene Classification from Binaural Signals using Convolutional Neural Networks

Mars, Rohith; Pratik, Pranay; Nagisetty, Srikanth; Lim, Chongsoon

doi:https://doi.org/10.33682/6c9z-gd15

Title:	Acoustic Scene Classification from Binaural Signals using Convolutional Neural Networks
Authors:	Mars, Rohith Pratik, Pranay Nagisetty, Srikanth Lim, Chongsoon
Date Issued:	Oct-2019
Citation:	R. Mars, P. Pratik, S. Nagisetty & C. Lim, "Acoustic Scene Classification from Binaural Signals using Convolutional Neural Networks", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 149–153, New York University, NY, USA, Oct. 2019
Abstract:	In this paper, we present the details of our proposed framework and solution for the DCASE 2019 Task 1A - Acoustic Scene Classification challenge. We describe the audio pre-processing, feature extraction steps and the time-frequency (TF) representations employed for acoustic scene classification using binaural recordings. We propose two distinct and light-weight architectures of convolutional neural networks (CNNs) for processing the extracted audio features and classification. The performance of both these architectures are compared in terms of classification accuracy as well as model complexity. Using an ensemble of the predictions from the subset of models based on the above CNNs, we achieved an average classification accuracy of 79.35% on the test split of the development dataset for this task. In the Kaggle’s private leaderboard, oursolution was ranked 4th with a system score of 83.16% — an improvement of ≈ 20% over the baseline system.
First Page:	149
Last Page:	153
DOI:	https://doi.org/10.33682/6c9z-gd15
Type:	Article
Appears in Collections:	Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

Files in This Item:

File	Size	Format
DCASE2019Workshop_Mars_73.pdf	440.17 kB	Adobe PDF	View/Open

Show full item record