Title: | Distilling the Knowledge of Specialist Deep Neural Networks in Acoustic Scene Classification |
Authors: | Jung, Jee-weon; Heo, HeeSoo; Shim, Hye-jin; Yu, Ha-Jin |
Date Issued: | Oct-2019 |
Citation: | J. Jung, H. Heo, H. Shim & H. Yu, "Distilling the Knowledge of Specialist Deep Neural Networks in Acoustic Scene Classification", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 114–118, New York University, NY, USA, Oct. 2019 |
Abstract: | Acoustic scenes that share common properties are one of the main obstacles to successful acoustic scene classification. The two most confusing pairs of acoustic scenes, ‘airport-shopping mall’ and ‘metro-tram’, account for more than half of all misclassified audio segments, demonstrating the need to address these pairs specifically. In this study, we exploited two specialist models in addition to a baseline model and applied the knowledge distillation framework to transfer knowledge from those three models into a single deep neural network. A specialist model is a model that concentrates on discriminating between a pair of similar scenes. We hypothesized that knowledge distillation from multiple specialist models and a pre-trained baseline model into a single model could combine the strengths of each specialist model and achieve an effect similar to an ensemble of these models. In the results of the Detection and Classification of Acoustic Scenes and Events 2019 challenge, the distilled single model showed a classification accuracy of 81.2 %, equivalent to the performance of an ensemble of the baseline and two specialist models. |
First Page: | 114 |
Last Page: | 118 |
DOI: | https://doi.org/10.33682/gqpj-ac63 |
Type: | Article |
Appears in Collections: | Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019) |
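Illustrative note: the abstract describes distilling a baseline model and two specialist models into one network but does not state the loss used. The sketch below is only one common way to set this up, under assumptions that are not taken from the paper: the teachers' temperature-softened outputs are averaged into a single soft target, and the student is scored by the KL divergence from that target. The function names, the temperature value, and the example logits are all hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax, shifted by the max for numerical stability."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits_list, temperature=2.0):
    """KL(mean teacher distribution || student distribution) at one temperature.

    Assumption: teachers are combined by a plain average of their softened
    probabilities; the paper may weight or combine them differently.
    """
    teacher_probs = [softmax(t, temperature) for t in teacher_logits_list]
    n_teachers = len(teacher_probs)
    n_classes = len(student_logits)
    # Soft target: per-class mean over all teachers.
    target = [
        sum(p[i] for p in teacher_probs) / n_teachers for i in range(n_classes)
    ]
    student = softmax(student_logits, temperature)
    # exp() never returns 0 for finite inputs here, so the ratio is defined.
    return sum(t * math.log(t / s) for t, s in zip(target, student) if t > 0)


# Hypothetical logits over four scene classes for one audio segment.
baseline = [2.0, 1.5, 0.2, 0.1]
specialist_a = [3.0, 0.5, 0.0, 0.0]  # e.g. trained to separate classes 0 and 1
specialist_b = [1.0, 1.0, 2.5, 0.3]  # e.g. trained to separate classes 2 and 3
student = [1.8, 1.0, 0.9, 0.1]

print(distillation_loss(student, [baseline, specialist_a, specialist_b]))
```

In practice a term like this is usually added to the ordinary cross-entropy on the ground-truth labels; whether and how the authors do so is described in the paper itself, not in this record.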
Files in This Item:
File | Size | Format |
---|---|---|
DCASE2019Workshop_Jung_38.pdf | 1.01 MB | Adobe PDF |
Items in FDA are protected by copyright, with all rights reserved, unless otherwise indicated.