Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | He, Kexin | |
dc.contributor.author | Shen, Yuhan | |
dc.contributor.author | Zhang, Wei-Qiang | |
dc.date.accessioned | 2019-10-24T01:50:16Z | - |
dc.date.available | 2019-10-24T01:50:16Z | - |
dc.date.issued | 2019-10 | |
dc.identifier.citation | K. He, Y. Shen & W. Zhang, "Multiple Neural Networks with Ensemble Method for Audio Tagging with Noisy Labels and Minimal Supervision", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 89–93, New York University, NY, USA, Oct. 2019 | en |
dc.identifier.uri | http://hdl.handle.net/2451/60734 | - |
dc.description.abstract | In this paper, we describe our system for Task 2 of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2019 Challenge: Audio tagging with noisy labels and minimal supervision. This task provides a small amount of verified data (curated data) and a larger quantity of unverified data (noisy data) as training data. Each audio clip contains one or more sound events, so it can be considered a multi-label audio classification task. To tackle this problem, we mainly use four strategies. The first is a sigmoid-softmax activation to deal with so-called sparse multi-label classification. The second is a staged training strategy to learn from noisy data. The third is a post-processing method that normalizes output scores for each sound class. The last is an ensemble method that averages models learned with multiple neural networks and various acoustic features. All of the above strategies contribute significantly to our system. Our final system achieved label-weighted label-ranking average precision (lwlrap) scores of 0.758 on the private test dataset and 0.742 on the public test dataset, winning 2nd place in DCASE 2019 Challenge Task 2. | en |
dc.rights | Copyright The Authors, 2019 | en |
dc.title | Multiple Neural Networks with Ensemble Method for Audio Tagging with Noisy Labels and Minimal Supervision | en |
dc.type | Article | en |
dc.identifier.DOI | https://doi.org/10.33682/r7nr-v396 | |
dc.description.firstPage | 89 | |
dc.description.lastPage | 93 | |
Appears in Collections: | Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019) |
Files in This Item:
File | Size | Format |
---|---|---|
DCASE2019Workshop_He_26.pdf | 712.42 kB | Adobe PDF |
Items in FDA are protected by copyright, with all rights reserved, unless otherwise indicated.