Audio Tagging with Noisy Labels and Minimal Supervision

Fonseca, Eduardo; Plakal, Manoj; Font, Frederic; Ellis, Daniel P.W.; Serra, Xavier

doi:https://doi.org/10.33682/w13e-5v06

Title:	Audio Tagging with Noisy Labels and Minimal Supervision
Authors:	Fonseca, Eduardo Plakal, Manoj Font, Frederic Ellis, Daniel P.W. Serra, Xavier
Date Issued:	Oct-2019
Citation:	E. Fonseca, M. Plakal, F. Font, D. Ellis & X. Serra, "Audio Tagging with Noisy Labels and Minimal Supervision", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 69–73, New York University, NY, USA, Oct. 2019
Abstract:	This paper introduces Task 2 of the DCASE2019 Challenge, titled "Audio tagging with noisy labels and minimal supervision". This task was hosted on the Kaggle platform as "Freesound Audio Tagging 2019". The task evaluates systems for multi-label audio tagging using a large set of noisy-labeled data, and a much smaller set of manually-labeled data, under a large vocabulary setting of 80 everyday sound classes. In addition, the proposed dataset poses an acoustic mismatch problem between the noisy train set and the test set due to the fact that they come from different web audio sources. This can correspond to a realistic scenario given by the difficulty of gathering large amounts of manually labeled data. We present the task setup, the FSDKaggle2019 dataset prepared for this scientific evaluation, and a baseline system consisting of a convolutional neural network. All these resources are freely available.
First Page:	69
Last Page:	73
DOI:	https://doi.org/10.33682/w13e-5v06
Type:	Article
Appears in Collections:	Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

Files in This Item:

File	Size	Format
DCASE2019Workshop_Fonseca_33.pdf	579.52 kB	Adobe PDF	View/Open

Show full item record