Skip navigation
Full metadata record
DC FieldValueLanguage
dc.contributor.authorPishdadian, Fatemeh
dc.contributor.authorSeetharaman, Prem
dc.contributor.authorKim, Bongjun
dc.contributor.authorPardo, Bryan
dc.date.accessioned2019-10-24T01:50:22Z-
dc.date.available2019-10-24T01:50:22Z-
dc.date.issued2019-10
dc.identifier.citationF. Pishdadian, P. Seetharaman, B. Kim & B. Pardo, "Classifying Non-speech Vocals: Deep vs Signal Processing Representations", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 194–198, New York University, NY, USA, Oct. 2019en
dc.identifier.urihttp://hdl.handle.net/2451/60758-
dc.description.abstractDeep-learning-based audio processing algorithms have become very popular over the past decade. Due to promising results reported for deep-learning-based methods on many tasks, some now argue that signal processing audio representations (e.g. magnitude spectrograms) should be entirely discarded, in favor of learning representations from data using deep networks. In this paper, we compare the effectiveness of representations output by state-of-the-art deep nets trained for a task-specific problem, to off-the-shelf signal processing encoding. We address two tasks: query by vocal imitation and singing technique classification. For query by vocal imitation, experimental results showed deep representations were dominated by signal-processing representations. For singing technique classification, neither approach was clearly dominant. These results indicate it would be premature to abandon traditional signal processing in favor of exclusively using deep networks.en
dc.rightsCopyright The Authors, 2019en
dc.titleClassifying Non-speech Vocals: Deep vs Signal Processing Representationsen
dc.typeArticleen
dc.identifier.DOIhttps://doi.org/10.33682/bczf-wv12
dc.description.firstPage194
dc.description.lastPage198
Appears in Collections:Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

Files in This Item:
File SizeFormat 
DCASE2019Workshop_Pishdadian_51.pdf573.25 kBAdobe PDFView/Open


Items in FDA are protected by copyright, with all rights reserved, unless otherwise indicated.