Title: | First Order Ambisonics Domain Spatial Augmentation for DNN-based Direction of Arrival Estimation |
Authors: | Mazzon, Luca Koizumi, Yuma Yasuda, Masahiro Harada, Noboru |
Date Issued: | Oct-2019 |
Citation: | L. Mazzon, Y. Koizumi, M. Yasuda & N. Harada, "First Order Ambisonics Domain Spatial Augmentation for DNN-based Direction of Arrival Estimation", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 154–158, New York University, NY, USA, Oct. 2019 |
Abstract: | In this paper, we propose a novel data augmentation method for training neural networks for Direction of Arrival (DOA) estimation. This method focuses on expanding the representation of the DOA subspace of a dataset. Given some input data, it applies a transformation to it in order to change its DOA information and simulate new potentially unseen one. Such transformation, in general, is a combination of a rotation and a reflection. It is possible to apply such transformation due to a well-known property of FirstOrder Ambisonics (FOA). The same transformation is applied also to the labels, in order to maintain consistency between input data and target labels. Three methods with different level of generality are proposed for applying this augmentation principle. Experiments are conducted on two different DOA networks. Results of both experiments demonstrate the effectiveness of the novel augmentation strategy by improving the DOA error by around 40%. |
First Page: | 154 |
Last Page: | 158 |
DOI: | https://doi.org/10.33682/3qgs-e216 |
Type: | Article |
Appears in Collections: | Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019) |
Files in This Item:
File | Size | Format | |
---|---|---|---|
DCASE2019Workshop_Mazzon_79.pdf | 706.87 kB | Adobe PDF | View/Open |
Items in FDA are protected by copyright, with all rights reserved, unless otherwise indicated.