Skip navigation
Title: 

First Order Ambisonics Domain Spatial Augmentation for DNN-based Direction of Arrival Estimation

Authors: Mazzon, Luca
Koizumi, Yuma
Yasuda, Masahiro
Harada, Noboru
Date Issued: Oct-2019
Citation: L. Mazzon, Y. Koizumi, M. Yasuda & N. Harada, "First Order Ambisonics Domain Spatial Augmentation for DNN-based Direction of Arrival Estimation", Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), pages 154–158, New York University, NY, USA, Oct. 2019
Abstract: In this paper, we propose a novel data augmentation method for training neural networks for Direction of Arrival (DOA) estimation. This method focuses on expanding the representation of the DOA subspace of a dataset. Given some input data, it applies a transformation to it in order to change its DOA information and simulate new potentially unseen one. Such transformation, in general, is a combination of a rotation and a reflection. It is possible to apply such transformation due to a well-known property of FirstOrder Ambisonics (FOA). The same transformation is applied also to the labels, in order to maintain consistency between input data and target labels. Three methods with different level of generality are proposed for applying this augmentation principle. Experiments are conducted on two different DOA networks. Results of both experiments demonstrate the effectiveness of the novel augmentation strategy by improving the DOA error by around 40%.
First Page: 154
Last Page: 158
DOI: https://doi.org/10.33682/3qgs-e216
Type: Article
Appears in Collections:Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)

Files in This Item:
File SizeFormat 
DCASE2019Workshop_Mazzon_79.pdf706.87 kBAdobe PDFView/Open


Items in FDA are protected by copyright, with all rights reserved, unless otherwise indicated.