QUATERNION NEURAL NETWORKS FOR 3D SOUND SOURCE LOCALIZATION IN REVERBERANT ENVIRONMENTS

Michela Ricciardi Celsi, Simone Scardapane, Danilo Comminiello


Length: 15:25

21 Sep 2020

Localizing sound sources in 3D sound fields is an extremely challenging task, especially when the environment is reverberant and contains multiple sources. In this work, we propose a deep neural network that analyzes audio signals recorded by 3D microphones and localizes sound sources in a spatial sound field. In particular, we use first-order Ambisonics microphones to capture 3D acoustic signals and represent them by spherical harmonic decomposition in the quaternion domain. Moreover, to improve localization performance, we use quaternion input features derived from the acoustic intensity, which is closely related to the direction of arrival (DOA) of a sound source. The proposed network architecture combines quaternion-valued convolutional and recurrent layers. Results show that the proposed method exploits the quaternion-valued representation of ambisonic signals and improves localization performance with respect to existing methods.
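To illustrate why the acoustic intensity is related to the DOA, the following is a minimal sketch rather than the authors' feature pipeline. It assumes a single plane-wave source in anechoic conditions, a W, X, Y, Z channel order with an unnormalized encoding, and a sign convention in which the time-averaged product of the omnidirectional channel with the three directional channels points toward the source. The function names `encode_foa` and `intensity_doa` are hypothetical, and reverberation and multiple sources are not modeled.

```python
import numpy as np

def encode_foa(s, azimuth, elevation):
    """Encode a mono signal as first-order Ambisonics (assumed W, X, Y, Z order)."""
    w = s
    x = s * np.cos(azimuth) * np.cos(elevation)
    y = s * np.sin(azimuth) * np.cos(elevation)
    z = s * np.sin(elevation)
    return w, x, y, z

def intensity_doa(w, x, y, z):
    """Estimate (azimuth, elevation) in radians from time-domain FOA signals."""
    # Time-averaged intensity proxy: omnidirectional channel times each
    # directional channel. Physical scale factors are dropped because
    # only the direction of the vector is used.
    ix, iy, iz = np.mean(w * x), np.mean(w * y), np.mean(w * z)
    azimuth = np.arctan2(iy, ix)
    elevation = np.arctan2(iz, np.hypot(ix, iy))
    return azimuth, elevation

# Usage: a noise source placed at a known direction is recovered.
rng = np.random.default_rng(0)
source = rng.standard_normal(16000)
est_az, est_el = intensity_doa(*encode_foa(source, 0.7, -0.3))
```

In this idealized setting the estimate matches the encoded direction up to floating-point error. In reverberant rooms the intensity vector is perturbed by reflections, which is the case the learned model is meant to handle.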

Tags: sps conference, mlsp 2020, virtual workshop, mlsp 2020 workshop, September 2020

