Multichannel Singing Voice Separation by Deep Neural Network Informed DOA Constrained CMNMF
Antonio Jesús Muñoz-Montoro, Archontis Politis, Konstantinos Drossos, Julio José Carabias-Orti
This work addresses the problem of multichannel source separation by combining two powerful approaches: multichannel spectral factorization and recent monophonic deep learning (DL)-based spectrum inference. Individual source spectra at the different channels are estimated with a Masker-Denoiser twin network capable of modeling the long-term temporal patterns of a musical piece. The monophonic source spectrograms are then used within a spatial covariance mixing model, based on complex-valued multichannel non-negative matrix factorization (CMNMF), that estimates the spatial characteristics of each source. The proposed framework is evaluated on the task of singing voice separation using a large multichannel dataset. Experimental results show that the joint DL+CMNMF method outperforms both the monophonic DL-based separation and the multichannel CMNMF baseline.
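To illustrate how the two stages fit together, below is a minimal NumPy sketch. A placeholder array `v` stands in for the monophonic power spectrograms that the Masker-Denoiser network would produce, and a simplified spatial-covariance EM loop (in the spirit of multichannel Wiener filtering) plays the role of the spatial model; it omits the paper's DOA constraints and the exact CMNMF updates. All array names and the random input are illustrative assumptions, not the authors' implementation.

```python
# Sketch: fix per-source spectra from a monophonic DL model, then estimate
# spatial covariances with an EM loop (simplified spatial model, NOT the
# paper's exact DOA-constrained CMNMF). All names are illustrative.
import numpy as np

rng = np.random.default_rng(0)

F, T, C, J = 257, 100, 2, 2   # freq bins, time frames, channels, sources

# Multichannel mixture STFT X[f, t, c] (random placeholder data here).
X = rng.standard_normal((F, T, C)) + 1j * rng.standard_normal((F, T, C))

# Stand-in for the Masker-Denoiser network output: per-source monophonic
# power spectrograms v[j, f, t], treated as fixed during the EM loop.
v = rng.random((J, F, T)) + 1e-3

# Spatial covariance matrices R[j, f] (C x C), initialised to identity.
R = np.tile(np.eye(C, dtype=complex), (J, F, 1, 1))

eps = 1e-8
for _ in range(10):                                   # EM iterations
    # E-step: per-source covariances and multichannel Wiener filtering.
    Cj = v[..., None, None] * R[:, :, None]           # (J, F, T, C, C)
    Cx = Cj.sum(axis=0) + eps * np.eye(C)             # mixture covariance
    W = Cj @ np.linalg.inv(Cx)                        # Wiener gains
    S = np.einsum('jftcd,ftd->jftc', W, X)            # source images
    # Posterior second-order moment: S S^H + (I - W) Cj.
    SS = S[..., :, None] * S[..., None, :].conj()
    post = SS + Cj - W @ Cj
    # M-step: re-estimate each R[j, f] from the posterior statistics,
    # averaging over time with the fixed DL spectra as weights.
    R = (post / (v[..., None, None] + eps)).mean(axis=2)

# S[j] now holds the separated multichannel image of source j.
print(S.shape)  # (J, F, T, C)
```

Keeping `v` fixed while re-estimating only the spatial parameters mirrors the division of labour described in the abstract: the DL front end supplies the spectral content, and the spatial model accounts for how each source reaches the microphones.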