Determined Audio Source Separation with Multichannel Star Generative Adversarial Network
Li Li,Hirokazu Kameoka,Shoji Makino
-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 16:14
This paper proposes a multichannel source separation approach, which uses a star generative adversarial network (StarGAN) to model power spectrograms of sources. Various studies have shown the significant contributions of a precise source model to the performance improvement in audio source separation, which indicates the importance of developing a better source model.In this paper, we explore the potential of StarGAN for modeling source spectrograms and investigate the effectiveness of the StarGAN source model in determined multichannel source separation by incorporating it into a frequency-domain independent component analysis (ICA) framework.The experimental results revealed that the proposed StarGAN-based method outperformed conventional methods, which employ non-negative matrix factorization (NMF) or variational autoencoder (VAE) to the source spectrogram modeling.