Determined Audio Source Separation with Multichannel Star Generative Adversarial Network

Li Li,Hirokazu Kameoka,Shoji Makino

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 16:14

21 Sep 2020

This paper proposes a multichannel source separation approach, which uses a star generative adversarial network (StarGAN) to model power spectrograms of sources. Various studies have shown the significant contributions of a precise source model to the performance improvement in audio source separation, which indicates the importance of developing a better source model.In this paper, we explore the potential of StarGAN for modeling source spectrograms and investigate the effectiveness of the StarGAN source model in determined multichannel source separation by incorporating it into a frequency-domain independent component analysis (ICA) framework.The experimental results revealed that the proposed StarGAN-based method outperformed conventional methods, which employ non-negative matrix factorization (NMF) or variational autoencoder (VAE) to the source spectrogram modeling.

Tags:

sps conference

mlsp 2020

virtual workshop

mlsp 2020 workshop

September 2020

Determined Audio Source Separation with Multichannel Star Generative Adversarial Network

Li Li,Hirokazu Kameoka,Shoji Makino

Value-Added Bundle(s) Including this Product

MLSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join the IEEE Signal Processing Society