Single Channel Voice Separation For Unknown Number Of Speakers Under Reverberant And Noisy Settings

Shlomo E. Chazan, Lior Wolf, Eliya Nachmani, Yossi Adi

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 00:09:30

10 Jun 2021

We present a unified network for voice separation of an unknown number of speakers. The proposed approach is composed of several separation heads optimized together with a speaker classification branch. The separation is carried out in the time domain, together with parameter sharing between all separation heads. The classification branch estimates the number of speakers while each head is specialized in separating a different number of speakers. We evaluate the proposed model under both clean and noisy reverberant settings. Results suggest that the proposed approach is superior to the baseline model by a significant margin. Additionally, we present a new noisy and reverberant dataset of up to five different speakers speaking simultaneously.

Chairs:

Raviv Raich

Tags:

signal processing society

IEEE icassp 2021

virtual conference

2021

sps

virtual conference icassp 2021

june 6-11 2021

icassp 2021

Single Channel Voice Separation For Unknown Number Of Speakers Under Reverberant And Noisy Settings

Shlomo E. Chazan, Lior Wolf, Eliya Nachmani, Yossi Adi

Value-Added Bundle(s) Including this Product

ICASSP 2021 Virtual Conference - Presentation Videos Product Bundle

More Like This

From Supervised To Unsupervised Harmonization Of Diffusion MRI Acquisitions

Scheme And Dataset For Evaluating Computer-Aided Polyp Detection System In Colonoscopy

From U-Net To Transformers: Navigating Through Key Advances In Medical Image Segmentation: Part 1

Join the IEEE Signal Processing Society