Enhancing End-To-End Multi-Channel Speech Separation Via Spatial Feature Learning

Rongzhi Gu, Yuexian Zou, Shi-Xiong Zhang, Yong Xu, Dong Yu, Lianwu Chen, Meng Yu, Dan Su

Length: 13:03

04 May 2020

Hand-crafted spatial features (e.g., inter-channel phase difference, IPD) play a fundamental role in recent deep-learning-based multi-channel speech separation (MCSS) methods. However, these manually designed spatial features are hard to incorporate into an end-to-end optimized MCSS framework. In this work, we propose an integrated architecture that learns spatial features directly from the multi-channel speech waveforms within an end-to-end speech separation framework. In this architecture, time-domain filters spanning the signal channels are trained to perform adaptive spatial filtering. These filters are implemented by a 2D convolution (conv2d) layer, and their parameters are optimized with a speech separation objective function in a purely data-driven fashion. Furthermore, inspired by the IPD formulation, we design a conv2d kernel that computes inter-channel convolution differences (ICDs), which are expected to provide spatial cues that help distinguish directional sources. Evaluation results on a simulated multi-channel reverberant WSJ0 2-mix dataset demonstrate that the proposed ICD-based MCSS model improves the overall signal-to-distortion ratio by 10.4% over the IPD-based MCSS model.
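The ICD idea in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation. It assumes that an ICD is the difference between the outputs of one shared time-domain filter bank applied to the two channels of a microphone pair, by analogy with how IPD subtracts the phases of two channels. The function names (`frame_signal`, `channel_conv`, `icd`), the filter sizes, and the microphone pairs are all hypothetical, and the kernels are random stand-ins for weights that would be learned from a separation objective.

```python
import numpy as np


def frame_signal(x, win, hop):
    """Slice a 1-D waveform into overlapping frames of shape (num_frames, win)."""
    num_frames = 1 + (len(x) - win) // hop
    idx = np.arange(win)[None, :] + hop * np.arange(num_frames)[:, None]
    return x[idx]


def channel_conv(x, kernels, hop):
    """Apply a bank of time-domain filters to one channel.

    kernels has shape (num_filters, win); the result has shape
    (num_filters, num_frames), like a strided 1-D convolution.
    """
    frames = frame_signal(x, kernels.shape[1], hop)
    return kernels @ frames.T


def icd(waveforms, kernels, pairs, hop):
    """Assumed ICD: filter both channels of each pair with the same
    kernels, then subtract. Returns (num_pairs, num_filters, num_frames)."""
    return np.stack([
        channel_conv(waveforms[a], kernels, hop)
        - channel_conv(waveforms[b], kernels, hop)
        for a, b in pairs
    ])


rng = np.random.default_rng(0)
num_channels, num_samples = 4, 1600      # hypothetical array and clip size
win, hop, num_filters = 40, 20, 8        # hypothetical filter settings

waveforms = rng.standard_normal((num_channels, num_samples))
kernels = rng.standard_normal((num_filters, win))  # learned in practice
pairs = [(0, 1), (0, 2), (0, 3)]         # hypothetical microphone pairs

features = icd(waveforms, kernels, pairs, hop)
print(features.shape)  # (3, 8, 79)
```

Under this assumption the feature is zero whenever the two channels of a pair carry identical signals, so any non-zero response reflects inter-channel differences, which is the kind of spatial cue the abstract describes. In a conv2d formulation, the same computation would correspond to a kernel that spans two channel rows with weights of opposite sign.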

Tags: sps conference, icassp 2020 virtual conference, May 2020, icassp 2020
