INPLACE CEPSTRAL SPEECH ENHANCEMENT SYSTEM FOR THE ICASSP 2023 CLARITY CHALLENGE
Jinjiang Liu (College of Computer Science, Inner Mongolia University); Xueliang zhang (Inner Mongolia University)
-
SPS
IEEE Members: $11.00
Non-members: $15.00
This report summarizes our system submission to the ICASSP 2023 Clarity Challenge. The goal of the challenge is to estimate clean binaural speech signals within a 5 ms system delay. Our submitted inplace cepstral speech enhancement (ICSE) system featured in following aspect.First, we developed a low-latency short-time Fourier transform (LL-STFT) analysis and synthesis strategy for a neural network-based speech enhancement algorithm in the time-frequency domain.Second, we designed an end-to-end inplace cepstral speech enhancement neural network that achieves good spatial resolution in an inplace speech enhancement framework. We also combine the cepstrum space speech enhancement with the TF-domain speech enhancement in the proposed system.
Finlay, we employ a speech model based perceptual loss to improve speech intelligibility and quality.The experimental results show that the proposed system significantly outperforms the baseline system and ranked among the top five systems.