TWO-STAGE NEURAL NETWORK FOR ICASSP 2023 SPEECH SIGNAL IMPROVEMENT CHALLENGE
Mingshuai Liu (NWPU); Shubo Lv (Shaanxi Provincial Key Laboratory of Speech and Image Information Processing, School of Computer Science, Northwestern Polytechnical University); Zihan Zhang (Northwestern Polytechnical University); Runduo Han (Northwestern Polytechnical University); Xiang Hao (NWPU); Xianjun Xia (ByteDance); Li Chen (ByteDance ); Yijian Xiao (ByteDance); Lei Xie (NWPU)
-
SPS
IEEE Members: $11.00
Non-members: $15.00
In ICASSP 2023 speech signal improvement challenge, we developed a dual-stage neural model which improves speech signal quality induced by different distortions in a stage-wise divide-and-conquer fashion. Specifically, in the first stage, the speech improvement network focuses on recovering the missing components of the spectrum, while in the second stage, our model aims to further suppress noise, reverberation, and artifacts introduced by the first-stage model. Achieving 0.446 in the final score and 0.517 in the P.835 score, our system ranks 4th in the non-real-time track.