Residual Recurrent Neural Network For Speech Enhancement

Jalal Abdulbaqi, Yue Gu, Shuhong Chen, Ivan Marsic

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 13:29

04 May 2020

Most current speech enhancement models use spectrogram features that require an expensive transformation and result in phase information loss. Previous work has overcome these issues by using convolutional networks to learn the temporal correlations across high-resolution waveforms. These models, however, are limited by memory-intensive dilated convolution and aliasing artifacts from upsampling. We introduce an end-to-end fully recurrent neural network for single-channel speech enhancement. The network structured as an hourglass-shape that can efficiently capture long-range temporal dependencies by reducing the features resolution without information loss. Also, we use residual connections to prevent gradient decay over layers and improve the model generalization. Experimental results show that our model outperforms state-of-the-art approaches in six quantitative evaluation metrics.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Residual Recurrent Neural Network For Speech Enhancement

Jalal Abdulbaqi, Yue Gu, Shuhong Chen, Ivan Marsic

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join the IEEE Signal Processing Society