Redundant Convolutional Network With Attention Mechanism For Monaural Speech Enhancement
Tian Lan, Yilan Lyu, Refuoe Mokhosi, Sen Li, Qiao Liu, Guoqiang Hui
-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 12:06
The redundant convolutional encoder decoder network has proven useful in speech enhancement tasks. It can capture localized time-frequency details of speech signals through both the fully convolutional network structure and feature selection capability resulting from the encoder-decoder mechanism. However, it does not explicitly consider the signal filtering mechanism, which we consider important for speech enhancement models. In this study, we introduce an attention mechanism into the convolutional encoder-decoder model. This mechanism adaptively filters channel-wise feature responses by explicitly modeling attentions (on speech versus noise signals) between channels. Experimental results show that the proposed attention model is effective in capturing speech signals from background noise, and performs especially better in unseen noise conditions compared to other state-of-the-art models.