SPEECH DENOISING IN THE WAVEFORM DOMAIN WITH SELF-ATTENTION

Zhifeng Kong, Wei Ping, Ambrish Dantrey, Bryan Catanzaro

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 00:13:36

12 May 2022

In this work, we present CleanUNet, a causal speech denoising model that operates on the raw waveform. The proposed model is based on an encoder-decoder architecture combined with several self-attention blocks that refine its bottleneck representations, which is crucial for obtaining good results. The model is optimized with a set of losses defined over both the waveform and multi-resolution spectrograms. The proposed method outperforms state-of-the-art models in denoised speech quality across various objective and subjective evaluation metrics.
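To illustrate what a loss "defined over both waveform and multi-resolution spectrograms" can look like, here is a minimal NumPy sketch. It is not the authors' implementation: the abstract does not give the individual loss terms, so the L1 waveform term, the spectral-convergence and log-magnitude terms, the three (FFT size, hop) resolutions, and the function names `stft_magnitude`, `multi_resolution_stft_loss`, and `denoising_loss` are all assumptions chosen for illustration.

```python
import numpy as np


def stft_magnitude(x, n_fft, hop):
    """Magnitude spectrogram from Hann-windowed frames (no padding)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    # One row per frame, n_fft // 2 + 1 frequency bins per row.
    return np.abs(np.fft.rfft(frames, axis=1))


def multi_resolution_stft_loss(
    clean,
    denoised,
    resolutions=((512, 128), (1024, 256), (2048, 512)),  # assumed values
    eps=1e-8,
):
    """Average a spectral loss over several (n_fft, hop) resolutions."""
    total = 0.0
    for n_fft, hop in resolutions:
        s_clean = stft_magnitude(clean, n_fft, hop)
        s_den = stft_magnitude(denoised, n_fft, hop)
        # Spectral convergence: relative Frobenius-norm error (assumed term).
        sc = np.linalg.norm(s_clean - s_den) / (np.linalg.norm(s_clean) + eps)
        # Mean absolute difference of log magnitudes (assumed term).
        log_mag = np.mean(np.abs(np.log(s_clean + eps) - np.log(s_den + eps)))
        total += sc + log_mag
    return total / len(resolutions)


def denoising_loss(clean, denoised):
    """Waveform L1 term plus the multi-resolution spectrogram term."""
    waveform_l1 = np.mean(np.abs(clean - denoised))
    return waveform_l1 + multi_resolution_stft_loss(clean, denoised)
```

Evaluating the spectrogram term at several window sizes trades off time and frequency resolution, so that no single analysis setting dominates the objective. In this sketch the loss is zero when the denoised signal equals the clean one and positive otherwise.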

Tags: raw waveform, self-attention, u-net, speech enhancement, speech denoising

Value-Added Bundle(s) Including this Product

22 May 2022

ICASSP 2022, May 2022 Virtual and In-Person Conference - Presentation Videos Product Bundle