Cp-Gan: Context Pyramid Generative Adversarial Network For Speech Enhancement

Gang Liu, Ke Gong, Xiaodan Liang, Zhiguang Chen

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 13:26

04 May 2020

The topic of speech enhancement has been largely improved recently, especially with the development of generative adversarial networks (GANs). However prior methods simply follow the GAN architectures from computer vision tasks without specific designs for the speech enhancement according to the audio characteristics (i.e., different granularity context), which may leave noise points in some segments or disturb the contents of the original audio. In this work, we make the first attempt to explore the global and local speech features for coarse-to-fine speech enhancement and introduce a Context Pyramid Generative Adversarial Network (CP-GAN), which contains a densely-connected feature pyramid generator and a dynamic context granularity discriminator to better eliminate audio noise hierarchically. Extensive experiments demonstrate that our CP-GAN effectively achieves state-of-the-art speech enhancement results and boosts the performance of more high-level speech tasks including automatic speech recognition and speaker recognition.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Cp-Gan: Context Pyramid Generative Adversarial Network For Speech Enhancement

Gang Liu, Ke Gong, Xiaodan Liang, Zhiguang Chen

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join the IEEE Signal Processing Society