An empirical study on speech restoration guided by self-supervised speech representation

Jaeuk Byun (Naver Corporation); Youna Ji (NAVER Corperation); Soo-Whan Chung (Naver Corporation); Soyeon Choe (NAVER Corporation); Min-Seok Choi (NAVER)

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

08 Jun 2023

Enhancing speech quality is an indispensable yet difficult task as it is often complicated by a range of degradation factors. In addition to additive noise, reverberation, clipping, and speech attenuation can all adversely affect speech quality. Speech restoration aims to recover speech components from these distortions. This paper focuses on exploring the impact of self-supervised speech representation learning on the speech restoration task. Specifically, we employ speech representation in various speech restoration networks and evaluate their performance under complicated distortion scenarios. Our experiments demonstrate that the contextual information provided by the self-supervised speech representation can enhance speech restoration performance in various distortion scenarios, while also increasing robustness against the duration of speech attenuation and mismatched test conditions.

Tags:

Audio signal enhancement and restoration

An empirical study on speech restoration guided by self-supervised speech representation

Jaeuk Byun (Naver Corporation); Youna Ji (NAVER Corperation); Soo-Whan Chung (Naver Corporation); Soyeon Choe (NAVER Corporation); Min-Seok Choi (NAVER)

Value-Added Bundle(s) Including this Product

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

More Like This

MAID: A Conditional Diffusion Model For Long Music Audio Inpainting

Immersive enhancement and removal of loudspeaker sound using wireless assistive listening systems and binaural hearing devices

A MODEL-BASED HEARING COMPENSATION METHOD USING A SELF-SUPERVISED FRAMEWORK

Join the IEEE Signal Processing Society