Controlling The Perceived Sound Quality For Dialogue Enhancement With Deep Learning

Christian Uhle, Matteo Torcoli, Jouni Paulus

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 12:04

04 May 2020

Speech enhancement attenuates interfering sounds in speech signals but may introduce artifacts that perceivably deteriorate the output signal. We propose a method for controlling the trade-off between the attenuation of the interfering background signal and the loss of sound quality. A deep neural network estimates the attenuation of the separated background signal such that the sound quality, quantified using the Artifact-related Perceptual Score, meets an adjustable target. Subjective evaluations indicate that consistent sound quality is obtained across various input signals. Our experiments show that the proposed method is able to control the trade-off with an accuracy that is adequate for real-world dialogue enhancement applications.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Controlling The Perceived Sound Quality For Dialogue Enhancement With Deep Learning

Christian Uhle, Matteo Torcoli, Jouni Paulus

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join the IEEE Signal Processing Society