Compressed Representation Of Cepstral Coefficients Via Recurrent Neural Networks For Informed Speech Enhancement

Carol Chermaz, Dario Leuchtmann, Simon Tanner, Roger Wattenhofer


Length: 00:11:48

10 Jun 2021

Speech enhancement is one of the biggest challenges in hearing prosthetics. In face-to-face communication, devices have to estimate the signal of interest, but playback of speech signals from an electronic device opens up new opportunities. Audio signals can be enriched with hidden data, which can subsequently be decoded by the receiver. We investigate a hybrid strategy that combines signal processing with recurrent neural networks (RNNs) to calculate and compress cepstral coefficients. These are descriptors of the speech signal that can be embedded in the signal itself and used at the receiver's end to perform informed speech enhancement. Objective evaluations showed an increase in speech quality for noisy signals enhanced with our method.
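
For readers unfamiliar with cepstral coefficients, the following is a minimal sketch of how a real cepstrum can be computed for a single audio frame: take the log-magnitude of the frame's spectrum, then apply an inverse Fourier transform. It is not the authors' implementation. The function name `real_cepstrum`, the Hann window, the frame length, the number of retained coefficients, and the test signal are all assumptions chosen for illustration, and the RNN-based compression and data-hiding steps described in the abstract are not shown.

```python
import cmath
import math


def real_cepstrum(frame, n_coeffs):
    """Return the first n_coeffs real cepstral coefficients of one frame."""
    n = len(frame)
    # Hann window to reduce spectral leakage (an illustrative choice).
    windowed = [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    # Naive DFT, O(n^2): acceptable for a short demo frame.
    spectrum = [sum(windowed[t] * cmath.exp(-2j * math.pi * k * t / n)
                    for t in range(n))
                for k in range(n)]
    # Log-magnitude spectrum; the small floor avoids log(0).
    log_mag = [math.log(abs(s) + 1e-12) for s in spectrum]
    # Inverse DFT of the log-magnitude. The result is real up to rounding
    # because log_mag is symmetric for a real-valued input frame.
    cepstrum = [sum(log_mag[k] * cmath.exp(2j * math.pi * k * q / n)
                    for k in range(n)).real / n
                for q in range(n)]
    # Keeping only the low-order coefficients gives a compact description
    # of the spectral envelope.
    return cepstrum[:n_coeffs]


# Hypothetical test frame: 64 samples of two sinusoids at a 16 kHz rate.
frame = [math.sin(2 * math.pi * 200 * t / 16000)
         + 0.5 * math.sin(2 * math.pi * 1200 * t / 16000)
         for t in range(64)]
coeffs = real_cepstrum(frame, n_coeffs=13)
```

Truncating to the first few coefficients is itself a simple form of compression; the work summarized above goes further by using an RNN to compress the coefficients before embedding them in the signal.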

Chairs:

Timo Gerkmann

Tags:

signal processing society

IEEE icassp 2021

virtual conference

june 6-11 2021