Using Automatic Speech Recognition And Speech Synthesis To Improve The Intelligibility Of Cochlear Implant Users In Reverberant Listening Environments

Kevin Chu, Leslie Collins, Boyla Mainsah

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 10:50

04 May 2020

Cochlear implant (CI) users experience substantial difficulties in understanding reverberant speech. A previous study proposed a strategy that leverages automatic speech recognition (ASR) to recognize reverberant speech and speech synthesis to translate the recognized text into anechoic speech. However, the strategy was trained and tested on the same reverberant environment, so it is unknown whether the strategy is robust to unseen environments. Thus, the current study investigated the performance of the previously proposed algorithm in multiple unseen environments. First, an ASR system was trained on anechoic and reverberant speech using different room types. Next, a speech synthesizer was trained to generate speech from the text predicted by the ASR system. Experiments were conducted in normal hearing listeners using vocoded speech, and the results showed that the strategy improved speech intelligibility in previously unseen conditions. These results suggest that the ASR-synthesis strategy can potentially benefit CI users in everyday reverberant environments.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020