Single Frequency Filter Bank Based Long-Term Average Spectra For Hypernasality Detection And Assessment In Cleft Lip And Palate Speech
Hashim Javid Mohammad, Krishna Gurugubelli, Anil Kumar Vuppala
-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 14:08
Hypernasality is an abnormality in speech production observed in subjects with craniofacial anomalies like cleft lip and palate (CLP). Detection and assessment of hypernasality is a primary step in the clinical diagnosis of individuals with CLP. Existing methods explore the short-term spectral information from speech to assess hypernasality. The present work examines long-term average spectral (LTAS) features obtained from speech to detect and assess hypernasality. This work proposes single frequency filter bank based long-term average spectral (SFFB-LTAS) features for hypernasality detection and assessment. The SFFB is used to extract long-term average spectra with a good spectral resolution. The experiments are carried out using NMCPC-CLP database collected from 41 speakers with CLP and 32 speakers without CLP. The experimental results show that, SFFB-LTAS features performed better compared to state-of-art spectral and prosody features. The proposed systems for the detection and assessment of hypernasality have shown classification accuracy of 89% and 82.1%, respectively.