Analysis Of Acoustic Features For Speech Sound Based Classification Of Asthmatic And Healthy Subjects

Shivani Yadav, Merugu Keerthana, Dipanjan Gope, Prasanta Kumar Ghosh, Uma Maheswari Krishnaswamy

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 12:02

04 May 2020

Non-speech sounds (cough, wheeze) are typically known to perform better than speech sounds for asthmatic and healthy subject classification. In this work, we use sustained phonations of speech sounds, namely, /A:/, /i:/, /u:/, /eI/, /oU/, /s/, and /z/ from 47 asthmatic and 48 healthy controls. We consider INTERSPEECH 2013 Computational Paralinguistics Challenge baseline (ISCB) acoustic features for the classification task as they provide a rich set of characteristics of the speech sounds. Mel-frequency cepstral coefficients (MFCC) are used as the baseline features. The classification accuracy using ISCB improves over MFCC for all voiced speech sounds with the highest classification accuracy of 75.4% (18.28% better than baseline) for /oU/. The exhale achieves the highest classification accuracy of 77.8% (4.2% better than baseline). Comparable accuracies using speech sound /oU/ and non-speech exhale indicate the benefit of the rich acoustic features from ISCB. An analysis of 21 ISCB features groups using forward feature group selection shows that loudness and MFCC groups contribute the most in the case of /oU/, with interquartile range between 2nd and 3rd quartile of loudness feature being the best discriminator.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020