STATIC AND DYNAMIC SOURCE AND FILTER CUES FOR CLASSIFICATION OF AMYOTROPHIC LATERAL SCLEROSIS PATIENTS AND HEALTHY SUBJECTS
Tanuka Bhattacharjee (Indian Institute of Science); Chowdam Venkata Thirumala Kumar (Indian Institute of Science,Bengaluru); Yamini BK (NIMHANS); Nalini Atchayaram (NIMHANS); Ravi Yadav (NIMHANS); Prasanta Dr Ghosh (Indian Institute of Science (IISc), Bangalore)
-
SPS
IEEE Members: $11.00
Non-members: $15.00
Dysarthria due to Amyotrophic Lateral Sclerosis (ALS) affects speech production. Even the elementary sustained vowel utterances get impaired. For these, the impairments can be in achieving vowel-specific articulatory configurations, reflected in static acoustic cues, and/or in sustaining a configuration for a prolonged duration, reflected in dynamic cues. Such cues can further be attributed to the vocal cord (source) and vocal tract (filter) involved in speech production. This paper analyzes the relative contributions of these static (captured through average spectral characteristics) and dynamic (captured through spectral variations over time) source and filter cues toward automatic classification of ALS patients and healthy subjects using sustained utterances of /a/, /i/, /o/ and /u/. Experiments with 80 ALS patients and 80 healthy subjects suggest that the source cues (static/dynamic) are not the primary discriminators. For /i/, the static filter cues achieve the highest mean classification accuracy of 76.66%, whereas, for /a/, /o/ and /u/, the dynamic filter attributes contribute the most attaining average accuracies of 66.29%, 73.03% and 70.27%, respectively. Hence, ALS patients seem to face difficulties in forming the front closed vocal tract structure of /i/, whereas, holding the target vocal tract shape for long appears to be the primary challenge in case of /a/, /o/ and /u/.