TOWARDS DOMAIN GENERALISATION IN ASR WITH ELITIST SAMPLING AND ENSEMBLE KNOWLEDGE DISTILLATION

Rehan Ahmad (University of Sheffield); Md Asif Jalal (Samsung Research UK); Muhammad Umar Farooq (University of Sheffield); Anna L Ollerenshaw (University of Sheffield); Thomas Hain (University of Sheffield)

06 Jun 2023

Knowledge distillation (KD) has been widely used for model compression and domain adaptation in speech applications. In the presence of multiple teachers, knowledge can easily be transferred to the student by averaging the models' outputs. However, previous research shows that the student does not adapt well with such a combination. This paper proposes an elitist sampling strategy at the output of ensemble teacher models, which selects the best-decoded utterance generated by completely out-of-domain teacher models in order to generalise to an unseen domain. The teacher models are trained on AMI, LibriSpeech and WSJ, while the student is adapted to the Switchboard data. The results show that, with the selection strategy based on the individual models' posteriors, the student model achieves a better WER than all the teachers and baselines, with a minimum absolute improvement of about 6%. Furthermore, insights into model adaptation with out-of-domain data are provided via a correlation analysis.
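The abstract describes the selection step only at a high level. The following is a minimal sketch of how a per-utterance elitist selection over ensemble teachers could look, assuming the selection score is the mean frame-level log-posterior of each teacher's 1-best hypothesis; the function and variable names are illustrative and the paper's exact scoring may differ.

```python
import numpy as np

def elitist_select(teacher_log_posteriors):
    """Pick, for one utterance, the teacher whose decoded hypothesis has the
    highest mean frame-level log-posterior (a hypothetical confidence proxy).

    teacher_log_posteriors: list with one entry per teacher, each an array of
        frame log-posteriors along that teacher's 1-best decoded path.
    Returns the index of the selected teacher and its score.
    """
    scores = [float(np.mean(lp)) for lp in teacher_log_posteriors]
    best = int(np.argmax(scores))
    return best, scores[best]

# Toy example: three out-of-domain teachers score one target-domain utterance.
utt_scores = [
    np.log(np.array([0.6, 0.7, 0.5])),   # e.g. AMI-trained teacher
    np.log(np.array([0.8, 0.9, 0.85])),  # e.g. LibriSpeech-trained teacher
    np.log(np.array([0.4, 0.5, 0.45])),  # e.g. WSJ-trained teacher
]
idx, score = elitist_select(utt_scores)
print(f"Selected teacher {idx} with mean log-posterior {score:.3f}")
```

In this sketch, the selected teacher's hypothesis would then serve as the distillation target for the student on that utterance, rather than an average over all teachers.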
