Frame-Level Mmi As A Sequence Discriminative Training Criterion For Lvcsr

Wilfried Michel, Ralf SchlÃ¼ter, Hermann Ney

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 13:01

04 May 2020

In this work we present frame-level maximum mutual information (MMI) as a novel sequence discriminative training criterion for hybrid HMM-DNN acoustic models. Compared to the standard, sequence-level MMI criterion we show that frame-level MMI has increased robustness towards missing cross-entropy (CE) smoothing and can converge even without interpolation. Using model free optimization, we show that in the asymptotic case of an infinite amount of training data models trained using this criterion are equal to the true class posterior distribution, whereas training using the state-level minimum Bayes risk (sMBR) criterion leads to a distorted function of the true class posterior distribution. This analytical result is backed by experimental evidence. We further propose a generalized class of training criteria, that continuously interpolates between frame-level MMI and sMBR criterion.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020