Improving Factored Hybrid HMM Acoustic Modeling without State Tying

Tina Raissi, Ralf Schlüter, Eugen Beck, Hermann Ney

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 00:15:07

11 May 2022

In this work, we show that a factored hybrid hidden Markov model (FH-HMM) which is defined without any phonetic state-tying outperforms a state-of-the-art hybrid HMM. The factored hybrid HMM provides a link to transducer models in the way it models phonetic (label) context while preserving the strict separation of acoustic and language model of the hybrid HMM approach. Furthermore, we show that the factored hybrid model can be trained from scratch without using phonetic state-tying in any of the training steps. Our modeling approach enables triphone context while avoiding phonetic state-tying by a decomposition into locally normalized factored posteriors for monophones/HMM states in phoneme context. Experimental results are provided for Switchboard 300h and LibriSpeech. On the former task we also show that by avoiding the phonetic state-tying step, the factored hybrid can take better advantage of regularization techniques during training, compared to the standard hybrid HMM with phonetic state-tying based on classification and regression trees (CART).

Tags:

blstm acoustic model

librispeech

switchboard

cart-free hybrid hmm

regularization

Improving Factored Hybrid HMM Acoustic Modeling without State Tying

Tina Raissi, Ralf Schlüter, Eugen Beck, Hermann Ney

Value-Added Bundle(s) Including this Product

ICASSP 2022, May 2022 Virtual and In-Person Conference - Presentation Videos Product Bundle

More Like This

P4.9-Regularization and Dropout

DODGING THE DOUBLE DESCENT IN DEEP NEURAL NETWORKS

FEDMBP: MULTI-BRANCH PROTOTYPE FEDERATED LEARNING ON HETEROGENEOUS DATA

Join the IEEE Signal Processing Society