Exploring the Relationship between Internal Language Model Subtraction and Sequence Discriminative Training for Neural Transducers
Sequence discriminative training, such as maximum mutual information (MMI) and minimum Bayes risk (MBR) training, has a strong correlation with internal language model (ILM) subtraction for improving the performance of neural transducers.