Multi-Blank Transducers for Faster and More Accurate Speech Recognition
The proposed multi-blank RNN-T method introduces additional blank symbols that consume multiple input frames, enabling faster inference while improving speech recognition accuracy.