Language Model Aware Speech Tokenization for Improved Spoken Language Modeling and Speech Recognition
Integrating a pre-trained text language model into speech tokenization guides the learning of discrete speech representations that are better suited to sequential modeling of speech.
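
As a rough illustration of the idea, the toy sketch below quantizes feature frames into discrete tokens and then scores the resulting token sequence with a frozen language model. Everything in it is an assumption made for illustration and is not taken from the source: the function names `quantize` and `lm_next_token_loss`, the nearest-neighbour codebook, and the small bigram table that stands in for the pre-trained text language model. It only shows how a language-model loss can be computed over tokenizer output; it does not show how the tokenizer would be trained against that loss.

```python
import math


def quantize(frames, codebook):
    """Map each feature frame to the index of its nearest codebook vector."""
    tokens = []
    for frame in frames:
        # Squared Euclidean distance from this frame to every codebook entry.
        dists = [
            sum((f - c) ** 2 for f, c in zip(frame, code))
            for code in codebook
        ]
        tokens.append(dists.index(min(dists)))
    return tokens


def lm_next_token_loss(tokens, bigram_logprobs):
    """Average next-token negative log-likelihood under a frozen bigram table.

    The bigram table is a hypothetical stand-in for a pre-trained language
    model; bigram_logprobs[a][b] is log P(next = b | current = a).
    """
    pairs = list(zip(tokens[:-1], tokens[1:]))
    return -sum(bigram_logprobs[a][b] for a, b in pairs) / len(pairs)


# Toy 2-dimensional "speech features" and a 2-entry codebook (made-up values).
frames = [[0.1, 0.0], [0.9, 1.1], [0.0, 0.2], [1.0, 0.9]]
codebook = [[0.0, 0.0], [1.0, 1.0]]

# Frozen stand-in language model over the two token types (made-up values).
bigram_probs = [[0.2, 0.8], [0.7, 0.3]]
bigram_logprobs = [[math.log(p) for p in row] for row in bigram_probs]

tokens = quantize(frames, codebook)
loss = lm_next_token_loss(tokens, bigram_logprobs)
print(tokens, round(loss, 4))
```

A lower loss means the frozen model finds the token sequence easier to predict. In a language-model-aware tokenizer, a signal of this kind would be used to shape the tokenizer's parameters while the language model itself stays fixed.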