Efficient Speech Processing with Discrete Speech Units: Techniques and Insights from the Interspeech 2024 Challenge
The authors present their systems developed for the Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge, including techniques for text-to-speech, singing voice synthesis, and automatic speech recognition using discrete speech tokens. Their approaches demonstrate the potential of discrete speech representations to achieve high-quality and low-bitrate speech processing.