Wav2code: Restoring Clean Speech Representations from Noisy Inputs for Robust Automatic Speech Recognition
The proposed Wav2code framework leverages self-supervised learning and vector quantization to restore high-quality clean speech representations from noisy inputs, enabling more robust automatic speech recognition performance under various noisy conditions.