A new 239.30-hour spontaneous speech corpus with the paulistano accent in Brazilian Portuguese, the NURC-SP Audio Corpus, is introduced and used to evaluate state-of-the-art automatic speech recognition models.
The incorporation of pseudo-labeled publicly available data is a highly effective strategy for improving ASR accuracy and noise robustness.