A Large Dataset of Spontaneous Speech with the Paulistano Accent for Automatic Speech Recognition Evaluation
The NURC-SP Audio Corpus, a new 239.30-hour corpus of spontaneous Brazilian Portuguese speech with the paulistano accent, is introduced and used to evaluate state-of-the-art automatic speech recognition models.