Decoupling Frontend Enhancement and Backend Recognition in Monaural Robust ASR
The author proposes a system that decouples frontend speech enhancement from backend recognition to improve monaural automatic speech recognition (ASR) in noisy conditions. The ASR backend is trained on clean speech only, and the proposed system is reported to outperform existing approaches on various datasets.
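
The decoupling described above can be illustrated with a minimal sketch. This is not the author's implementation: `toy_enhance`, `toy_recognize`, and `DecoupledPipeline` are hypothetical names, and both stages are trivial stand-ins for real models. The sketch only shows the structural idea that the frontend and backend are independent components, joined at inference time, so the backend never has to see noisy input during training.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

Waveform = List[float]


def toy_enhance(noisy: Sequence[float], threshold: float = 0.1) -> Waveform:
    """Hypothetical enhancement frontend: zero out low-amplitude samples.

    A real frontend would be a trained speech enhancement model; this
    amplitude gate is only a placeholder with the same input/output shape.
    """
    return [x if abs(x) >= threshold else 0.0 for x in noisy]


def toy_recognize(clean: Sequence[float]) -> str:
    """Hypothetical ASR backend, assumed to be trained on clean speech only.

    Returns a dummy token when any non-zero sample remains.
    """
    return "speech" if any(x != 0.0 for x in clean) else ""


@dataclass
class DecoupledPipeline:
    """Composes two independently built stages at inference time."""

    enhance: Callable[[Sequence[float]], Waveform]
    recognize: Callable[[Sequence[float]], str]

    def transcribe(self, noisy: Sequence[float]) -> str:
        # The backend only ever receives the frontend's enhanced output.
        return self.recognize(self.enhance(noisy))


pipeline = DecoupledPipeline(enhance=toy_enhance, recognize=toy_recognize)
```

Because the two stages share only a waveform interface, either one could in principle be swapped or retrained without touching the other, which is the practical appeal of a decoupled design.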