FastSpell is a language identifier that combines fastText and Hunspell to provide a refined second-opinion on language predictions, with a focus on accurately distinguishing between similar and closely-related languages.
Pre-trained multilingual models, particularly those exposed to South African languages during pre-training, significantly outperform traditional methods like N-grams for language identification in low-resource South African languages.