Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data
Evaluating hate speech detection models on biased datasets substantially overestimates their real-world performance, as measured on representative Nigerian Twitter data. Domain-adaptive pretraining and finetuning on diverse data are key to maximizing detection performance in this low-resource context.