Ensemble Learning Boosts Vietnamese Scene Text Spotting in Challenging Urban Environments
An ensemble learning framework that combines multiple state-of-the-art scene text detection and recognition models significantly improves the performance of Vietnamese scene text spotting in complex urban settings.