JSTR: A Novel Framework for Improving Scene Text Recognition Accuracy by Judging Correct and Incorrect Predictions
The proposed JSTR framework improves scene text recognition accuracy by training the model on two tasks: the standard text recognition task, and an explicit judgment of whether the model's own text predictions match the input image.
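One way to picture the combination of recognition and judgment is as a joint training objective. The sketch below is illustrative only and is not taken from the source: the function names (`judgment_label`, `binary_cross_entropy`, `joint_loss`), the exact-match labeling rule, the binary cross-entropy judgment term, and the `judge_weight` parameter are all assumptions. The recognition loss and the model's match probability are passed in as plain numbers, since the source does not specify how they are computed.

```python
import math


def judgment_label(prediction: str, ground_truth: str) -> int:
    """Assumed labeling rule: 1 if the predicted text is exactly right, else 0."""
    return 1 if prediction == ground_truth else 0


def binary_cross_entropy(prob: float, label: int, eps: float = 1e-7) -> float:
    """Binary cross-entropy for one example, with clamping to avoid log(0)."""
    p = min(max(prob, eps), 1.0 - eps)
    return -(label * math.log(p) + (1 - label) * math.log(1.0 - p))


def joint_loss(
    recognition_loss: float,
    match_prob: float,
    prediction: str,
    ground_truth: str,
    judge_weight: float = 1.0,  # hypothetical weighting factor
) -> float:
    """Recognition loss plus a weighted judgment loss (assumed formulation).

    match_prob is the model's estimated probability that `prediction`
    matches the image; the target is derived from the ground truth.
    """
    label = judgment_label(prediction, ground_truth)
    return recognition_loss + judge_weight * binary_cross_entropy(match_prob, label)


# Usage: a wrong prediction ("care" vs. "cafe") judged as likely correct
# is penalized more than the same prediction judged as likely incorrect.
overconfident = joint_loss(0.3, 0.9, "care", "cafe")
well_calibrated = joint_loss(0.3, 0.1, "care", "cafe")
print(overconfident > well_calibrated)
```

Under these assumptions, the judgment term penalizes a model that believes a wrong prediction matches the image, which is the behavior the framework's description suggests it aims to discourage.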