Efficient Mixed-Text Optical Character Recognition Using Transformer-based Pre-Training and Parameter-Efficient Fine-Tuning
DLoRA-TrOCR is a parameter-efficient hybrid text recognition method based on a pre-trained OCR Transformer. It embeds DoRA into the image encoder and LoRA into the text decoder, enabling efficient fine-tuning for mixed handwritten, printed, and street-view text recognition.
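To make the difference between the two adapter types concrete, the following is a minimal pure-Python sketch of the standard LoRA and DoRA weight reparameterizations on a toy 2x2 matrix. It is not taken from the DLoRA-TrOCR implementation: the function names, the toy weights, the rank `r`, and the scaling factor `alpha` are all illustrative assumptions. LoRA adds a scaled low-rank product to the frozen weight, while DoRA additionally splits the result into a per-column direction and a learnable magnitude vector `m`.

```python
import math

def matmul(X, Y):
    """Plain matrix product of two nested-list matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def add(X, Y, scale=1.0):
    """Element-wise X + scale * Y."""
    return [[x + scale * y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def column_norms(W):
    """Euclidean norm of each column of W (assumed non-zero here)."""
    return [math.sqrt(sum(v * v for v in col)) for col in zip(*W)]

def lora_weight(W0, B, A, alpha, r):
    """LoRA: frozen W0 plus the scaled low-rank update (alpha / r) * B @ A."""
    return add(W0, matmul(B, A), scale=alpha / r)

def dora_weight(W0, B, A, m, alpha, r):
    """DoRA: normalize each column of the LoRA-updated weight, then rescale by m."""
    V = lora_weight(W0, B, A, alpha, r)
    norms = column_norms(V)
    return [[m[j] * v / norms[j] for j, v in enumerate(row)] for row in V]

def trainable_params(d, k, r, with_magnitude=False):
    """Trainable parameters for one d x k layer: B (d x r), A (r x k), optional m (k)."""
    return r * (d + k) + (k if with_magnitude else 0)

# Toy setup (hypothetical values): a frozen 2x2 weight and a rank-1 adapter.
W0 = [[3.0, 0.0], [4.0, 2.0]]
r, alpha = 1, 1.0
B = [[0.0], [0.0]]        # zero-initialized, so the adapter starts as a no-op
A = [[0.5, -0.5]]
m = column_norms(W0)      # DoRA magnitude initialized to the column norms of W0

lora_start = lora_weight(W0, B, A, alpha, r)      # equals W0 at initialization
dora_start = dora_weight(W0, B, A, m, alpha, r)   # equals W0 at initialization

# After a (made-up) update to B, DoRA keeps each column norm equal to m,
# so direction and magnitude are adjusted separately.
B_trained = [[1.0], [0.0]]
dora_after = dora_weight(W0, B_trained, A, m, alpha, r)
```

In this sketch, only `B`, `A`, and (for DoRA) `m` would be trained while `W0` stays frozen. For a hypothetical 768x768 projection with rank 8, `trainable_params` gives 12,288 parameters for LoRA and 13,056 for DoRA, against 589,824 for full fine-tuning of that layer.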