CFRet-DVQA: Coarse-to-Fine Retrieval and Efficient Tuning for Document Visual Question Answering
The author introduces CFRet-DVQA, a framework that combines coarse-to-fine retrieval with efficient tuning to improve Document Visual Question Answering (DVQA).