The content discusses the challenges in Handwritten Mathematical Expression Recognition (HMER) due to complex layouts. It introduces an attention guidance mechanism to refine attention weights, with self-guidance and neighbor-guidance approaches. Experiments show improved recognition rates on CROHME datasets.
The HMER task is challenging due to two-dimensional layouts of mathematical expressions. Previous methods use historical attention weights, but limitations exist in addressing under-parsing issues. The proposed attention guidance mechanism aims to suppress irrelevant regions and enhance appropriate ones explicitly.
Self-guidance refines correlations by seeking consensus among different attention heads, while neighbor-guidance leverages final attention weights from previous decoding steps. Experiments demonstrate superior performance over existing methods on standard benchmarks.
The proposed method not only addresses HMER challenges but also has potential applications in other tasks requiring dynamic alignment. Future work may explore additional attention guidance approaches for further improvements.
翻譯成其他語言
從原文內容
arxiv.org
深入探究