OpenChemIE is a toolkit designed to extract comprehensive reaction data from chemistry literature. It addresses the challenge of integrating information across multiple modalities, including text, tables, and figures, to obtain complete reaction descriptions.
The key components of OpenChemIE are figure analysis, text analysis, and multimodal integration.
OpenChemIE was evaluated on a newly annotated dataset of 1007 reactions from 78 substrate scope figures across 5 chemistry journals. It achieved an F1 score of 69.5% on this challenging task, demonstrating its ability to extract detailed reaction data by integrating information from multiple modalities. Additionally, in an end-to-end evaluation against the Reaxys database, OpenChemIE attained an accuracy of 64.3%.
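For readers unfamiliar with the metric, the F1 score is the harmonic mean of precision and recall. The sketch below shows one plausible way to compute it over sets of extracted reactions using exact matching. The matching criterion, the `f1_score` helper, and the toy reaction tuples are all illustrative assumptions; they are not taken from the OpenChemIE evaluation code or dataset.

```python
def f1_score(predicted: set, gold: set) -> tuple[float, float, float]:
    """Return (precision, recall, F1) for exact-match set comparison.

    Hypothetical helper: assumes a predicted reaction counts as correct
    only if it is identical to a gold reaction.
    """
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy data (made up): each reaction is a (reactant, product) placeholder pair.
gold = {("A", "B"), ("C", "D"), ("E", "F"), ("G", "H")}
predicted = {("A", "B"), ("C", "D"), ("X", "Y")}

precision, recall, f1 = f1_score(predicted, gold)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

In this toy case two of three predictions are correct and two of four gold reactions are recovered, so the F1 score penalizes both the spurious prediction and the missed reactions.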
The toolkit is available as an open-source package and a web interface, enabling broader usage and future development in this area.
Source: by Vincent Fan,... at arxiv.org, 04-03-2024
https://arxiv.org/pdf/2404.01462.pdf