Основні поняття
A model is proposed to effectively transfer knowledge from the biomedical domain to the chemical domain for named entity recognition, by projecting source and target entities into separate regions of the feature space.
Анотація
The paper investigates the applicability of transfer learning for enhancing a named entity recognition (NER) model trained in the biomedical domain (the source domain) to be used in the chemical domain (the target domain). The authors observe that a common practice of pretraining the model on labeled source data and then finetuning it on a few labeled target examples is prone to mislabeling source entities as target entities, due to the shared context between the two domains.
To address this issue, the authors propose a two-stage model:
-
Entity Grouping in the Source Domain:
- Extracting auxiliary information about source entities from annotated events to establish relations between entities.
- Employing a refined multi-similarity loss to group similar source entities and project them into separate regions of the feature space.
-
Entity Discrimination in the Target Domain:
- Detecting potentially false positive entities (i.e., source entities) using pseudo-labeling.
- Leveraging the multi-similarity loss to enhance discrimination between the pseudo-labeled entities and the target entities, projecting them into separate regions of the feature space.
The authors conduct extensive experiments across three source and three target datasets, demonstrating that their method outperforms the baselines by up to 5% in absolute F1 score. They also provide detailed empirical analysis to shed light on the effectiveness of the proposed techniques.
Статистика
The catechol and hydroquinone metabolites, induced apoptosis in HL60 and HBMP cells in a time- and concentration dependent manner.
The phenols, NCR181, FLA873, and FLA797, and the derivatives formed by oxidation of the pyrrolidine ring had no effect.
Цитати
"Named entity recognition is a crucial step in IE tasks. Existing models have achieved remarkable performance in the general domain (Lin et al., 2020; Wang et al., 2021b; Zhang et al., 2023; Shen et al., 2023b). However, in the scientific domains, e.g., medical or chemical domains, these models usually struggle due to the extremely large quantity of concepts, the wide presence of multi-token entities, and the ambiguity in detecting entity boundaries."
"Given the already existing challenges of the named entity recognition task in the scientific domain—mentioned earlier—this factor can further exacerbate the problem. For instance, in our early few-shot learning experiments, we observed that the results of ChatGPT in the chemical domain named entity recognition task are significantly worse than those in the general domain."