GoLLIE proposes a model that follows annotation guidelines to enhance zero-shot information extraction. Large Language Models have struggled with Information Extraction tasks due to the complexity of annotation guidelines. GoLLIE outperforms previous attempts by fine-tuning to comply with detailed guidelines. The model leverages pre-training knowledge to extract mentions based on categories defined in the guidelines. However, challenges arise when different annotation schemas define labels differently. The ablation study shows that detailed guidelines are crucial for good results. GoLLIE introduces various training regularizations to ensure compliance with guidelines and prevent overfitting to specific datasets.
Til et andet sprog
fra kildeindhold
arxiv.org
Vigtigste indsigter udtrukket fra
by Osca... kl. arxiv.org 03-07-2024
https://arxiv.org/pdf/2310.03668.pdfDybere Forespørgsler