GoLLIE is a model fine-tuned to follow annotation guidelines in order to improve zero-shot information extraction (IE). Large language models (LLMs) have struggled with IE tasks because those tasks are governed by detailed annotation guidelines that the models do not follow out of the box. GoLLIE outperforms previous attempts at zero-shot IE because it is fine-tuned to comply with such guidelines. The model draws on its pre-training knowledge to extract mentions according to the categories the guidelines define. Challenges remain, however, when different annotation schemas define the same label differently. The ablation study shows that detailed guidelines are crucial for good results. GoLLIE also introduces several training regularizations that encourage compliance with the guidelines and prevent overfitting to specific datasets.
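To make the idea of guideline-conditioned extraction concrete, here is a minimal sketch of how a prompt could be assembled from label definitions. The summary does not specify the input format or name the regularizations, so everything here is an assumption for illustration: guidelines are rendered as docstrings of Python class definitions, and the regularizations shown (label shuffling, label dropout, label-name masking) are plausible examples with made-up probabilities. `LabelSpec`, `render_schema`, `regularize`, and `build_prompt` are hypothetical names, not the authors' code.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class LabelSpec:
    """One category of a hypothetical annotation schema."""
    name: str       # label name, e.g. "Person"
    guideline: str  # annotator-facing definition of the label


def render_schema(labels, with_guidelines=True):
    """Render labels as class definitions, with guidelines as docstrings."""
    blocks = []
    for label in labels:
        lines = ["@dataclass", f"class {label.name}:"]
        if with_guidelines:
            lines.append(f'    """{label.guideline}"""')
        lines.append("    span: str")
        blocks.append("\n".join(lines))
    return "\n\n".join(blocks)


def regularize(labels, rng, drop_prob=0.25, mask_prob=0.5):
    """Drop, shuffle, and mask labels (assumed regularizations and values)."""
    # Label dropout: the model cannot rely on a fixed label inventory.
    kept = [label for label in labels if rng.random() >= drop_prob]
    if not kept:
        kept = [rng.choice(labels)]
    # Shuffling: the model cannot rely on label order.
    rng.shuffle(kept)
    # Name masking: the model must read the guideline, not the label name.
    masked = []
    for i, label in enumerate(kept):
        if rng.random() < mask_prob:
            label = LabelSpec(f"LABEL_{i}", label.guideline)
        masked.append(label)
    return masked


def build_prompt(labels, text, with_guidelines=True):
    """Concatenate the schema and the input text; the model completes the list."""
    schema = render_schema(labels, with_guidelines)
    return f"{schema}\n\ntext = {text!r}\n\nresult = ["
```

Calling `build_prompt(labels, text, with_guidelines=False)` produces the kind of input an ablation without guidelines would use, where only the label names remain. Passing the output of `regularize(labels, random.Random(seed))` to `build_prompt` during training would vary the schema from example to example.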
Key Insights Extracted From
by Osca... at arxiv.org 03-07-2024
https://arxiv.org/pdf/2310.03668.pdf
Deeper Questions