GoLLIE introduces a model fine-tuned to comply with annotation guidelines, showing significant progress in zero-shot information extraction. The model leverages detailed guidelines to improve performance on unseen tasks, surpassing previous methods. By following instructions and adhering to guidelines, GoLLIE demonstrates the importance of annotation guidelines in enhancing model performance.
Large Language Models (LLMs) combined with instruction tuning have made significant progress in generalizing to unseen tasks. However, they have been less successful in Information Extraction (IE), lagging behind task-specific models. Typically, IE tasks are characterized by complex annotation guidelines that describe the task and give examples to humans. Previous attempts to leverage such information have failed, even with the largest models, as they are not able to follow the guidelines out of the box.
In this paper, GoLLIE is proposed as a model able to improve zero-shot results on unseen IE tasks by being fine-tuned to comply with annotation guidelines. Comprehensive evaluation empirically demonstrates that GoLLIE is able to generalize and follow unseen guidelines, outperforming previous attempts at zero-shot information extraction.
The ablation study shows that detailed guidelines are key for good results. Code, data, and models are publicly available on GitHub.
In un'altra lingua
dal contenuto originale
arxiv.org
Approfondimenti chiave tratti da
by Osca... alle arxiv.org 03-07-2024
https://arxiv.org/pdf/2310.03668.pdfDomande più approfondite