The paper presents a comprehensive approach to unlocking the rich linguistic data in the Sejong dictionary, a major language resource for Korean. It introduces two key tools:
A web interface that provides intuitive access to verb information, including morphological, semantic, and syntactic details, as well as annotated sentence examples illustrating subcategorization frames.
A Python library (pySejongFrame) that enables efficient querying and processing of the Sejong dictionary data, supporting various loading methods and integration with existing NLP frameworks like NLTK.
The web interface organizes the Sejong dictionary data, allowing users to search for verbs, frames, arguments, and semantic roles, and view detailed information with annotated sentence examples. The Python library offers flexible loading options and querying capabilities, making it suitable for both corpus-based applications and linguistic research.
The authors also discuss their efforts to map subcategorization frames to corresponding sentence examples, providing a valuable resource for understanding verb-argument structures in Korean. Additionally, they outline plans to integrate other Korean verb lexicons, such as the Korean PropBank and FrameNet, to develop a comprehensive Korean VerbNet.
This work aims to enhance the accessibility and usability of the Sejong dictionary, a crucial language resource, for a wide range of users, from linguists to developers working on Korean language processing tasks.
In un'altra lingua
dal contenuto originale
arxiv.org
Approfondimenti chiave tratti da
by Seohyun Song... alle arxiv.org 10-03-2024
https://arxiv.org/pdf/2410.01100.pdfDomande più approfondite