Core Concepts
Leveraging large language models and knowledge graphs to generate coherent multi-sentence citations that accurately capture the relationships between source and target research papers.
Abstract
The paper presents a method for generating multi-sentence citation text using large language models (LLMs) such as LLaMA, Alpaca, and Vicuna. The authors curated a new dataset called MCG-S2ORC from the S2ORC corpus, which contains citations referencing multiple papers in a single sentence or paragraph.
The key highlights of the work are:
The authors fine-tuned the LLM models on the MCG-S2ORC dataset for the task of multi-citation text generation. Their experiments showed that the Vicuna model outperformed the other LLMs.
To further improve the performance, the authors incorporated knowledge graphs extracted from the source and target paper abstracts, introductions, and conclusions into the prompts used for fine-tuning the LLMs. This knowledge-enhanced prompting led to significant improvements in the quality and coherence of the generated citation text.
The authors conducted extensive evaluations using standard metrics like METEOR, ROUGE-1, ROUGE-2, and ROUGE-L, demonstrating the effectiveness of their approach.
The paper highlights the advantages of using citation generation models, such as saving time, ensuring accuracy and consistency, reducing plagiarism, and serving as educational resources.
Overall, the work showcases the potential of leveraging large language models and knowledge graphs to automate the citation generation process, enabling researchers to focus more on their core work while maintaining proper source attribution.
Stats
The average number of characters in the citation texts is 227.29.
The maximum number of characters in the citation texts is 2416.
The average number of characters in the source abstracts is 1122.95.
The maximum number of characters in the source abstracts is 5516.
The average number of characters in the target abstracts is 998.48.
The maximum number of characters in the target abstracts is 93551.
The average number of target papers per sample is 2.
Quotes
"Using a CTG model offers numerous advantages. Firstly, it significantly saves time by automating the citation generation process, allowing researchers, students, and authors to focus more on their work. Secondly, these models ensure accuracy and consistency by following specific citation styles and providing correct formatting for various source types."
"To enhance citation generation, we integrated knowledge graphs into the model's prompts by extracting entity relations from the source and target paper's abstracts, introductions, and conclusions using PL-Marker. Our experiments showed that incorporating knowledge graphs significantly improved the performance and text generation capabilities of the models, enabling better comprehension of relations between source and target papers."