This paper explores the use of large language models (LLMs) to analyze and gain insights into classical Chinese poetry. The researchers first compiled a comprehensive anthology of historical and AI-generated poems, including works from renowned authors as well as those annotated by poetry experts. They then designed a suite of metrics based on LLMs to evaluate the poems, including perplexity, entropy, probability, embedding, and frequency-based measures.
Through extensive statistical analysis and pattern summarization, the researchers identified several key findings:
Perplexity: Poems characterized by greater linguistic innovation and divergence from established conventions tend to exhibit higher perplexity, reflecting the model's relative unfamiliarity with such expressions.
Entropy: The entropy trajectory from the initial to the final couplet in Qilv poems typically follows a pattern of initial decline followed by a subsequent ascent, suggesting more rigid stylistic conventions in the second couplet. In Ci poems, the entropy of the latter section is often elevated, indicating richer thematic and content diversity.
Probability: Poems by authors who employ a less frequent vocabulary but maintain a close correspondence between conditional and absolute probabilities exhibit a distinctive creative approach, introducing novel dimensions to their poetic expressions.
Embedding: The relationships among tokens intensify as the model progresses through attention layers, reaching the pinnacle of interconnectivity at the output layer. The fine-tuning process predominantly modifies the parameters in the proximity of the output layer, influencing the model's predictive behavior.
Frequency: The Gini coefficients of historical poetry collections are much smaller than those of generated works, suggesting that current LLMs produce relatively monotonous content, highlighting the need to address the issue of diversity.
The researchers also developed a scoring model trained on expert-annotated data to assess the quality of poems, providing insights into the artistic creation and consistency of renowned authors.
These findings demonstrate the potential of LLMs to quantitatively analyze and uncover patterns in classical Chinese poetry, paving the way for enhanced AI-generated literary creations that can better capture the nuances and complexities of this rich poetic tradition.
다른 언어로
소스 콘텐츠 기반
arxiv.org
더 깊은 질문