Automated metric BOOOOKSCORE evaluates book-length summarization coherence, revealing insights on LLM performance.
Book-length summarization using LLMs can be systematically evaluated with the BOOOOKSCORE metric, providing insights into coherence and model performance.