แนวคิดหลัก
LLMs struggle to cover diverse information effectively, highlighting the challenges in multi-document summarization.
สถิติ
LLMs struggle to achieve high coverage rates even with advanced models like GPT-4.
GPT-4 only covers about 37% of diverse information on average.
Faithfulness scores are high across different LLMs, but coverage remains a challenge.