The paper proposes a training-free approach called Temporal-Aware Cluster-based SUMmarization (TAC-SUM) for video summarization. The key highlights are:
TAC-SUM integrates temporal context into the clustering mechanism to address the limitations of traditional cluster-based methods, which often overlook temporal coherence.
The method comprises four main stages:
Experimental results on the SumMe dataset show that TAC-SUM significantly outperforms existing unsupervised cluster-based methods and achieves comparable performance to state-of-the-art supervised techniques.
The qualitative analysis demonstrates the interpretability of TAC-SUM's summarization results, with the generated importance scores aligning well with human-annotated scores.
While the current approach relies on naive rules, the authors acknowledge the potential for future improvements by integrating learnable components to enhance adaptability and data-driven summarization.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Hai-Dang Huy... lúc arxiv.org 04-09-2024
https://arxiv.org/pdf/2404.04511.pdfYêu cầu sâu hơn