Khái niệm cốt lõi
Proposing a novel evaluation framework for large video generative models to assess visual, content, motion qualities, and text-video alignment.
Thống kê
We propose a novel framework and pipeline for exhaustively evaluating the performance of generated videos.
Our approach involves generating 700 prompts based on real-world user data and analyzing them with objective metrics.
Trích dẫn
"We argue that it is hard to judge the large conditional generative models from simple metrics."
"Our final score shows a higher correlation than simply averaging the metrics."