Introducing an Efficient Training Method for Video Foundation Models
The author proposes a training-efficient method for temporally sensitive Video Foundation Models that integrates existing methods, with a focus on data efficiency and multi-modal friendliness.