Establishing a large-scale dataset, T2VQA-DB, and proposing a transformer-based model, T2VQA, for subjective-aligned text-to-video quality assessment.