Centrala begrepp
An end-to-end optimized blind panoramic video quality assessment method that explicitly models user viewing patterns through learned visual scanpaths.
Sammanfattning
The content presents an end-to-end optimized blind panoramic video quality assessment (PVQA) method that consists of two modules: a scanpath generator and a quality assessor.
The scanpath generator is initially trained to predict future scanpaths by minimizing their expected code length, and then jointly optimized with the quality assessor for quality prediction. The scanpath generator is probabilistic and can work with any planar video quality assessment (VQA) model, enabling direct quality assessment of panoramic images by treating them as videos composed of identical frames.
The proposed method addresses the challenges posed by the spherical data structure of panoramic videos and the diverse and uncertain user viewing behaviors. Experiments on three public panoramic image and video quality datasets, encompassing both synthetic and authentic distortions, validate the superiority of the blind PVQA model over existing methods.
The key highlights and insights are:
- The scanpath generator is differentiable and can be integrated with any planar VQA model, enabling end-to-end optimization.
- The scanpath generator is trained to predict future scanpaths by minimizing their expected code length, capturing the uncertainty and diversity of human viewing patterns.
- The proposed three-stage optimization strategy, involving pre-training, quality assessor warmup, and end-to-end finetuning, accelerates convergence.
- The learned scanpaths enhance the performance of all quality assessors compared to other scanpath-based methods, and the method outperforms existing blind PVQA models under both in-dataset and cross-dataset settings.
- The scanpath generator closely replicates human viewing patterns, as validated by metrics such as minimum orthodromic distance and maximum temporal correlation.
Statistik
The content does not provide any specific numerical data or statistics. The focus is on the methodology and evaluation of the proposed blind PVQA method.
Citat
The content does not contain any striking quotes that support the key logics.