Core Concepts
The proposed ResVR framework jointly optimizes the processes of omnidirectional image downscaling and viewport rendering to achieve efficient transmission and high-quality viewing experiences on head-mounted displays.
Abstract
The paper presents ResVR, a novel framework for the comprehensive processing of omnidirectional images (ODIs). ResVR seamlessly integrates image rescaling and viewport rendering to balance transmission efficiency and user visual experience.
Key highlights:
Conventional ODI rescaling methods focus solely on enhancing the quality of equirectangular projection (ERP) images, overlooking the fact that the content viewed on head-mounted displays (HMDs) is the rendered viewport.
ResVR directly optimizes the quality of the final viewport, without the need to produce high-resolution ERP images.
A discrete pixel sampling strategy is developed to tackle the complex mapping between the viewport and ERP, enabling end-to-end training of the ResVR pipeline.
A spherical pixel shape representation technique is introduced to significantly improve the visual quality of rendered viewports, especially in high-latitude and high-longitude regions.
Extensive experiments demonstrate that ResVR outperforms existing methods in viewport rendering tasks across different fields of view, resolutions, and view directions, while maintaining a low transmission bitrate.
Stats
Rendering a 2048x1536 viewport with 120°x90° field of view from a downscaled ERP image, ResVR achieves 31.39 dB PSNR on ODI-SR dataset and 32.95 dB PSNR on SUN360 dataset, outperforming previous methods by around 0.4 dB.
ResVR maintains a low bitrate of around 0.3 bpp for the transmitted ERP image.
Quotes
"Focusing solely on the quality of ERP images will result in sub-optimal viewport visual experiences."
"Our ResVR directly optimizes the quality of the final viewport, without the need to produce HR ERP images."