Core Concepts
Leveraging multiple irregularly sampled low-resolution Sentinel-2 satellite images to generate a higher resolution image at a specified time, using a time-equivariant fusion module.
Abstract
The content discusses the challenge of efficiently processing and analyzing satellite imagery, which often involves a trade-off between spatial and temporal resolutions. To address this, the authors investigate multi-image super-resolution (MISR) techniques that utilize multiple low-resolution (LR) images of the same area acquired at different times to reconstruct a higher-resolution (HR) image.
The key highlights and insights are:
The authors introduce BreizhSR, a new dataset for 4x super-resolution of Sentinel-2 time series using very high-resolution SPOT-6 imagery of the Brittany region in France. This dataset complements existing datasets with wider coverage and smaller temporal discrepancies between LR and HR acquisitions.
To handle the irregular temporal sampling of satellite image time series (SITS), the authors propose a time-equivariant fusion module based on the lightweight temporal attention encoder (L-TAE). This allows the model to generate an HR image at any specified time within the time frame of the LR time series.
The authors evaluate several MISR models, including HighRes-net with the proposed L-TAE fusion module, and compare them to single-image super-resolution (SISR) approaches. The results show that MISR models, especially those using the time-equivariant fusion, outperform SISR methods, particularly when the time difference between LR and HR acquisitions is large.
The authors observe a trade-off between spectral fidelity and perceptual quality of the reconstructed HR images, raising questions about the appropriate loss functions for super-resolution of Earth Observation data.
The MISR models demonstrate the ability to reduce cloud cover in the super-resolved images compared to the input LR time series, though some residual cloud artifacts remain.
Stats
The median time difference between Sentinel-2 and SPOT-6 acquisitions is less than 10 days.
Over 78% of the SPOT-6 images have a corresponding Sentinel-2 image within 10 days.
The median number of Sentinel-2 images in the time series is 8, with a maximum of 17.
Quotes
"Satellite imaging generally presents a trade-off between the frequency of acquisitions and the spatial resolution of the images."
"One possible solution to make the most of high-temporal low-spatial data and low-temporal high-spatial data is super-resolution, an image processing technique that aims to improve the spatial resolution of an image."
"We show that using multiple images significantly improves super-resolution performance, and that a well-designed temporal positional encoding allows us to perform super-resolution for different times of the series."