Основні поняття
WaveMixSR-V2 is a highly efficient neural network architecture that achieves state-of-the-art performance in image super-resolution tasks while consuming significantly fewer resources compared to other methods.
Анотація
The paper presents WaveMixSR-V2, an enhanced version of the previous WaveMixSR model for single image super-resolution (SISR). The key improvements include:
Multi-stage Design: Instead of a single upsampling layer, WaveMixSR-V2 uses a series of 2x SR blocks to progressively increase the resolution, allowing better refinement of details at each step.
PixelShuffle: The transposed convolution in the original WaveMixSR blocks is replaced with a PixelShuffle operation followed by a convolution. This significantly reduces the number of parameters and computational cost while avoiding checkerboard artifacts.
The experiments demonstrate that WaveMixSR-V2 outperforms other state-of-the-art methods, including WaveMixSR, SwinIR, and HAT, on the BSD100 dataset for both 2x and 4x super-resolution. Notably, WaveMixSR-V2 achieves these results using less than half the number of parameters and computations compared to the previous best model, WaveMixSR.
Additionally, the paper reports that WaveMixSR-V2 exhibits higher parameter efficiency, lower latency, and higher throughput compared to WaveMixSR, making it one of the most efficient models for super-resolution tasks.
The authors also conducted ablation studies to analyze the impact of different loss functions, input resolutions, and the inclusion of Gaussian noise. The results suggest that the WaveMixSR-V2 architecture's focus on low-frequency components may not align well with the objectives of GAN training, leading to limited benefits from the GAN approach.
Статистика
WaveMixSR-V2 has 0.7M parameters and 25.6G Multi-Adds for 4x SR of 64x64 input patch, compared to 11.8M parameters and 49.6G Multi-Adds for SwinIR, and 20.8M parameters and 103.7G Multi-Adds for HAT.
WaveMixSR-V2 has a training latency of 19.6ms and a training throughput of 50.8fps, compared to 22.8ms and 43.8fps for WaveMixSR.
WaveMixSR-V2 has an inference latency of 12.1ms and an inference throughput of 82.6fps, compared to 18.6ms and 53.7fps for WaveMixSR.
Цитати
"Incorporating these improvements in the architecture has enabled WaveMixSR-V2 to achieve new state-of-the-art (SOTA) performance on the BSD100 dataset [5]."
"Notably, it accomplishes this with less than half the number of parameters, lesser computations and lower latency compared to WaveMixSR (previous SOTA)."