Core Concepts
Enhanced generative image compression method EGIC efficiently traverses the distortion-perception curve using semantic segmentation guidance and output residual prediction.
Abstract
EGIC introduces a novel approach to generative image compression, combining semantic segmentation guidance and output residual prediction for efficient traversal of the distortion-perception curve. The method outperforms state-of-the-art diffusion and GAN-based methods while being simple to implement and storage-efficient. By incorporating OASIS-C and ORP, EGIC provides excellent interpolation characteristics, making it suitable for practical applications targeting low bit ranges.
Stats
DIRAC-100 achieves 0.157bpp at 1.11x PSNR.
HiFiC operates at 0.172bpp with 1.08x PSNR.
MS-ILLM performs at 0.164bpp with 1.03x PSNR.
EGIC achieves 0.159bpp with α=1.0.
EGIC also reaches 0.159bpp with α=0.0.
Quotes
"EGIC forms a powerful codec, outperforming state-of-the-art diffusion and GAN-based methods."
"ORP is a lightweight retrofit solution for multi-realism image compression."
"OASIS-C provides spatially and semantically-aware gradient feedback to the generator."