Core Concepts
An accurate post-training quantization framework for diffusion models that reduces quantization errors across generation timesteps and selects optimal calibration images to enable efficient image generation.
Abstract
The paper proposes an accurate post-training quantization framework for diffusion models, called APQ-DM, to enable efficient image generation.
Key highlights:
Conventional quantization frameworks use shared quantization functions across different timesteps in diffusion models, despite the significant variation in activation distributions. This leads to large quantization errors.
The calibration images are also randomly selected, failing to provide sufficient information for generalizable quantization function learning.
APQ-DM addresses these issues by:
Designing distribution-aware quantization functions, where timesteps are partitioned into groups with specific rounding functions for each group.
Employing a differentiable search strategy to acquire the optimal group assignment and rounding function parameters.
Extending the structural risk minimization (SRM) principle to actively select the optimal timesteps for informative calibration image generation.
Extensive experiments on various datasets and network architectures demonstrate that APQ-DM significantly outperforms state-of-the-art post-training quantization methods for diffusion models, achieving high-quality image generation with 6-bit weights and activations.
Stats
The paper reports the following key metrics:
Inception Score (IS) and Fréchet Inception Distance (FID) for evaluating the quality of generated images.
Quantization errors (C-Error and G-Error) for activations in the calibration and generation stages, respectively.
Quotes
"Conventional quantization frameworks learn shared quantization functions for tensor discretization regardless of the generation timesteps in diffusion models, while the activation distribution differs significantly across various timesteps."
"We also extend structural risk minimization principle for informative calibration image generation to enhance the generalization ability in the deployment of quantized diffusion model."