Optimizing Downstream Rewards in Pre-Trained Diffusion Models without Fine-Tuning or Differentiable Proxies
A novel inference-time algorithm, SVDD, that optimizes downstream reward functions in pre-trained diffusion models without the need for fine-tuning or constructing differentiable proxy models.