StereoDiffusion presents a novel approach to generating stereo image pairs without training, seamlessly integrating into Stable Diffusion models. The method involves modifying the latent variable, applying Stereo Pixel Shift operations, Symmetric Pixel Shift Masking Denoise, and Self-Attention Layers Modification to ensure consistency between left and right images. This technique achieves state-of-the-art scores in quantitative evaluations on various datasets like Middlebury and KITTI. The proposed method offers a lightweight solution for fast and high-quality stereo image generation.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Lezhong Wang... lúc arxiv.org 03-11-2024
https://arxiv.org/pdf/2403.04965.pdfYêu cầu sâu hơn