DiffCut achieves state-of-the-art unsupervised zero-shot semantic segmentation by leveraging the semantic richness of diffusion UNet encoder features within a flexible recursive graph partitioning framework.
To mitigate the objective misalignment issue in zero-shot semantic segmentation, AlignZeg employs a comprehensive approach including Mutually-Refined Proposal Extraction, Generalization-Enhanced Proposal Classification, and Predictive Bias Correction, leading to significant improvements in unseen class recognition.
Enhancing semantic alignment and generalization in zero-shot semantic segmentation through a language-driven visual consensus approach.