Training-Free Open-Vocabulary Semantic Segmentation with Diffusion-Augmented Prototype Generation
A training-free approach for open-vocabulary semantic segmentation that leverages diffusion-augmented visual prototypes and combines local and global similarities to segment input images.