PEM proposes a novel prototype-based cross-attention mechanism to improve efficiency in multiple segmentation tasks. It introduces an efficient multi-scale feature pyramid network, combining deformable convolutions and context-based self-modulation. The architecture outperforms task-specific models on Cityscapes and ADE20K datasets. PEM achieves remarkable performance while being faster than competing architectures.
To Another Language
from source content
arxiv.org
Principais Insights Extraídos De
by Nicc... às arxiv.org 03-01-2024
https://arxiv.org/pdf/2402.19422.pdfPerguntas Mais Profundas