PEM proposes a novel prototype-based cross-attention mechanism to improve efficiency in multiple segmentation tasks. It introduces an efficient multi-scale feature pyramid network, combining deformable convolutions and context-based self-modulation. The architecture outperforms task-specific models on Cityscapes and ADE20K datasets. PEM achieves remarkable performance while being faster than competing architectures.
In un'altra lingua
dal contenuto originale
arxiv.org
Approfondimenti chiave tratti da
by Nicc... alle arxiv.org 03-01-2024
https://arxiv.org/pdf/2402.19422.pdfDomande più approfondite