Guided Slot Attention Network for Robust Unsupervised Video Object Segmentation
The proposed guided slot attention network leverages guided slots, feature aggregation transformer, and K-nearest neighbors filtering to effectively separate foreground and background spatial structural information, achieving state-of-the-art performance on challenging video object segmentation datasets.