Masked AutoDecoder: Effective Multi-Task Vision Generalist
Masked AutoDecoder (MAD) is a multi-task vision generalist that combines parallel decoding with masked sequence modeling to handle a range of vision tasks efficiently and accurately.
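
To make the two ingredients concrete, here is a minimal toy sketch of masked sequence modeling followed by parallel decoding. It is not the authors' implementation: the names `MASK`, `mask_tokens`, `parallel_decode`, and `toy_predict` are hypothetical, the mask ratio is arbitrary, and `toy_predict` is a hand-written stand-in for a learned decoder. It only illustrates the general idea that a random subset of task tokens is hidden and then every hidden position is filled in a single pass, rather than one token at a time.

```python
import random

MASK = -1  # hypothetical mask token id, chosen only for this illustration


def mask_tokens(tokens, mask_ratio, rng):
    """Replace a random subset of tokens with MASK.

    Returns the masked copy and the sorted list of masked positions.
    """
    n_mask = max(1, round(len(tokens) * mask_ratio))
    positions = sorted(rng.sample(range(len(tokens)), n_mask))
    masked = list(tokens)
    for p in positions:
        masked[p] = MASK
    return masked, positions


def parallel_decode(masked, predict):
    """Fill every MASK position in one pass.

    Each prediction sees only the masked input, never another
    prediction, so all positions could be computed simultaneously.
    """
    return [predict(masked, i) if t == MASK else t
            for i, t in enumerate(masked)]


def toy_predict(masked, i):
    """Stand-in for a learned decoder (assumption for this sketch):
    copy the nearest visible token to the left, or 0 if none exists."""
    for j in range(i - 1, -1, -1):
        if masked[j] != MASK:
            return masked[j]
    return 0


if __name__ == "__main__":
    rng = random.Random(0)
    tokens = [5, 5, 7, 7, 9, 9]
    masked, positions = mask_tokens(tokens, 0.5, rng)
    print("masked input:", masked)
    print("decoded:     ", parallel_decode(masked, toy_predict))
```

In contrast, an autoregressive decoder would need one step per output token, each conditioned on the tokens generated before it; the single-pass fill above is what makes parallel decoding cheaper at inference time.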