Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection
This paper introduces Incremental Vision-Language Object Detection (IVLOD), a novel learning task in which Vision-Language Object Detection Models (VLODMs) are incrementally adapted to specialized domains while preserving their zero-shot generalization. To address this task efficiently, the authors propose Zero-interference Reparameterizable Adaptation (ZiRa).
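
The summary does not describe how ZiRa works internally, so the following is only a generic illustration of what "reparameterizable" usually means for an adaptation module, not the paper's implementation. It assumes a hypothetical linear layer with a parallel linear side branch that is trained on the new domain; because both paths are linear, the branch can be folded into the base weights afterwards, so inference costs no more than the original layer. All names and values (`base_weight`, `branch_weight`, `merge_branch`) are made up for this sketch.

```python
# Generic sketch of structural reparameterization (assumed setup, not the
# paper's code): a linear side branch is merged into the base weights.

def matvec(weight, x):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * v for w, v in zip(row, x)) for row in weight]


def forward_with_branch(base_weight, branch_weight, x):
    """Training-time forward pass: base path plus parallel side branch."""
    base_out = matvec(base_weight, x)
    branch_out = matvec(branch_weight, x)
    return [b + d for b, d in zip(base_out, branch_out)]


def merge_branch(base_weight, branch_weight):
    """Fold the side branch into the base weights: W' = W + D."""
    return [
        [w + d for w, d in zip(base_row, branch_row)]
        for base_row, branch_row in zip(base_weight, branch_weight)
    ]


# Hypothetical 2x2 weights and input, chosen only for illustration.
base_weight = [[1.0, 2.0], [0.0, -1.0]]      # stands in for pretrained weights
branch_weight = [[0.5, 0.0], [0.25, 0.5]]    # stands in for the learned adaptation
x = [2.0, 4.0]

two_branch_output = forward_with_branch(base_weight, branch_weight, x)
merged_output = matvec(merge_branch(base_weight, branch_weight), x)

print(two_branch_output)  # [11.0, -1.5]
print(merged_output)      # [11.0, -1.5]
```

The two outputs agree because W x + D x = (W + D) x for linear maps. Whether ZiRa's branches take this exact form is not stated in the summary above.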