Versatile Data Cleanser: A Multimodal Approach to Detecting Diverse Dirty Samples in Datasets
Leveraging multimodal large language models, the proposed Versatile Data Cleanser (VDC) framework can effectively detect various types of dirty samples, including poisoned samples and noisy labels, by quantifying the visual-linguistic inconsistency between image content and associated labels.