Efficient 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body Scenarios
CT-GLIP, a novel method for 3D grounded language-image pretraining, efficiently aligns organ-level visual features with precise diagnostic text descriptions to enable zero-shot organ classification and abnormality detection in full-body CT scans.
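
The zero-shot organ classification mentioned above can be illustrated with a minimal sketch. It assumes each organ region and each text prompt has already been encoded into a vector in a shared embedding space, and it picks the label whose prompt is most similar by cosine similarity. The function names (`l2_normalize`, `cosine_similarity`, `zero_shot_classify`) and the toy three-dimensional embeddings are hypothetical and chosen for illustration only; this is not the CT-GLIP implementation, whose encoders and training objective are not described here.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (hypothetical helper)."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    a_n, b_n = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))


def zero_shot_classify(organ_embedding, prompt_embeddings):
    """Return the best-matching label and all similarity scores.

    organ_embedding: visual feature for one organ region (assumed given).
    prompt_embeddings: dict mapping a label to its text embedding
        (assumed to live in the same space as the visual feature).
    """
    if not prompt_embeddings:
        raise ValueError("at least one text prompt is required")
    scores = {
        label: cosine_similarity(organ_embedding, emb)
        for label, emb in prompt_embeddings.items()
    }
    best_label = max(scores, key=scores.get)
    return best_label, scores


# Toy embeddings, invented for illustration; real ones would come from
# the image and text encoders.
toy_prompts = {
    "liver": [1.0, 0.0, 0.0],
    "spleen": [0.0, 1.0, 0.0],
    "kidney": [0.0, 0.0, 1.0],
}
toy_organ = [0.9, 0.1, 0.2]

predicted_label, similarity_scores = zero_shot_classify(toy_organ, toy_prompts)
print(predicted_label)
```

Because classification reduces to comparing against text embeddings, new organ or abnormality categories can in principle be added by supplying new prompts, without retraining a classifier head.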