High train-test similarity alone does not explain CLIP's exceptional performance; other training data properties play a crucial role.