Cultural and Linguistic Diversity in Computer Vision Datasets and Models
The author argues that human perception is not homogeneous: cultural background influences how people observe visual stimuli. By studying multilingual vision-language datasets, the author demonstrates significant differences across languages in both semantic content and linguistic expression.