A Multimodal Dataset for Fine-Grained Understanding of Regional Chinese Food Culture
FoodieQA is a manually curated multimodal dataset that captures the intricate features of food cultures across various regions in China. It is designed to probe the fine-grained understanding of vision-language models.