Core Concepts
Even the best open-source language models struggle to comprehend the diverse cultures across eleven Indonesian provinces, with the highest accuracy reaching only 53.2%. Incorporating location context significantly enhances model performance, especially in larger models like GPT-4.
Abstract
This paper introduces IndoCulture, a novel dataset to evaluate cultural commonsense reasoning across eleven Indonesian provinces. The dataset was manually developed by local experts in each province based on predefined topics.
The key highlights and insights from the study are:
All open-source language models, including multilingual and Indonesian-centric models, exhibit limited understanding of Indonesian cultures, in contrast to the 100% accuracy achieved by human experts.
The multiple-choice question method generally outperforms the sentence completion method, with exceptions for some smaller Indonesian-centric models.
Incorporating location context, especially at the province level, significantly boosts the performance of larger language models like GPT-4, emphasizing the importance of geographical context in commonsense reasoning.
Models perform better on cultures from specific provinces like Bali and West Java, likely due to the abundance of training data on these regions, highlighting the risk of cultural biases in language models.
Analysis of fine-grained cultural elements shows that models struggle most with cultural norms and language-specific aspects, compared with other elements such as artifacts and rituals.
Manual evaluation of model-generated explanations reveals a significant gap between the models' ability to select the correct answer and their ability to provide a reasonable justification for it, especially for open-source models.
The findings underscore how challenging the IndoCulture dataset is and the need for more inclusive, geographically aware language models that can reason effectively about diverse cultural contexts.
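The multiple-choice setup and the effect of location context described above can be illustrated with a minimal sketch. The prompt template, the function names (build_mcq_prompt, accuracy), and the three location levels are assumptions made for illustration; they are not the templates or code used in the paper.

```python
from typing import Optional, Sequence

# Hypothetical granularity levels for location context (an assumption, not the paper's exact setup).
LOCATION_LEVELS = ("none", "country", "province")
OPTION_LETTERS = "ABCDEFGHIJ"


def build_mcq_prompt(
    premise: str,
    options: Sequence[str],
    province: Optional[str] = None,
    level: str = "none",
) -> str:
    """Build a multiple-choice prompt, optionally prefixed with location context."""
    if level not in LOCATION_LEVELS:
        raise ValueError(f"level must be one of {LOCATION_LEVELS}")
    if len(options) > len(OPTION_LETTERS):
        raise ValueError("too many options")

    if level == "province":
        if not province:
            raise ValueError("province is required when level='province'")
        context = f"Location: {province}, Indonesia\n"
    elif level == "country":
        context = "Location: Indonesia\n"
    else:
        context = ""

    # Label each candidate continuation with a letter: A, B, C, ...
    option_lines = [f"{OPTION_LETTERS[i]}. {opt}" for i, opt in enumerate(options)]
    return f"{context}Premise: {premise}\n" + "\n".join(option_lines) + "\nAnswer:"


def accuracy(predictions: Sequence[str], gold: Sequence[str]) -> float:
    """Return the percentage of predictions that match the gold labels."""
    if len(predictions) != len(gold):
        raise ValueError("predictions and gold must have the same length")
    if not gold:
        return 0.0
    correct = sum(p == g for p, g in zip(predictions, gold))
    return 100.0 * correct / len(gold)


if __name__ == "__main__":
    # Placeholder text only; these are not items from the dataset.
    prompt = build_mcq_prompt(
        "premise text", ["option one", "option two"], province="Bali", level="province"
    )
    print(prompt)
    print(accuracy(["A", "B"], ["A", "A"]))
```

Running the same questions through each location level and comparing the resulting accuracy values is one way to measure how much geographical context helps a given model.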
Stats
The fat bodies of female dancers are believed to be symbols of prosperity.
The fat bodies of female dancers are believed to be symbols of beauty.
Emi secluded herself in the forest due to the Korowai tribe's belief that pregnant women were vulnerable to attacks by evil spirits.
Aldia wore a rencong around her waist.
Quotes
"Culture is a multifaceted concept encompassing the way of life, including our thoughts and actions."
"Indonesia is a highly multicultural country, home to over 1,300 recognized ethnic groups and more than 700 languages."