核心概念
Generative models need to consider cultural contexts, as shown by DOSA dataset creation and LLM benchmarking.
統計資料
"Since the training data for LLMs is web-based and the Web is limited in its representation of information, it does not capture knowledge present within communities that are not on the Web."
"We use a gamified framework that relies on collective sensemaking to collect the names and descriptions of these artifacts such that the descriptions semantically align with the shared sensibilities of the individuals from those cultures."
引述
"Culture is a complex societal-level concept, and it can be defined by multiple factors: location, sexuality, race, nationality, language, religious beliefs, ethnicity."
"Our work offers an example of how technology evaluation can benefit from engaging community members using participatory research."