SeeGULL Multilingual is a dataset created by Google Research to address the lack of cross-cultural considerations in safety evaluations of generative multilingual models. The dataset contains over 25,000 stereotypes across 20 languages and 23 regions, providing insights into geo-cultural factors influencing stereotypes. By leveraging LLM generations and human annotations, the dataset aims to improve model evaluations and safeguard against harmful stereotypes. The resource is publicly available to foster research in this domain and enhance multilingual model safety.
The paper stresses that model safety must be evaluated from a multicultural perspective to prevent the harm that stereotypes can cause. It argues for stereotype resources beyond English, since many salient stereotypes are specific to particular languages and regions. Through culturally situated validation and offensiveness annotations, SeeGULL Multilingual offers a broader basis for understanding and mitigating biases in generative models.
The dataset is built in three steps: identifying salient identity terms, generating candidate associations with PaLM-2, and validating them through culturally situated human annotation. Annotations are collected for both stereotype validity and offensiveness across the covered languages and regions. The paper also examines the overlap with the English-language SeeGULL dataset and shows that which stereotypes are considered offensive differs from country to country.
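The annotation step described above can be illustrated with a minimal sketch. It assumes a hypothetical record layout (`Annotation`), a hypothetical agreement threshold (`min_votes`), and an ordinal offensiveness scale. None of these names or values come from the paper, and the identity and attribute strings are neutral placeholders rather than dataset entries.

```python
from collections import defaultdict
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class Annotation:
    """One annotator's judgment of a candidate (identity, attribute) pair.

    Hypothetical layout for illustration; not the dataset's actual schema.
    """
    language: str
    region: str
    identity: str
    attribute: str
    is_stereotype: bool
    offensiveness: int  # assumed ordinal scale, higher = more offensive


def aggregate(annotations, min_votes=2):
    """Keep pairs that at least `min_votes` annotators marked as stereotypes.

    Returns a dict keyed by (language, region, identity, attribute) holding
    the vote count and the mean offensiveness over all annotators of the pair.
    """
    grouped = defaultdict(list)
    for a in annotations:
        grouped[(a.language, a.region, a.identity, a.attribute)].append(a)

    results = {}
    for key, group in grouped.items():
        votes = sum(a.is_stereotype for a in group)
        if votes >= min_votes:
            results[key] = {
                "votes": votes,
                "mean_offensiveness": mean(a.offensiveness for a in group),
            }
    return results


# Placeholder data: values are invented purely to exercise the function.
toy = [
    Annotation("fr", "France", "identity_A", "attribute_X", True, 2),
    Annotation("fr", "France", "identity_A", "attribute_X", True, 3),
    Annotation("fr", "France", "identity_A", "attribute_X", False, 1),
    Annotation("fr", "France", "identity_B", "attribute_Y", True, 0),
    Annotation("fr", "France", "identity_B", "attribute_Y", False, 0),
]
validated = aggregate(toy)
```

Keying the aggregation on language and region together keeps the validation culturally situated: the same pair can be retained for one region and dropped for another.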
Furthermore, evaluating foundation models with SeeGULL Multilingual shows that the rate at which models endorse stereotypes varies across languages. These results underscore the importance of multilingual evaluation for model safety and expose disparities in stereotype endorsement by language and region.
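As a rough illustration of how a per-language endorsement rate could be computed, the sketch below assumes each evaluation item has already been reduced to a `(language, endorsed)` pair, where `endorsed` records whether the model's response agreed with the stereotype. This record format and the function name are assumptions for illustration, not the paper's evaluation protocol.

```python
from collections import Counter


def endorsement_rate_by_language(records):
    """Return the fraction of items per language where the model endorsed
    the stereotype.

    `records` is an iterable of (language, endorsed) pairs; this format is
    a hypothetical simplification of an evaluation log.
    """
    totals = Counter()
    endorsed = Counter()
    for language, did_endorse in records:
        totals[language] += 1
        endorsed[language] += int(did_endorse)
    return {lang: endorsed[lang] / totals[lang] for lang in totals}


# Placeholder records: invented values, not results from the paper.
sample = [
    ("lang_1", True),
    ("lang_1", False),
    ("lang_1", False),
    ("lang_1", True),
    ("lang_2", False),
    ("lang_2", True),
    ("lang_2", False),
    ("lang_2", False),
]
rates = endorsement_rate_by_language(sample)
```

Reporting the rate per language, rather than one pooled figure, is what makes the disparities between languages visible.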
Overall, SeeGULL Multilingual serves as a valuable resource for researchers and developers to enhance model safeguards against harmful stereotypes through a global-scale perspective.
Key insights drawn from the original content by Mukul Bhutan... at arxiv.org, 03-12-2024
https://arxiv.org/pdf/2403.05696.pdf