SESGO: Spanish Evaluation of Stereotypical Generative Outputs

Melissa Robles; C. Bernal Bellido; Denniss Raigoso; Mateo Dulce Rubio

doi:10.1609/aies.v8i3.36707

SESGO: Spanish Evaluation of Stereotypical Generative Outputs

dc.contributor.author	Melissa Robles
dc.contributor.author	C. Bernal Bellido
dc.contributor.author	Denniss Raigoso
dc.contributor.author	Mateo Dulce Rubio
dc.coverage.spatial	Bolivia
dc.date.accessioned	2026-03-22T19:47:02Z
dc.date.available	2026-03-22T19:47:02Z
dc.date.issued	2025
dc.description.abstract	This paper addresses the critical gap in evaluating bias in multilingual Large Language Models (LLMs), with a specific focus on Spanish language within culturally-aware Latin American contexts. Despite widespread global deployment, current evaluations remain predominantly US-English-centric, leaving potential harms in other linguistic and cultural contexts largely underexamined. We introduce a novel, culturally-grounded framework for detecting social biases in instruction-tuned LLMs. Our approach adapts the underspecified question methodology from the BBQ dataset by incorporating culturally-specific expressions and sayings that encode regional stereotypes across four social categories: gender, race, socioeconomic class, and national origin. Using more than 4,000 prompts, we propose a new metric that combines accuracy with the direction of error to effectively balance model performance and bias alignment in both ambiguous and disambiguated contexts. To our knowledge, our work presents the first systematic evaluation examining how leading commercial LLMs respond to culturally specific bias in the Spanish language, revealing varying patterns of bias manifestation across state-of-the-art models. We also contribute evidence that bias mitigation techniques optimized for English do not effectively transfer to Spanish tasks, and that bias patterns remain largely consistent across different sampling temperatures. Our modular framework offers a natural extension to new stereotypes, bias categories, or languages and cultural contexts, representing a significant step toward more equitable and culturally-aware evaluation of AI systems in the diverse linguistic environments where they operate.
dc.identifier.doi	10.1609/aies.v8i3.36707
dc.identifier.uri	https://doi.org/10.1609/aies.v8i3.36707
dc.identifier.uri	https://andeanlibrary.org/handle/123456789/78094
dc.language.iso	en
dc.relation.ispartof	Proceedings of the AAAI/ACM Conference on AI Ethics and Society
dc.source	Universidad de Los Andes
dc.subject	Computer science
dc.subject	Converse
dc.subject	Focus (optics)
dc.subject	Artificial intelligence
dc.subject	Natural language processing
dc.subject	Generative grammar
dc.subject	Cognitive psychology
dc.subject	Cultural bias
dc.subject	Socioeconomic status
dc.subject	Linguistics
dc.title	SESGO: Spanish Evaluation of Stereotypical Generative Outputs
dc.type	article

Collections

Artículo Científico Publicado

SESGO: Spanish Evaluation of Stereotypical Generative Outputs

Files

Collections