Loading...
Qorǵau: Evaluating Safety in Kazakh-Russian Bilingual Contexts
Goloburda, Maiya ; Laiyk, Nurkhan ; Turmakhan, Diana ; Wang, Yuxia ; Togmanov, Mukhammed ; Mansurov, Jonibek ; Sametov, Askhat ; Mukhituly, Nurdaulet ; Wang, Minghan ; Orel, Daniil ... show 4 more
Goloburda, Maiya
Laiyk, Nurkhan
Turmakhan, Diana
Wang, Yuxia
Togmanov, Mukhammed
Mansurov, Jonibek
Sametov, Askhat
Mukhituly, Nurdaulet
Wang, Minghan
Orel, Daniil
Files
Supervisor
Department
Natural Language Processing
Embargo End Date
Type
Conference proceeding
Date
License
http://creativecommons.org/licenses/by/4.0/
Language
Collections
Research Projects
Organizational Units
Journal Issue
Abstract
Large language models (LLMs) are known to have the potential to generate harmful content, posing risks to users. While significant progress has been made in developing taxonomies for LLM risks and safety evaluation prompts, most studies have focused on monolingual contexts, primarily in English. However, language- and region-specific risks in bilingual contexts are often overlooked, and core findings can diverge from those in monolingual settings. In this paper, we introduce Qorǵau, a novel dataset specifically designed for safety evaluation in Kazakh and Russian, reflecting the unique bilingual context in Kazakhstan, where both Kazakh (a low-resource language) and Russian (a high-resource language) are spoken. Experiments with both multilingual and language-specific LLMs reveal notable differences in safety performance, emphasizing the need for tailored, region-specific datasets to ensure the responsible and safe deployment of LLMs in countries like Kazakhstan. Warning: this paper contains example data that may be offensive, harmful, or biased.
Citation
M. Goloburda, N. Laiyk, D. Turmakhan, Y. Wang, M. Togmanov, J. Mansurov, A. Sametov, N. Mukhituly, M. Wang, D. Orel, Z.M. Mujahid, F. Koto, T. Baldwin, P. Nakov, "Qorǵau: Evaluating Safety in Kazakh-Russian Bilingual Contexts," 2025, pp. 9765-9784.
Source
Findings of the Association for Computational Linguistics: ACL 2025
Conference
Findings of the Association for Computational Linguistics: ACL 2025
Keywords
Subjects
Source
Findings of the Association for Computational Linguistics: ACL 2025
Publisher
Association for Computational Linguistics
