Fugu-MT 論文翻訳(概要): Schützen: Evaluating LLM Safety in Bulgarian and German Contexts

論文の概要: Schützen: Evaluating LLM Safety in Bulgarian and German Contexts

arxiv url: http://arxiv.org/abs/2606.11316v1
Date: Tue, 09 Jun 2026 18:01:19 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-11 16:42:38.123232
Title: Schützen: Evaluating LLM Safety in Bulgarian and German Contexts
Title（参考訳）: Schützen氏: ブルガリアとドイツの文脈でLLMの安全性を評価する
Authors: Kiril Georgiev, Yuxia Wang, Dimitar Iliyanov Dimitrov, Preslav Nakov, Ivan Koychev,
Abstract要約: 本稿では、リスク下でのモデル応答性を評価するために設計された、ドイツとブルガリアの安全データセットであるSchtzenを紹介する。多言語および言語固有のLLMを用いた実験では、安全行動の言語間差が顕著である。
参考スコア（独自算出の注目度）: 53.865251738592605
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models are increasingly deployed across professional domains, bringing hard-to-predict risks, including the generation of harmful or disrespectful content. Although substantial progress has been made in developing safety evaluation datasets, existing resources remain overwhelmingly English- and Chinese-centric. This limitation is particularly pronounced when evaluating languages that operate within shared sociocultural, legal, and ethical contexts. To address this gap, we introduce Schützen: a German--Bulgarian safety dataset designed to assess model answerability under risk, covering both a low-resource language (Bulgarian) and a high-resource language (German). Experiments with multilingual and language-specific LLMs reveal pronounced cross-language differences in safety behavior, highlighting the necessity of tailored, region-specific evaluation resources to support the responsible deployment of LLMs in Germany and Bulgaria. Datasets and code are available at https://github.com/xnlp-lab/Schutzen. Warning: this paper contains examples that may be offensive, harmful, or biased.
Abstract（参考訳）: 大規模な言語モデルは、プロのドメインにまたがって展開され、有害なコンテンツや不敬なコンテンツの生成など、予測の難しいリスクをもたらしている。安全性評価データセットの開発には大きな進歩があったが、既存の資源は英語と中国語が中心である。この制限は、共有社会文化的、法的、倫理的文脈の中で機能する言語を評価するときに特に顕著である。このギャップに対処するために、我々はSchützenを紹介します: ドイツ-ブルガリアの安全データセットは、リスク下でのモデル応答性を評価するために設計され、低リソース言語(ブルガリア語)と高リソース言語(ドイツ語)の両方をカバーする。多言語および言語固有のLSMを用いた実験は、安全行動の言語間差異を明確に示し、ドイツとブルガリアにおけるLSMの責任ある展開を支援するために、調整された地域固有の評価リソースの必要性を強調している。データセットとコードはhttps://github.com/xnlp-lab/Schutzen.comで入手できる。警告: 本論文は、攻撃的、有害、偏見のある例を含む。

論文の概要: Schützen: Evaluating LLM Safety in Bulgarian and German Contexts

関連論文リスト