Fugu-MT 論文翻訳(概要): AlignCultura: Towards Culturally Aligned Large Language Models?

論文の概要: AlignCultura: Towards Culturally Aligned Large Language Models?

arxiv url: http://arxiv.org/abs/2604.19016v1
Date: Tue, 21 Apr 2026 03:06:50 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-22 22:41:49.588377
Title: AlignCultura: Towards Culturally Aligned Large Language Models?
Title（参考訳）: AlignCultura: 文化的に規定された大規模言語モデルを目指して?
Authors: Gautam Siddharth Kashyap, Mark Dras, Usman Naseem,
Abstract要約: アリン・カルチュラ(Align-Cultura)は、文化的アライメントのための2段階のパイプラインである。 CulTURAXはユネスコの文化分類に基づくHHH- Englishデータセットである。 Stage IIはCULTURAXを、汎用モデル、文化的に微調整されたモデル、オープンウェイトLLMでベンチマークする。実証的に、文化的に微調整されたモデルは、関節HHHを4%-6%改善し、文化的な失敗を18%削減し、10%-12%の効率向上を実現し、漏れを0.3%に制限した。
参考スコア（独自算出の注目度）: 19.134202394422285
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Cultural alignment in Large Language Models (LLMs) is essential for producing contextually aware, respectful, and trustworthy outputs. Without it, models risk generating stereotyped, insensitive, or misleading responses that fail to reflect cultural diversity w.r.t Helpful, Harmless, and Honest (HHH) paradigm. Existing benchmarks represent early steps toward cultural alignment; yet, no benchmarks currently enables systematic evaluation of cultural alignment in line with UNESCO's principles of cultural diversity w.r.t HHH paradigm. Therefore, to address this gap, we built Align-Cultura, two-stage pipeline for cultural alignment. Stage I constructs CULTURAX, the HHH-English dataset grounded in the UNESCO cultural taxonomy, through Query Construction, which reclassifies prompts, expands underrepresented domains (or labels), and prevents data leakage with SimHash. Then, Response Generation pairs prompts with culturally grounded responses via two-stage rejection sampling. The final dataset contains 1,500 samples spanning 30 subdomains of tangible and intangible cultural forms. Stage II benchmarks CULTURAX on general-purpose models, culturally fine-tuned models, and open-weight LLMs (Qwen3-8B and DeepSeek-R1-Distill-Qwen-7B). Empirically, culturally fine-tuned models improve joint HHH by 4%-6%, reduce cultural failures by 18%, achieve 10%-12% efficiency gains, and limit leakage to 0.3%.
Abstract（参考訳）: 大規模言語モデル(LLM)における文化的アライメントは、文脈的に認識され、尊敬され、信頼できるアウトプットを生成するために不可欠である。それなしでは、モデルは文化的多様性を反映しないステレオタイプ、非感受性、または誤解を招く応答を生成するリスクを負う。既存のベンチマークは、カルチャーアライメントに向けた初期のステップを表しているが、UNESCOの文化的多様性の原則であるHHHパラダイムに従って、現在、カルチャーアライメントを体系的に評価できるベンチマークは存在しない。そのため、このギャップに対処するため、文化的なアライメントのための2段階パイプラインであるAlign-Culturaを構築しました。 Stage Iは、UNESCO文化分類に基づくHHH- EnglishデータセットであるCULTURAXを構築する。Query Constructionは、プロンプトを再分類し、表現されていないドメイン(またはラベル)を拡張し、SimHashによるデータ漏洩を防ぐ。次に、レスポンスジェネレーションペアは、2段階の拒絶サンプリングを通じて文化的に根拠付けられた応答をプロンプトする。最終データセットには、30のサブドメインにまたがる、有形および無形文化の1500のサンプルが含まれている。 Stage IIはCULTURAXを、汎用モデル、文化的に微調整されたモデル、オープンウェイトLLM(Qwen3-8BとDeepSeek-R1-Distill-Qwen-7B)でベンチマークしている。実証的に、文化的に微調整されたモデルは、関節HHHを4%-6%改善し、文化的な失敗を18%削減し、10%-12%の効率向上を実現し、漏れを0.3%に制限した。

論文の概要: AlignCultura: Towards Culturally Aligned Large Language Models?

関連論文リスト