Fugu-MT 論文翻訳(概要): Does Claude's Constitution Have a Culture?

論文の概要: Does Claude's Constitution Have a Culture?

arxiv url: http://arxiv.org/abs/2603.28123v1
Date: Mon, 30 Mar 2026 07:38:46 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-31 23:18:45.280981
Title: Does Claude's Constitution Have a Culture?
Title（参考訳）: クロードの憲法は文化を持っているか?
Authors: Parham Pourdavood,
Abstract要約: 我々は,6つの価値領域にまたがる高い異文化性を示す55の世界価値調査項目について,Claude Sonnetの評価を行った。クロードの価値プロファイルは、北欧や英語圏のものと最もよく似ている。本研究は, このリスクの複合性と, グローバルに代表される憲法制定プロセスの必要性について論じる。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Constitutional AI (CAI) aligns language models with explicitly stated normative principles, offering a transparent alternative to implicit alignment through human feedback alone. However, because constitutions are authored by specific groups of people, the resulting models may reflect particular cultural perspectives. We investigate this question by evaluating Anthropic's Claude Sonnet on 55 World Values Survey items, selected for high cross-cultural variance across six value domains and administered as both direct survey questions and naturalistic advice-seeking scenarios. Comparing Claude's responses to country-level data from 90 nations, we find that Claude's value profile most closely resembles those of Northern European and Anglophone countries, but on a majority of items extends beyond the range of all surveyed populations. When users provide cultural context, Claude adjusts its rhetorical framing but not its substantive value positions, with effect sizes indistinguishable from zero across all twelve tested countries. An ablation removing the system prompt increases refusals but does not alter the values expressed when responses are given, and replication on a smaller model (Claude Haiku) confirms the same cultural profile across model sizes. These findings suggest that when a constitution is authored within the same cultural tradition that dominates the training data, constitutional alignment may codify existing cultural biases rather than correct them--producing a value floor that surface-level interventions cannot meaningfully shift. We discuss the compounding nature of this risk and the need for globally representative constitution-authoring processes.
Abstract（参考訳）: コンスティチューショナルAI(CAI)は、言語モデルを明示的な規範的原則と整合させ、人間のフィードバックだけで暗黙のアライメントに代わる透明な代替手段を提供する。しかし、憲法は特定の人々のグループによって作成されているため、その結果のモデルは特定の文化的視点を反映している可能性がある。本研究では,6つの価値領域にまたがる異文化多様度を選別し,直接調査と自然主義的アドバイス-探索のシナリオとして管理した,55の世界価値調査項目について,Arhropic's Claude Sonnetの評価を行った。 90か国の国レベルのデータに対するクロードの反応と比較すると、クロードの価値プロファイルは北欧やアングロフォン諸国のそれと最もよく似ているが、ほとんどの項目では調査対象人口の範囲を超えている。ユーザが文化的文脈を提供する場合、クロードは修辞的なフレーミングを調整するが、実質的な価値位置は調整しない。システムを取り除いたアブレーションは拒絶を促すが、応答が与えられたときに表現される値を変更せず、より小さなモデル(Claude Haiku)で複製すると、モデルサイズで同じ文化的プロファイルが確認できる。これらの結果は、トレーニングデータを支配する同じ文化伝統の中で憲法が作成されると、憲法の整合性は、それらを修正するよりも、既存の文化的偏見を成す可能性があることを示唆している。本研究は, このリスクの複合性と, グローバルに代表される憲法制定プロセスの必要性について論じる。

論文の概要: Does Claude's Constitution Have a Culture?

関連論文リスト