Fugu-MT 論文翻訳(概要): The Cultural Gene of Large Language Models: A Study on the Impact of Cross-Corpus Training on Model Values and Biases

論文の概要: The Cultural Gene of Large Language Models: A Study on the Impact of Cross-Corpus Training on Model Values and Biases

arxiv url: http://arxiv.org/abs/2508.12411v1
Date: Sun, 17 Aug 2025 15:54:14 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-19 14:49:10.753085
Title: The Cultural Gene of Large Language Models: A Study on the Impact of Cross-Corpus Training on Model Values and Biases
Title（参考訳）: 大規模言語モデルの文化的遺伝子:クロスコルプス学習がモデル価値とバイアスに及ぼす影響に関する研究
Authors: Emanuel Z. Fenech-Borg, Tilen P. Meznaric-Kos, Milica D. Lekovic-Bojovic, Arni J. Hentze-Djurhuus,
Abstract要約: 大規模言語モデル(LLM)は世界中に展開されているが、その根底にある文化的・倫理的な前提は未解明のままである。西洋中心モデル (GPT-4) と東洋中心モデル (ERNIE Bot) を比較した。人間のアノテーションは両次元に有意かつ一貫した相違を示す。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) are deployed globally, yet their underlying cultural and ethical assumptions remain underexplored. We propose the notion of a "cultural gene" -- a systematic value orientation that LLMs inherit from their training corpora -- and introduce a Cultural Probe Dataset (CPD) of 200 prompts targeting two classic cross-cultural dimensions: Individualism-Collectivism (IDV) and Power Distance (PDI). Using standardized zero-shot prompts, we compare a Western-centric model (GPT-4) and an Eastern-centric model (ERNIE Bot). Human annotation shows significant and consistent divergence across both dimensions. GPT-4 exhibits individualistic and low-power-distance tendencies (IDV score approx 1.21; PDI score approx -1.05), while ERNIE Bot shows collectivistic and higher-power-distance tendencies (IDV approx -0.89; PDI approx 0.76); differences are statistically significant (p < 0.001). We further compute a Cultural Alignment Index (CAI) against Hofstede's national scores and find GPT-4 aligns more closely with the USA (e.g., IDV CAI approx 0.91; PDI CAI approx 0.88) whereas ERNIE Bot aligns more closely with China (IDV CAI approx 0.85; PDI CAI approx 0.81). Qualitative analyses of dilemma resolution and authority-related judgments illustrate how these orientations surface in reasoning. Our results support the view that LLMs function as statistical mirrors of their cultural corpora and motivate culturally aware evaluation and deployment to avoid algorithmic cultural hegemony.
Abstract（参考訳）: 大規模言語モデル(LLM)は世界中に展開されているが、その根底にある文化的・倫理的な前提は未解明のままである。本研究では, LLMが学習コーパスから継承する体系的価値指向である「文化的遺伝子」の概念を提案し, 個人主義・集団主義(IDV)とパワーディスタンス(PDI)の2つの古典的文化的側面を対象とする200のプロンプトからなる文化的プローブデータセット(CPD)を導入する。標準のゼロショットプロンプトを用いて、西部中心モデル(GPT-4)と東部中心モデル(ERNIE Bot)を比較した。人間のアノテーションは両次元に有意かつ一貫した相違を示す。 GPT-4は個人性および低パワー依存傾向を示す(IDV score approx 1.21; PDI score approx -1.05)が、ERNIE Botは集団性および高パワー依存傾向を示す(IDV approx -0.89; PDI approx 0.76)。さらに、ホフステデの全国得点に対する文化的アライメント指数(CAI)を算出し、GPT-4は米国(例えば、IDV CAI approx 0.91; PDI CAI approx 0.88)とより緊密に、ERNIE Botは中国(IDV CAI approx 0.85; PDI CAI approx 0.81)とより密に一致している。ジレンマ分解能の質的解析と権威に関する判断は、これらの配向が推論においてどのように表されるかを示す。本研究は,LLMが文化的コーパスの統計ミラーとして機能し,アルゴリズム的文化的ヘゲモニーを回避するために,文化的に意識された評価と展開を動機付けているという見解を支持する。

論文の概要: The Cultural Gene of Large Language Models: A Study on the Impact of Cross-Corpus Training on Model Values and Biases

関連論文リスト