Fugu-MT 論文翻訳(概要): DeepSeek's WEIRD Behavior: The cultural alignment of Large Language Models and the effects of prompt language and cultural prompting

論文の概要: DeepSeek's WEIRD Behavior: The cultural alignment of Large Language Models and the effects of prompt language and cultural prompting

arxiv url: http://arxiv.org/abs/2512.09772v2
Date: Fri, 12 Dec 2025 17:25:30 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-15 13:50:29.102843
Title: DeepSeek's WEIRD Behavior: The cultural alignment of Large Language Models and the effects of prompt language and cultural prompting
Title（参考訳）: DeepSeekのWEIRD行動:大規模言語モデルの文化的アライメントと即興言語と文化的プロンプトの効果
Authors: James Luther, Donald Brown,
Abstract要約: 我々は、大規模な言語モデル(LLM)の文化的アライメントを理解するために、Hofstede氏のVSM13国際調査を使用している。我々は、特定の国を反映させるために、システムプロンプトを使ってモデルのアライメントをシフトさせる戦略である、プロンプト言語とカルチャープロンプトの組み合わせを使用する。以上の結果から,DeepSeek-V3,V3.1,OpenAIのGPT-5は米国の調査結果と密接に一致していることがわかった。また、GPT-4は英語で指示された場合、中国に近づきつつあるが、文化的な刺激は、このアライメントをアメリカに近づかせるのに有効である。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Culture is a core component of human-to-human interaction and plays a vital role in how we perceive and interact with others. Advancements in the effectiveness of Large Language Models (LLMs) in generating human-sounding text have greatly increased the amount of human-to-computer interaction. As this field grows, the cultural alignment of these human-like agents becomes an important field of study. Our work uses Hofstede's VSM13 international surveys to understand the cultural alignment of the following models: DeepSeek-V3, V3.1, GPT-4, GPT-4.1, GPT-4o, and GPT-5. We use a combination of prompt language and cultural prompting, a strategy that uses a system prompt to shift a model's alignment to reflect a specific country, to align these LLMs with the United States and China. Our results show that DeepSeek-V3, V3.1, and OpenAI's GPT-5 exhibit a close alignment with the survey responses of the United States and do not achieve a strong or soft alignment with China, even when using cultural prompts or changing the prompt language. We also find that GPT-4 exhibits an alignment closer to China when prompted in English, but cultural prompting is effective in shifting this alignment closer to the United States. Other low-cost models, GPT-4o and GPT-4.1, respond to the prompt language used (i.e., English or Simplified Chinese) and cultural prompting strategies to create acceptable alignments with both the United States and China.
Abstract（参考訳）: 文化は人間と人間の相互作用の中核的な要素であり、他者との知覚と相互作用において重要な役割を担います。音声テキスト生成におけるLarge Language Models(LLMs)の有効性の進歩は、人間とコンピュータの相互作用の量を大幅に増加させた。この分野が成長するにつれて、これらの人間のようなエージェントの文化的アライメントが重要な研究分野となる。我々の研究は、HofstedeのVSM13国際調査を用いて、DeepSeek-V3, V3.1, GPT-4, GPT-4.1, GPT-4o, GPT-5の文化的アライメントを理解する。我々は,特定の国を反映するモデルアライメントを,システムプロンプトを用いて,これらのLLMを米国と中国と整合させる,プロンプト言語と文化的プロンプトの組み合わせを用いる。以上の結果から,DeepSeek-V3,V3.1,OpenAIのGPT-5は米国の調査回答と密接な一致を示し,文化的なプロンプトを用いたり,プロンプト言語を変更したりしても,中国との強いあるいは柔らかい連携は得られないことがわかった。また、GPT-4は英語で指示された場合、中国に近い配向を示すが、文化的な促進は、この配向をアメリカに近付けるのに有効である。他の低価格モデル、GPT-4oとGPT-4.1は、使われるプロンプト言語(英語または簡体字中国語)と文化的なプロンプト戦略に対応し、米国と中国の両方と許容されるアライメントを作り出す。

論文の概要: DeepSeek's WEIRD Behavior: The cultural alignment of Large Language Models and the effects of prompt language and cultural prompting

関連論文リスト