Fugu-MT 論文翻訳(概要): Whose Norms? Disentangling Cultural and Personal Alignment in Large Language Models

論文の概要: Whose Norms? Disentangling Cultural and Personal Alignment in Large Language Models

arxiv url: http://arxiv.org/abs/2606.07877v1
Date: Fri, 05 Jun 2026 22:20:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:05.500218
Title: Whose Norms? Disentangling Cultural and Personal Alignment in Large Language Models
Title（参考訳）: だれのノルムか : 大規模言語モデルにおける文化的・個人的アライメントの解消
Authors: Angana Borah, Isabelle Augenstein, Rada Mihalcea,
Abstract要約: 大規模言語モデルは、文化的規範と個人の嗜好のバランスを必要とする社会的意思決定の状況にますます使われてきている。 PACT(Personal-Preference and Cultural-Norm Trade-off framework)を導入し,モデルが文化的規範に従うか,あるいは個人の嗜好を許容するかを評価する。 PACTに関する5つの国による研究は、人間の文化のフォローは主にシナリオ国によって行われており、参加者が自身の文化的文脈を判断する際の合意は最低であることを示している。
参考スコア（独自算出の注目度）: 66.34115263472931
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models are increasingly used for social decision-making situations that require balancing cultural norms with personal preferences. For example, a user preferring honesty might ask whether to correct a coworker publicly when local norms favor indirect feedback. Yet existing research studies cultural alignment and personalization largely separately. We introduce PACT, the Personal-Preference and Cultural-Norm Trade-off framework, which evaluates whether models choose to follow a cultural norm or allow personal preferences. We find that LLMs vary in how rigidly they enforce cultural norms, with behavior shifted more by country context (7.8%) than age (1%) and gender (0.7%) and shifting non-uniformly after instruction tuning. Furthermore, our five-country human study on PACT shows that culture-following in humans is mainly driven by scenario country, with the lowest agreement when participants judge their own cultural contexts, showing within-culture pluralism. Finally, human-LLM alignment experiments show that models can match majority choices, but fail to capture response distributions and uncertainty (with best correlations reaching only 0.24). Together, these findings motivate alignment evaluations that go beyond majority to capture cultural pluralism and disagreement in social judgment.
Abstract（参考訳）: 大規模言語モデルは、文化的規範と個人の嗜好のバランスを必要とする社会的意思決定の状況にますます使われてきている。例えば、誠実さを好むユーザは、地元の規範が間接的なフィードバックを好む場合、同僚を公然と修正するかどうかを尋ねることがある。しかし、既存の研究は文化的アライメントとパーソナライゼーションを主に別々に研究している。 PACT(Personal-Preference and Cultural-Norm Trade-off framework)を導入し,モデルが文化的規範に従うか,あるいは個人の嗜好を許容するかを評価する。 LLMは、年齢(1%)や性別(0.7%)より国の文脈(7.8%)に変化し、指導指導後に一様でない行動に変化している。さらに,PACTに関する5ヶ国にわたる人的調査では,人的文化のフォローは主にシナリオ国が主導しており,参加者が自己の文化的文脈を判断し,内文化の多元性を示す場合の合意は最低であった。最後に、人間-LLMアライメント実験は、モデルが多数選択と一致するが、応答分布と不確実性(最良の相関は0.24にしか達しない)を捉えることができないことを示す。これらの知見は、文化的多元主義を捉え、社会的判断における不一致を捉えるために、多数を超えるアライメント評価を動機付けている。

論文の概要: Whose Norms? Disentangling Cultural and Personal Alignment in Large Language Models

関連論文リスト