Fugu-MT Paper Translation (Abstract): When AI Takes Sides on Questions of Faith: Persistent Asymmetries in AI-Mediated Faith Guidance

Paper overview: When AI Takes Sides on Questions of Faith: Persistent Asymmetries in AI-Mediated Faith Guidance

arxiv url: http://arxiv.org/abs/2605.22975v2
Date: Thu, 28 May 2026 16:07:03 GMT
Status: Translation complete
Last updated in system: 2026-05-30 05:02:24.537183
Title: When AI Takes Sides on Questions of Faith: Persistent Asymmetries in AI-Mediated Faith Guidance
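Authors: Brett Israelsen, Sheryl Carty, Josh Coates, Nancy Fulda, Julie Park, Pete Whiting
Abstract summary: We ask whether large language models (LLMs) treat queries about religious conversion symmetrically. Models exhibited consistent asymmetries, favoring some religions while subtly discouraging conversion to others. Patterns varied by model size and model provider, with Grok 4.20 exhibiting the strongest asymmetries.
Reference score (independently calculated attention score): 1.163745353081629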
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We ask whether large language models (LLMs) treat queries about religious conversion symmetrically. The answer is no. When asked for advice on hypothetical faith transitions from religion A->B vs. religion B->A, models exhibited consistent asymmetries, favoring some religions while subtly discouraging conversion to others. On average Catholic, Bahá'í, and Sikh religions were broadly favored (high support for joining, low support for leaving), while Atheists, Agnostics, and Jehovah's Witnesses were primarily disfavored. Patterns varied by model size and model provider, with Grok 4.20 exhibiting the strongest asymmetries. We tested 20 commercial and open-source language models across 182 religion pairings using a human-verified LLM-as-judge framework. Each model was probed via interactions with a simulated user asking for advice on a potential faith conversion. Models tended to use more encouraging language for some faith transitions over others; these patterns were systematically repeatable across multiple trials. All LLMs tested exhibited reproducible asymmetry, though the pattern of preferences differed for each. Overall preferences persist across multiple question phrasings and variations in the religious pairing dataset. Taken together, these results suggest that asymmetry is a robust property of model behavior rather than an artifact of how the models' answers were scored. It is important to consider that any imbalances deployed and reproduced at scale can have real-world implications.
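The A->B vs. B->A comparison described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' scoring code: the function names, the 0-to-1 support scale, and the "join minus leave" summary are hypothetical choices made for illustration only. The abstract does not state how many religions were used; the sketch merely notes that 182 directed pairings is consistent with 14 labels (14 × 13 = 182).

```python
from collections import defaultdict
from itertools import permutations


def ordered_pairs(religions):
    """All directed (source, target) pairings with source != target."""
    return list(permutations(religions, 2))


def pair_asymmetry(support, a, b):
    """Difference in support for converting a->b versus b->a.

    `support` maps (source, target) to a score; the 0..1 scale used
    here is an assumption, not taken from the paper.
    """
    return support[(a, b)] - support[(b, a)]


def _mean(values):
    # Return 0.0 for an empty list so partial data does not raise.
    return sum(values) / len(values) if values else 0.0


def net_favorability(support):
    """Mean support for joining a religion minus mean support for leaving it.

    Positive values correspond to "high support for joining, low support
    for leaving"; this summary statistic is a hypothetical illustration.
    """
    join, leave = defaultdict(list), defaultdict(list)
    for (src, dst), score in support.items():
        leave[src].append(score)  # score for leaving the source religion
        join[dst].append(score)   # score for joining the target religion
    names = set(join) | set(leave)
    return {r: _mean(join[r]) - _mean(leave[r]) for r in names}


if __name__ == "__main__":
    # Placeholder labels; 14 labels give 14 * 13 = 182 directed pairs.
    labels = [f"R{i}" for i in range(14)]
    print(len(ordered_pairs(labels)))

    # Toy scores for a single pair, invented for this example.
    toy = {("A", "B"): 0.8, ("B", "A"): 0.2}
    print(pair_asymmetry(toy, "A", "B"))
    print(net_favorability(toy))
```

A symmetric model would yield pair asymmetries near zero for every pairing; under this toy summary, a consistently positive net favorability for one label would correspond to the "favored" pattern the abstract describes.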
