Fugu-MT 論文翻訳(概要): Political Neutrality as Balanced Approval: A Large-Scale Human Evaluation of AI Responses

論文の概要: Political Neutrality as Balanced Approval: A Large-Scale Human Evaluation of AI Responses

arxiv url: http://arxiv.org/abs/2605.28911v1
Date: Wed, 27 May 2026 17:27:39 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-30 02:45:55.044709
Title: Political Neutrality as Balanced Approval: A Large-Scale Human Evaluation of AI Responses
Title（参考訳）: バランスの取れた承認としての政治的中立性:AI反応の大規模評価
Authors: Jonathan Stray, David Zhai Yang, Steven Luo, Miu Nicole Takagi, Serina Chang,
Abstract要約: 我々は、AIの政治的中立性の新たな定義を提案し、それをテストするために大規模なユーザスタディを設計する。議論の的になっている問題について尋ねると、AIモデルは、反対の視点を持つグループ間での承認を最大化する応答を生成するべきです。我々は、Redditに関する政治的に告発された質問と、フロンティアAIモデルからの回答から引き出された、米国の問題に関するベンチマークを構築します。すべての20の課題において、AIの反応が双方の承認率を高めることが可能であることに気付きました。
参考スコア（独自算出の注目度）: 4.452267344065157
License: http://creativecommons.org/licenses/by/4.0/
Abstract: As AI systems increasingly shape political views, defining and evaluating AI political neutrality is an urgent problem. Here, we propose a new definition of AI political neutrality and design a large-scale user study to test it, releasing a new dataset PARETO with 7,434 participants and 208,152 evaluations of AI responses. Our definition follows a simple principle grounded in political theory: when asked about a controversial issue, an AI model should generate responses that maximize approval across groups with opposing viewpoints, while balancing approval between groups. This definition allows empirical testing of whether an AI response is "neutral" and generalizes to any political context without pre-supposing a single left-right axis of division. We construct a benchmark of controversial U.S. issues, with prompts sourced from politically charged questions on Reddit and responses from frontier AI models, and recruit human participants to rate AI responses. Across all 20 issues, we find that it is possible for AI responses to achieve high rates of approval on both sides, even as those sides disagree strongly with each other on the substance of the issues. We also find that default responses lean liberal for GPT, Gemini, Claude, and Llama, but not Grok, and that user prompts with political charges are harder to respond to than neutral prompts. This work introduces a rigorous definition and benchmark of AI political neutrality, and a dataset to measure progress toward it.
Abstract（参考訳）: AIシステムが政治的見解をますます形成するにつれて、AIの政治的中立性の定義と評価は緊急の問題である。ここでは、AIの政治的中立性に関する新たな定義を提案し、それをテストするために大規模なユーザスタディを設計し、7,434人の参加者と208,152人のAI応答を評価する新しいデータセットPARETOをリリースする。議論の的になっている問題について尋ねると、AIモデルは、グループ間の承認のバランスを保ちながら、反対の視点を持つグループ間の承認を最大化する応答を生成するべきです。この定義は、AIの反応が「中立的」かどうかを実証的にテストし、分裂の1つの左右軸を前提にすることなく、あらゆる政治的文脈に一般化することができる。我々は、Redditで政治的に告発された質問とフロンティアAIモデルからの回答から引き起こされた米国の問題に関するベンチマークを構築し、AI反応を評価するために人間の参加者を募集する。 20のすべての問題において、AIの反応が双方で高い承認率を達成することが可能であることに気付きました。また、デフォルトの応答はGPT、Gemini、Claude、Llamaではリベラルだが、Grokではない。この研究は、AIの政治的中立性に関する厳密な定義とベンチマークと、それに向けた進捗を測定するデータセットを導入している。

論文の概要: Political Neutrality as Balanced Approval: A Large-Scale Human Evaluation of AI Responses

関連論文リスト