Fugu-MT 論文翻訳(概要): What Counts as AI Sycophancy? A Taxonomy and Expert Survey of a Fragmented Construct

論文の概要: What Counts as AI Sycophancy? A Taxonomy and Expert Survey of a Fragmented Construct

arxiv url: http://arxiv.org/abs/2605.21778v1
Date: Wed, 20 May 2026 22:17:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-22 16:35:42.006488
Title: What Counts as AI Sycophancy? A Taxonomy and Expert Survey of a Fragmented Construct
Title（参考訳）: AIのサイコフィナンシーとは何か? 分類学と専門家による断片構造の調査
Authors: Meryl Ye, Lujain Ibrahim, Jessica Y. Bo, Myra Cheng, Ida Mattsson, Daniel Vennemeyer, Robert Kraut, Steve Rathje,
Abstract要約: 我々は、その行動がどのように定義され、測定されたかの分類を開発するために、70の論文をレビューした。我々は、AIの梅毒や関連分野の専門家106人を調査し、どのモデル行動が梅毒であるかについて研究者が同意するかどうかを調査した。
参考スコア（独自算出の注目度）: 8.830662211867955
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: AI sycophancy has become a prominent concern in large language model (LLM) research. Yet the term lacks a consistent definition and has been applied to behaviors ranging from agreeing with a user's false claim to excessively praising the user to withholding corrective feedback. When researchers, companies, and policymakers use the same term to describe different behaviors, evaluation results become difficult to compare, mitigation strategies fail to transfer, and systems that are resistant to one form of sycophancy continue exhibiting other forms. To address this, we make two contributions. First, we reviewed 70 papers on AI sycophancy to develop a taxonomy of how the behavior has been defined and measured. The taxonomy distinguishes (1) whether a model is sycophantic toward a user's positions and beliefs, or toward the user's broader personal traits and emotions, and (2) whether this occurs through explicit, direct language or more implicit, subtle behaviors such as framing, omission, or tone. Mapping existing literature to our taxonomy reveals that current research has focused on overt forms of sycophancy toward users' beliefs, leaving more subtle and person-directed behaviors relatively understudied. Second, we surveyed 106 experts in AI sycophancy and related fields to examine whether researchers agree on which model behaviors are sycophantic. While experts are nearly unanimous in believing that sycophancy is a significant problem in current AI systems (94.3% agree), they disagree substantially on which specific behaviors qualify. Together, these findings demonstrate that AI sycophancy is a broad family of behaviors with different measurement challenges, intervention requirements, and governance implications. Our taxonomy provides a shared vocabulary for understanding and addressing these behaviors.
Abstract（参考訳）: AI sycophancyは、大規模言語モデル(LLM)研究において顕著な関心事となっている。しかし、この用語には一貫した定義がなく、ユーザの誤った主張に同意することから、ユーザの過度に賞賛すること、修正的なフィードバックを控えることまで、様々な行動に適用されている。研究者、企業、政策立案者が、異なる行動を記述するために同じ用語を使用すると、評価結果は比較しにくくなり、緩和戦略は移行に失敗し、また、ある形態の梅毒に耐性のあるシステムは、他の形態を呈し続けている。これを解決するために、私たちは2つのコントリビューションを行います。まず、70の論文をレビューし、その行動がどのように定義され、測定されたかの分類学を開発する。分類学は、(1)モデルがユーザーの立場や信念に対してシコファン的であるか、またはユーザーのより広い個人的特性や感情に向けられているか、(2)明示的、直接的な言語、または、フレーミング、省略、トーンのようなより暗黙的な行動によって起こるか、を区別する。既存の文献を分類学にマッピングすると、現在の研究は、ユーザーの信念に対する梅毒の過剰な形態に焦点を当てており、より微妙で個人指向の行動が比較的研究されていることが分かる。第2に、AI梅毒と関連する分野の専門家106人を調査し、どのモデル行動が梅毒であるかについて、研究者が同意するかどうかを調査した。専門家は、現在のAIシステムにおいて、梅毒が重大な問題であると信じている(94.3%は同意している)が、特定の行動がどの行動に適合するかについては意見が一致しない。これらの知見は、AIの梅毒は様々な測定課題、介入要件、ガバナンスの意味を持つ幅広い行動のファミリーであることを示している。我々の分類学は、これらの行動を理解し、対処するための共通の語彙を提供する。

論文の概要: What Counts as AI Sycophancy? A Taxonomy and Expert Survey of a Fragmented Construct

関連論文リスト