Fugu-MT Paper Translation (Abstract): Hallucinations in Organization-backed AI advisors: Evidence about Skepticism, Verification, and Reliance in Goal-Directed Use

Paper abstract: Hallucinations in Organization-backed AI advisors: Evidence about Skepticism, Verification, and Reliance in Goal-Directed Use

arxiv url: http://arxiv.org/abs/2606.23491v1
Date: Mon, 22 Jun 2026 15:36:10 GMT
Status: Translation complete
Last updated in system: 2026-06-24 18:43:22.058175
Title: Hallucinations in Organization-backed AI advisors: Evidence about Skepticism, Verification, and Reliance in Goal-Directed Use
Authors: Simon J. Blanchard, Aaron M. Garvey, Laura O'Laughlin,
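Abstract summary: A central question for AI-advised decisions is not only whether users rely on inaccurate information, but whether they recognize that a response may require verification. The paper distinguishes three constructs that existing studies often conflate: whether users are skeptical of the information presented, whether they check it and whether checking succeeds, and whether the result of user verification affects reliance on the information.
Reference score (attention score computed independently by this site): 0.0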
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Generative AI systems are increasingly used by organizations to deliver information to consumers, patients, students, employees, and citizens. These systems can hallucinate, producing plausible but inaccurate responses. A central question for AI-advised decisions is therefore not only whether users rely on inaccurate information, but whether they recognize that a response may require verification. To answer this question, we review emerging empirical evidence relevant to hallucination detection in goal-directed interactions, with a focus on organization-backed AI advisors. We distinguish three constructs that existing studies often conflate: whether users are skeptical of the information presented, whether they check it and whether checking succeeds, and whether the result of user verification affects reliance on the information. Across studies examining product search, medical decision-making, content generation, and chatbot-assisted tasks, several patterns emerge. Nearly all studies measure reliance, while variables such as user skepticism and verification of the information are more often targeted by an intervention than measured directly. The cues used to prompt scrutiny of the AI response are predominantly related to the AI output, such as source citations, and the most deployable of these AI output interventions for organizations (general and specific warnings about the risk of hallucinations) show the weakest and most mixed effects in the studies reviewed. Although the existing literature posits that users may be more likely to scrutinize responses related to particular areas of content, no studies varied the content category, leaving this question open for further research. In future research, measuring skepticism and verification separately from reliance may clarify what current evidence shows, what it only implies, and which questions require further exploration.


Related papers