Fugu-MT 論文翻訳(概要): Exposing Citation Vulnerabilities in Generative Engines

論文の概要: Exposing Citation Vulnerabilities in Generative Engines

arxiv url: http://arxiv.org/abs/2510.06823v1
Date: Wed, 08 Oct 2025 09:47:48 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-09 16:41:20.397755
Title: Exposing Citation Vulnerabilities in Generative Engines
Title（参考訳）: ジェネレーティブエンジンにおけるサイテーション脆弱性の露光
Authors: Riku Mochizuki, Shusuke Komatsu, Souta Noguchi, Kazuto Ataka,
Abstract要約: 我々は,生成エンジン(GE)が生成する回答を,引用パブリッシャとコンテンツ注入障壁の観点から分析する。誰でもウェブで情報を公開できるので、GEは毒攻撃に弱い。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We analyze answers generated by generative engines (GEs) from the perspectives of citation publishers and the content-injection barrier, defined as the difficulty for attackers to manipulate answers to user prompts by placing malicious content on the web. GEs integrate two functions: web search and answer generation that cites web pages using large language models. Because anyone can publish information on the web, GEs are vulnerable to poisoning attacks. Existing studies of citation evaluation focus on how faithfully answer content reflects cited sources, leaving unexamined which web sources should be selected as citations to defend against poisoning attacks. To fill this gap, we introduce evaluation criteria that assess poisoning threats using the citation information contained in answers. Our criteria classify the publisher attributes of citations to estimate the content-injection barrier thereby revealing the threat of poisoning attacks in current GEs. We conduct experiments in political domains in Japan and the United States (U.S.) using our criteria and show that citations from official party websites (primary sources) are approximately \(25\%\)--\(45\%\) in the U.S. and \(60\%\)--\(65\%\) in Japan, indicating that U.S. political answers are at higher risk of poisoning attacks. We also find that sources with low content-injection barriers are frequently cited yet are poorly reflected in answer content. To mitigate this threat, we discuss how publishers of primary sources can increase exposure of their web content in answers and show that well-known techniques are limited by language differences.
Abstract（参考訳）: 我々は,Web上に悪意あるコンテンツを置くことで,ユーザプロンプトに対する回答を操作するのが困難であると定義された,引用出版者の視点から,生成エンジン(GE)が生成した回答を分析した。 GEは、大きな言語モデルを使用してWebページを引用するWeb検索と回答生成という2つの機能を統合している。誰でもウェブで情報を公開できるので、GEは毒攻撃に弱い。引用評価の既存の研究は、引用されたソースをいかに忠実に答えるかに焦点が当てられており、どのWebソースを毒殺攻撃から守るための引用として選択すべきかを未検討のまま残している。このギャップを埋めるために,回答に含まれる引用情報を用いて汚染脅威を評価する評価基準を導入する。我々の基準は, 引用の出版者属性を分類し, コンテンツ注入障壁を推定することにより, 現在のGEにおける中毒攻撃の脅威を明らかにする。我々は,本基準を用いて,日本と米国(アメリカ合衆国)の政治ドメインで実験を行い,公的な政党ウェブサイト(一次資料)からの引用が米国において約 \(25\%\)-\(45\%\) であり,日本の \(60\%\)-\(65\%\) であり,米国の政治的回答が中毒攻撃のリスクが高いことを示す。また,コンテンツ注入障壁が低いソースがしばしば引用されるが,回答内容にはほとんど反映されていないことも判明した。この脅威を軽減するために,プライマリ・ソースのパブリッシャーが回答中のWebコンテンツの露出をいかに増加させるかについて議論し,言語の違いによってよく知られた技術が制限されていることを示す。

論文の概要: Exposing Citation Vulnerabilities in Generative Engines

関連論文リスト