Fugu-MT Paper Translation (Abstract): PhantomLint: Principled Detection of Hidden LLM Prompts in Structured Documents

Paper abstract: PhantomLint: Principled Detection of Hidden LLM Prompts in Structured Documents

arxiv url: http://arxiv.org/abs/2508.17884v1
Date: Mon, 25 Aug 2025 10:45:10 GMT
Status: Translation complete
Last updated in system: 2025-08-26 18:43:45.736921
Title: PhantomLint: Principled Detection of Hidden LLM Prompts in Structured Documents
Title (reference translation): PhantomLint: Principled Detection of Hidden LLM Prompts in Structured Documents
Authors: Toby Murray
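Abstract summary: This paper presents the first principled approach to hidden LLM prompt detection in structured documents. We implement our approach in a prototype tool called PhantomLint. We evaluate PhantomLint against a corpus of 3,402 documents, including both PDF and HTML documents, and covering academic paper preprints, CVs, theses and more.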
Reference score (independently computed attention score): 4.441866681085517
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Hidden LLM prompts have appeared in online documents with increasing frequency. Their goal is to trigger indirect prompt injection attacks while remaining undetected from human oversight, to manipulate LLM-powered automated document processing systems, against applications as diverse as résumé screeners through to academic peer review processes. Detecting hidden LLM prompts is therefore important for ensuring trust in AI-assisted human decision making. This paper presents the first principled approach to hidden LLM prompt detection in structured documents. We implement our approach in a prototype tool called PhantomLint. We evaluate PhantomLint against a corpus of 3,402 documents, including both PDF and HTML documents, and covering academic paper preprints, CVs, theses and more. We find that our approach is generally applicable against a wide range of methods for hiding LLM prompts from visual inspection, has a very low false positive rate (approx. 0.092%), is practically useful for detecting hidden LLM prompts in real documents, while achieving acceptable performance.
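Abstract (reference translation): Hidden LLM prompts are appearing in online documents with increasing frequency. Their goal is to trigger indirect prompt injection attacks while escaping human oversight, manipulating LLM-powered automated document processing systems in applications as diverse as résumé screening and academic peer review. Detecting hidden LLM prompts is therefore important for ensuring trust in AI-assisted human decision making. This paper presents the first principled approach to detecting hidden LLM prompts in structured documents. We implement the approach in a prototype tool called PhantomLint. We evaluate PhantomLint on a corpus of 3,402 documents, including both PDF and HTML documents, covering academic paper preprints, CVs, theses and more. The approach is generally applicable against a wide range of methods for hiding LLM prompts from visual inspection, has a very low false positive rate (approx. 0.092%), is practically useful for detecting hidden LLM prompts in real documents, and achieves acceptable performance.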


List of related papers