Fugu-MT 論文翻訳(概要): Quantifying Conversation Drift in MCP via Latent Polytope

論文の概要: Quantifying Conversation Drift in MCP via Latent Polytope

arxiv url: http://arxiv.org/abs/2508.06418v1
Date: Fri, 08 Aug 2025 16:05:27 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-11 20:39:06.294619
Title: Quantifying Conversation Drift in MCP via Latent Polytope
Title（参考訳）: 潜在ポリトープを用いたMPPの会話ドリフトの定量化
Authors: Haoran Shi, Hongwei Yao, Shuo Shao, Shaopeng Jiao, Ziqi Peng, Zhan Qin, Cong Wang,
Abstract要約: Model Context Protocol(MCP)は、外部ツールを統合することで、大きな言語モデル(LLM)を強化する。逆向きに作られたコンテンツは、ツール中毒や間接的なプロンプト注射を誘発し、会話のハイジャック、誤情報伝播、データ流出につながる。本稿では,会話のドリフト,空間軌跡の偏差を,対向的外部知識により検出し,定量化するフレームワークであるSecMCPを提案する。
参考スコア（独自算出の注目度）: 12.004235167472238
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The Model Context Protocol (MCP) enhances large language models (LLMs) by integrating external tools, enabling dynamic aggregation of real-time data to improve task execution. However, its non-isolated execution context introduces critical security and privacy risks. In particular, adversarially crafted content can induce tool poisoning or indirect prompt injection, leading to conversation hijacking, misinformation propagation, or data exfiltration. Existing defenses, such as rule-based filters or LLM-driven detection, remain inadequate due to their reliance on static signatures, computational inefficiency, and inability to quantify conversational hijacking. To address these limitations, we propose SecMCP, a secure framework that detects and quantifies conversation drift, deviations in latent space trajectories induced by adversarial external knowledge. By modeling LLM activation vectors within a latent polytope space, SecMCP identifies anomalous shifts in conversational dynamics, enabling proactive detection of hijacking, misleading, and data exfiltration. We evaluate SecMCP on three state-of-the-art LLMs (Llama3, Vicuna, Mistral) across benchmark datasets (MS MARCO, HotpotQA, FinQA), demonstrating robust detection with AUROC scores exceeding 0.915 while maintaining system usability. Our contributions include a systematic categorization of MCP security threats, a novel latent polytope-based methodology for quantifying conversation drift, and empirical validation of SecMCP's efficacy.
Abstract（参考訳）: Model Context Protocol(MCP)は、外部ツールを統合することで、大規模言語モデル(LLM)を強化し、リアルタイムデータの動的集約を可能にし、タスク実行を改善する。しかし、その非分離実行コンテキストは、セキュリティとプライバシの重大なリスクをもたらす。特に、敵対的に制作されたコンテンツは、ツール中毒や間接的なプロンプト注射を誘発し、会話のハイジャック、誤情報伝播、データ流出につながる。ルールベースのフィルタやLLM駆動検出のような既存の防御は、静的シグネチャへの依存、計算の非効率性、会話のハイジャックの定量化ができないため、依然として不十分である。このような制約に対処するため, SecMCPは, 対向的外的知識によって引き起こされる潜在空間軌道のずれを検知し, 定量化するセキュアなフレームワークである。 LLMアクティベーションベクトルを潜在ポリトープ空間内でモデル化することにより、SecMCPは会話力学における異常なシフトを識別し、ハイジャック、ミスリード、データの流出を積極的に検出することができる。我々は、ベンチマークデータセット(MS MARCO、HotpotQA、FinQA)の3つの最先端LCM(Llama3、Vicuna、Mistral)上でSecMCPを評価し、システム使用性を維持しながらAUROCスコアの0.915を超える堅牢な検出を実証した。我々の貢献には、MCPのセキュリティ脅威の体系的な分類、会話の漂流を定量化するための新しい潜在ポリトープベースの方法論、SecMCPの有効性の実証的検証が含まれる。

論文の概要: Quantifying Conversation Drift in MCP via Latent Polytope

関連論文リスト