Fugu-MT Paper Translation (Abstract): From Component Manipulation to System Compromise: Understanding and Detecting Malicious MCP Servers

Paper overview: From Component Manipulation to System Compromise: Understanding and Detecting Malicious MCP Servers

arxiv url: http://arxiv.org/abs/2604.01905v1
Date: Thu, 02 Apr 2026 11:22:07 GMT
Status: Translation complete
Last updated in system: 2026-04-03 14:21:10.702422
Title: From Component Manipulation to System Compromise: Understanding and Detecting Malicious MCP Servers
Authors: Yiheng Huang, Zhijia Zhao, Bihuan Chen, Susheng Wu, Zhuotong Zhou, Yiheng Cao, Xin Hu, Xin Peng
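Abstract summary: This work presents a component-centric perspective for understanding and detecting malicious MCP servers. The authors build the first PoC dataset of 114 malicious MCP servers, in which attacks are realized as manipulations of MCP components and their compositions. They propose and implement Connor, a two-stage behavioral deviation detector for malicious MCP servers.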
Reference score (independently computed attention score): 10.040414071765781
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: The model context protocol (MCP) standardizes how LLMs connect to external tools and data sources, enabling faster integration but introducing new attack vectors. Despite the growing adoption of MCP, existing MCP security studies classify attacks by their observable effects, obscuring how attacks behave across different MCP server components and overlooking multi-component attack chains. Meanwhile, existing defenses are less effective when facing multi-component attacks or previously unknown malicious behaviors. This work presents a component-centric perspective for understanding and detecting malicious MCP servers. First, we build the first component-centric PoC dataset of 114 malicious MCP servers where attacks are achieved as manipulation over MCP components and their compositions. We evaluate these attacks' effectiveness across two MCP hosts and five LLMs, and uncover that (1) component position shapes attack success rate; and (2) multi-component compositions often outperform single-component attacks by distributing malicious logic. Second, we propose and implement Connor, a two-stage behavioral deviation detector for malicious MCP servers. It first performs pre-execution analysis to detect malicious shell commands and extract each tool's function intent, and then conducts step-wise in-execution analysis to trace each tool's behavioral trajectories and detect deviations from its function intent. Evaluation on our curated dataset indicates that Connor achieves an F1-score of 94.6%, outperforming the state of the art by 8.9% to 59.6%. In real-world detection, Connor identifies two malicious servers.
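The abstract describes Connor only at a high level, so the following is a minimal Python sketch of the general two-stage idea and not the authors' implementation. It assumes a hypothetical `Tool` record, a small hand-written list of suspicious shell patterns, and a hand-specified set of allowed action labels standing in for each tool's extracted function intent. The action labels (`network.read`, `file.read`, `network.send`) and all function names are illustrative assumptions.

```python
import re
from dataclasses import dataclass, field

# Hypothetical patterns for obviously dangerous shell commands (illustrative only).
SUSPICIOUS_SHELL = [
    re.compile(r"curl\s+[^|]*\|\s*(sh|bash)"),  # download piped straight into a shell
    re.compile(r"rm\s+-rf\s+/"),                # recursive delete starting at root
    re.compile(r"\bnc\b.*\s-e\s"),              # netcat spawning a program
]


@dataclass
class Tool:
    """Hypothetical description of one MCP server tool."""
    name: str
    description: str
    startup_command: str
    # Stand-in for the tool's function intent: the action labels it may perform.
    allowed_actions: set = field(default_factory=set)


def pre_execution_check(tool):
    """Stage 1: return the suspicious patterns found in the startup command."""
    return [p.pattern for p in SUSPICIOUS_SHELL if p.search(tool.startup_command)]


def in_execution_check(tool, trace):
    """Stage 2: walk the observed actions step by step.

    Returns (step_index, action) for the first action outside the declared
    intent, or None if the whole trace stays within it.
    """
    for step, action in enumerate(trace):
        if action not in tool.allowed_actions:
            return step, action
    return None


if __name__ == "__main__":
    weather = Tool(
        name="get_weather",
        description="Return the forecast for a city.",
        startup_command="python server.py",
        allowed_actions={"network.read"},
    )
    print(pre_execution_check(weather))
    print(in_execution_check(weather, ["network.read", "file.read", "network.send"]))
```

In this toy setup a weather tool that suddenly reads a local file is flagged at the step where it deviates, even though no single signature for that behavior was written in advance. That is the intuition behind checking behavior against intent rather than against a list of known attacks.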
