Fugu-MT 論文翻訳(概要): ClawHub Security Signals: When VirusTotal, Static Analysis, and SkillSpector Disagree

論文の概要: ClawHub Security Signals: When VirusTotal, Static Analysis, and SkillSpector Disagree

arxiv url: http://arxiv.org/abs/2606.01494v1
Date: Sun, 31 May 2026 23:20:25 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-02 21:34:29.73539
Title: ClawHub Security Signals: When VirusTotal, Static Analysis, and SkillSpector Disagree
Title（参考訳）: ClawHubのセキュリティシグナル: ウイルス、静的分析、スキルスペクターが診断されたとき
Authors: Vincent Koc, Patrick Erichsen, Jacob Tomlinson, Agustin Rivera, Michael Appel, Nir Paz,
Abstract要約: ClawHub Security Signalsは67,453の最新のOpenClawスキルバージョンをサニタイズしたデータセットである。各行はSKILL.mdコンテンツと、最後のClawScanレジストリの検証と3つのスキャナーファミリからのエビデンスを含む、サニタイズされたバンドルファイルとをペアリングする。悪質なスキルの有病率を推定するよりも、スキャナの不一致を調査する。これらの結果は、エージェントスキルのセキュリティは、単一スキャナの許容/ブロック決定ではなく、階層化されたガバナンスを必要とすることを示している。
参考スコア（独自算出の注目度）: 0.4464102544889847
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Agent skills extend AI agents with reusable instructions, tools, scripts, references, and workflows, establishing a security boundary distinct from both model safety and traditional package-malware detection. ClawHub Security Signals is a sanitized dataset of 67,453 latest public OpenClaw skill versions. Each row pairs redacted SKILL.md content and sanitized bundled files where present with a final ClawScan registry verdict and evidence from three scanner families: VirusTotal, static heuristic analysis, and NVIDIA SkillSpector. Rather than estimating malicious-skill prevalence, we study scanner disagreement. The three scanners rarely flag the same skills: any pair overlaps on at most 10.4% of their combined positives, only 0.69% of skills are flagged by all three, and 81.9% of flagged skills are identified by a single scanner. The disagreement is structured by attack surface. SkillSpector, which raises semantic agentic-risk advisories rather than malware-reputation signals, is positive for 19,209 of 25,504 suspicious rows (75.3%) but only 14 of 206 malicious rows (6.8%). The malicious-verdict region shows the inverse profile: 150 of 206 malicious rows (72.8%) are VirusTotal-positive, consistent with bundled-code malware evidence. These results show that agent-skill security requires layered governance, not single-scanner allow/block decisions. The corpus is released as a sanitized silver-standard dataset: labels are the registry's automated verdicts, not human-annotated ground truth, and the release represents an early, versioned snapshot intended to support the community while a human-annotated subset is developed. Further research is encouraged, including models tailored for skill-security triage.
Abstract（参考訳）: エージェントスキルは、再利用可能な命令、ツール、スクリプト、参照、ワークフローを備えたAIエージェントを拡張し、モデルの安全性と従来のパッケージマルウェア検出の両方とは異なるセキュリティ境界を確立する。 ClawHub Security Signalsは67,453の最新のOpenClawスキルバージョンをサニタイズしたデータセットである。各行ペアは、SKILL.mdコンテンツと、最終ClawScanレジストリの検証と3つのスキャナーファミリーからのエビデンス(VirusTotal、静的ヒューリスティック分析、NVIDIA SkillSpector)を提示する、サニタイズされたバンドルファイルを再実行した。悪質なスキルの有病率を推定するよりも、スキャナの不一致を調査する。 3つのスキャナーは同じスキルにフラグを付けることは滅多にない: どのペアも少なくとも10.4%のプラスでオーバーラップし、わずか0.69%のスキルが3つすべてでフラグ付けされ、81.9%のスキルが1つのスキャナーで識別される。不一致は攻撃面によって構成される。 SkillSpectorは、マルウェアの報告信号ではなく、セマンティック・エージェント・リスク・アドバイザリを上昇させるが、25,504行(75.3%)のうち19,209行は肯定的だが、悪意のある行(6.8%)のうち14行だけが正である。 206行中150行(72.8%)はウイルスTotal陽性で、バンドルコードによるマルウェアの証拠と一致している。これらの結果は、エージェントスキルのセキュリティは、単一スキャナの許容/ブロック決定ではなく、階層化されたガバナンスを必要とすることを示している。ラベルは登録者の自動検証であり、人間に注釈を付けた真実ではない。このリリースは、人間に注釈を付けたサブセットが開発されている間、コミュニティをサポートすることを意図した、バージョン付きの初期のスナップショットを表している。スキルセキュリティトリアージに適したモデルなど、さらなる研究が奨励されている。

論文の概要: ClawHub Security Signals: When VirusTotal, Static Analysis, and SkillSpector Disagree

関連論文リスト