Fugu-MT 論文翻訳(概要): Contractual Skills: A GovernSpec Design Framework for Enterprise AI Agents

論文の概要: Contractual Skills: A GovernSpec Design Framework for Enterprise AI Agents

arxiv url: http://arxiv.org/abs/2605.22634v1
Date: Thu, 21 May 2026 15:40:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-22 16:35:42.32768
Title: Contractual Skills: A GovernSpec Design Framework for Enterprise AI Agents
Title（参考訳）: Contractual Skills: エンタープライズAIエージェントのためのGovernSpecデザインフレームワーク
Authors: Ting Liu,
Abstract要約: 本稿では,SKILL.mdファイルを可読性のあるタスクコントラクトとして整理するための,GovernSpecにインスパイアされた設計フレームワークであるコントラクトスキルを提案する。このフレームワークは、コントラクトスキル、GovernSpec YAMLコントラクト、Model Context Protocolサーフェス、ツールアダプタ、ランタイムガードレール、トレース、評価システムの境界を明確にしている。
参考スコア（独自算出の注目度）: 8.419155861590548
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Skills are increasingly used to package agent instructions, workflows, scripts, and reference materials. In enterprise settings, however, skills often need to express more than task guidance: they must make goals, input boundaries, permissions, evidence requirements, output contracts, quality criteria, verification steps, human approval points, and handoff rules inspectable. This paper proposes contractual skills, a GovernSpec-inspired design framework for organizing SKILL.md files as readable task contracts while preserving lightweight skill discovery and progressive loading. The framework clarifies the boundary between contractual skills, GovernSpec YAML contracts, Model Context Protocol surfaces, tool adapters, runtime guardrails, tracing, and evaluation systems. We evaluate the framework with two offline experiments. A text-generation study covers three enterprise skills, fifteen synthetic tasks, four instruction conditions, and eight generation models, yielding 960 outputs and 1680 cross-judge score records. Contractual skills outperform no-skill and minimal-skill baselines on all tested models. Relative to information-rich plain expanded skills, the gains are small and mixed, suggesting that contractual fields mainly improve checkability and maintainability rather than raw generation quality. A tool-calling challenge covers eight models and 192 simulated tool-call records. Skills usually reduce high-risk tool attempts, but model differences remain and runtime tool guardrails are still required. The results suggest that contractual skills are best understood as a governance layer that makes task intent, boundaries, and acceptance criteria explicit, not as a standalone safety mechanism.
Abstract（参考訳）: エージェント命令、ワークフロー、スクリプト、リファレンス資料のパッケージ化には、スキルがますます使われています。目標、入力境界、許可、エビデンス要件、出力契約、品質基準、検証ステップ、人間承認ポイント、検査可能なハンドオフルールをしなければならない。本稿では、軽量なスキル発見とプログレッシブローディングを保ちながら、SKILL.mdファイルを読みやすいタスクコントラクトとして整理する、GovernSpecにインスパイアされた設計フレームワークであるコントラクトスキルを提案する。このフレームワークは、コントラクトスキル、GovernSpec YAMLコントラクト、Model Context Protocolサーフェス、ツールアダプタ、ランタイムガードレール、トレース、評価システムの境界を明確にしている。このフレームワークを2つのオフライン実験で評価する。テキスト生成調査では,3つの企業スキル,15の合成タスク,4つの指導条件,および8つの世代モデルを対象として,960のアウトプットと1680のクロスジャッジスコアを記録した。契約スキルは、テストされたすべてのモデルにおいて、非スキルと最小スキルのベースラインを上回ります。情報に富んだ平易な拡張スキルとは対照的に、ゲインは小さく混ざり合っており、契約分野は生の世代品質よりも、主にチェック容易性と保守性を改善することが示唆されている。ツールコールの課題は、8つのモデルと192のシミュレーションツールコールレコードをカバーする。スキルは通常、リスクの高いツールの試みを減らすが、モデルの違いは残り、実行時のツールガードレールは依然として必要である。その結果、契約上のスキルは、独立した安全メカニズムとしてではなく、タスク意図、バウンダリ、受け入れ基準を明確にするガバナンス層として理解されていることが示唆された。

論文の概要: Contractual Skills: A GovernSpec Design Framework for Enterprise AI Agents

関連論文リスト