Fugu-MT 論文翻訳(概要): SkillRT: Compiling Skills for Efficient Execution Everywhere

論文の概要: SkillRT: Compiling Skills for Efficient Execution Everywhere

arxiv url: http://arxiv.org/abs/2604.03088v1
Date: Fri, 03 Apr 2026 15:11:45 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-06 17:20:24.504523
Title: SkillRT: Compiling Skills for Efficient Execution Everywhere
Title（参考訳）: SkillRT: あらゆる場所で効率的な実行スキルをコンパイルする
Authors: Le Chen, Erhu Feng, Yubin Xia, Haibo Chen,
Abstract要約: LLMエージェントは、再利用可能な構成単位としてスキルをますます採用している。現在のシステムは、スキルを生のコンテキストとして扱い、異なるエージェントに対して同じスキルを一貫性なく振る舞う。本稿では,ポータブルかつ効率的なスキル実行を目的としたコンパイルおよび実行システムであるSkillRTを提案する。
参考スコア（独自算出の注目度）: 8.617039060377483
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: LLM agents increasingly adopt skills as a reusable unit of composition. While skills are shared across diverse agent platforms, current systems treat them as raw context, causing the same skill to behave inconsistently for different agents. This fragility undermines skill portability and execution efficiency. To address this challenge, we analyze 118,000 skills and draw inspiration from traditional compiler design. We treat skills as code and LLMs as heterogeneous processors. To make portability actionable, we decompose a skill's requirements into a set of primitive capabilities, and measure how well each model-harness pair supports them. Based on these capability profiles, we propose SkillRT, a compilation and runtime system designed for portable and efficient skill execution. At compile time, SkillRT performs capability-based compilation, environment binding, and concurrency extraction. At runtime, SkillRT applies JIT code solidification and adaptive recompilation for performance optimization. We evaluate SkillRT across eight LLMs of varying scales and three agent harnesses, covering SkillsBench and representative skill tasks. Results demonstrate that SkillRT significantly improves task completion rates across different models and environments while reducing token consumption by up to 40%. In terms of performance, SkillRT achieves up to 3.2x speedup with enhanced parallelism, and 19-50x latency reduction through code solidification.
Abstract（参考訳）: LLMエージェントは、再利用可能な構成単位としてスキルをますます採用している。スキルは多様なエージェントプラットフォーム間で共有されるが、現在のシステムはそれらを生のコンテキストとして扱う。この脆弱さは、スキルポータビリティと実行効率を損なう。この課題に対処するため、118,000のスキルを分析し、従来のコンパイラ設計からインスピレーションを得る。スキルをコードとして、LLMをヘテロジニアスプロセッサとして扱います。ポータビリティを実用的なものにするために、スキルの要件を一連のプリミティブな機能に分解し、各モデルとハーネスのペアがどれだけうまくそれらをサポートするかを測定する。これらの機能プロファイルに基づいて,ポータブルかつ効率的なスキル実行用に設計されたコンパイルおよびランタイムシステムであるSkillRTを提案する。コンパイル時に、SkillRTは機能ベースのコンパイル、環境バインディング、並行性抽出を実行する。実行時に、SkillRTはJITコードの固化と、パフォーマンス最適化のための適応的な再コンパイルを適用します。我々は,SkillsBenchと代表的スキルタスクをカバーする8つのLLMと3つのエージェントハーネスでSkillRTを評価した。その結果、SkillRTはさまざまなモデルや環境におけるタスク完了率を大幅に改善し、トークン消費量を最大40%削減した。パフォーマンス面では、SkillRTは並列性を強化して最大3.2倍のスピードアップを実現し、コードの固化による19-50倍のレイテンシ低減を実現している。

論文の概要: SkillRT: Compiling Skills for Efficient Execution Everywhere

関連論文リスト