Fugu-MT Paper Translation (Summary): From Multi-Agent to Single-Agent: When Is Skill Distillation Beneficial?

Paper summary: From Multi-Agent to Single-Agent: When Is Skill Distillation Beneficial?

arxiv url: http://arxiv.org/abs/2604.01608v1
Date: Thu, 02 Apr 2026 04:38:32 GMT
Status: Translation complete
Last updated in system: 2026-04-03 14:21:10.273035
Title: From Multi-Agent to Single-Agent: When Is Skill Distillation Beneficial?
Title (reference translation): From Multi-Agent to Single-Agent: When Is Skill Distillation Beneficial?
Authors: Binyan Xu, Dong Fang, Haitao Li, Kehuan Zhang
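Abstract summary: Multi-agent systems (MAS) tackle complex tasks by distributing expertise, though this often comes at the cost of heavy coordination overhead. We show that skill utility is governed not by the task but by the evaluation metric. We introduce Metric Freedom ($F$), the first a priori predictor of skill utility.
Reference score (independently computed attention score): 6.434750896227443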
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Multi-agent systems (MAS) tackle complex tasks by distributing expertise, though this often comes at the cost of heavy coordination overhead, context fragmentation, and brittle phase ordering. Distilling a MAS into a single-agent skill can bypass these costs, but this conversion lacks a principled answer for when and what to distill. Instead, the empirical outcome is surprisingly inconsistent: skill lift ranges from a 28% improvement to a 2% degradation across metrics of the exact same task. In this work, we reveal that skill utility is governed not by the task, but by the evaluation metric. We introduce Metric Freedom ($F$), the first a priori predictor of skill utility. $F$ measures the topological rigidity of a metric's scoring landscape by quantifying how output diversity couples with score variance via a Mantel test. Guided by $F$, we propose a two-stage adaptive distillation framework. Stage 1 acts as a selective extraction mechanism, extracting tools and knowledge while discarding restrictive structures on "free" metrics to preserve exploration. Stage 2 targets computationally intensive iterative refinement exclusively toward "rigid" metrics ($F \lesssim 0.6$) to eliminate trajectory-local overfitting. Evaluating across 4 tasks, 11 datasets, and 6 metrics, $F$ strongly predicts skill utility ($\rho = -0.62$, $p < 0.05$). Strikingly, identical agent trajectories yield diametrically opposite skill lifts under rigid versus free metrics, demonstrating that skill utility is fundamentally a metric-level property. Driven by this signal, our adaptive agent matches or exceeds the original MAS while reducing cost by up to $8\times$ and latency by up to $15\times$.
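Abstract (reference translation): Multi-agent systems (MAS) tackle complex tasks by distributing expertise, but this comes at the cost of heavy coordination overhead, context fragmentation, and brittle phase ordering. Distilling a MAS into a single-agent skill can avoid these costs, but the conversion lacks a principled answer for when and what to distill. Skill lift ranges from a 28% improvement to a 2% degradation across metrics of the exact same task. In this work, we show that skill utility is governed not by the task but by the evaluation metric. We introduce Metric Freedom ($F$), the first a priori predictor of skill utility. $F$ measures the topological rigidity of a metric's scoring landscape by quantifying, via a Mantel test, how output diversity couples with score variance. Guided by $F$, we propose a two-stage adaptive distillation framework. Stage 1 acts as a selective extraction mechanism: it extracts tools and knowledge while discarding restrictive structures on "free" metrics to preserve exploration. Stage 2 targets computationally intensive iterative refinement exclusively at "rigid" metrics ($F \lesssim 0.6$) to eliminate trajectory-local overfitting. Evaluated across 4 tasks, 11 datasets, and 6 metrics, $F$ strongly predicts skill utility ($\rho = -0.62$, $p < 0.05$). Strikingly, identical agent trajectories yield diametrically opposite skill lifts under rigid versus free metrics, demonstrating that skill utility is fundamentally a metric-level property. Driven by this signal, our adaptive agent matches or exceeds the original MAS while reducing cost by up to $8\times$ and latency by up to $15\times$.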
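
The abstract says that $F$ is built on a Mantel test relating output diversity to score variance, but it does not give the formula. As a rough illustration only, the sketch below implements a generic permutation-based Mantel test between two distance matrices using NumPy. The helper names (`mantel_test`, `pairwise_abs_diff`), the one-dimensional "output features", and the toy scores are all hypothetical, and how the paper turns the Mantel statistic into $F$ is not stated here, so the code stops at the correlation and its p-value.

```python
import numpy as np


def mantel_test(dist_a, dist_b, permutations=999, seed=0):
    """Generic Mantel test: Pearson r between two distance matrices,
    with a two-sided permutation p-value. Not the paper's definition of F."""
    a = np.asarray(dist_a, dtype=float)
    b = np.asarray(dist_b, dtype=float)
    if a.ndim != 2 or a.shape[0] != a.shape[1] or a.shape != b.shape:
        raise ValueError("expected two square matrices of the same shape")

    n = a.shape[0]
    upper = np.triu_indices(n, k=1)  # each unordered pair counted once
    b_flat = b[upper]
    r_obs = np.corrcoef(a[upper], b_flat)[0, 1]

    rng = np.random.default_rng(seed)
    hits = 0
    for _ in range(permutations):
        perm = rng.permutation(n)
        # Permute rows and columns of one matrix together.
        a_perm = a[np.ix_(perm, perm)]
        r_perm = np.corrcoef(a_perm[upper], b_flat)[0, 1]
        if abs(r_perm) >= abs(r_obs):
            hits += 1

    # Add-one correction so the p-value is never exactly zero.
    p_value = (hits + 1) / (permutations + 1)
    return r_obs, p_value


def pairwise_abs_diff(values):
    """Distance matrix of absolute differences for 1-D values (toy choice)."""
    v = np.asarray(values, dtype=float)
    return np.abs(v[:, None] - v[None, :])


# Hypothetical toy data: six sampled outputs reduced to one feature each.
output_features = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
coupled_scores = [0.0, 0.2, 0.4, 0.6, 0.8, 1.0]    # score tracks the output
scattered_scores = [0.5, 0.1, 0.9, 0.3, 0.8, 0.2]  # score ignores the ordering

output_dist = pairwise_abs_diff(output_features)
r_coupled, p_coupled = mantel_test(output_dist, pairwise_abs_diff(coupled_scores))
r_scattered, p_scattered = mantel_test(output_dist, pairwise_abs_diff(scattered_scores))

print(f"coupled:   r={r_coupled:.3f}, p={p_coupled:.3f}")
print(f"scattered: r={r_scattered:.3f}, p={p_scattered:.3f}")
```

In this toy setup, the coupled scores give distance matrices that are exact multiples of each other, so the Mantel correlation is essentially 1. The scattered scores give a weaker correlation. Whether high coupling corresponds to a "rigid" or a "free" metric in the paper's terminology should be checked against the paper itself.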

Related papers