Fugu-MT 論文翻訳(概要): HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry

論文の概要: HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry

arxiv url: http://arxiv.org/abs/2606.14249v1
Date: Fri, 12 Jun 2026 08:27:11 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-15 16:00:42.827877
Title: HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry
Title（参考訳）: HarnessX: 構成可能で適応的で進化可能なエージェントHarness Foundry
Authors: Tingyang Chen, Shuo Lu, Kang Zhao, Weicheng Meng, Hanlin Teng, Tianhao Li, Chao Li, Xule Liu, Jian Liang, Zhizhong Zhang, Yuan Xie, Heng Qu, Kun Shao, Jian Luan,
Abstract要約: HarnessXは、構成可能、適応可能、進化可能なエージェントハーネス用のファウンダリーである。型付きハーネスプリミティブを置換代数学で組み立て、AIGISを介して適応する。軌道をハーネス更新とモデルトレーニング信号の両方に変換することでハーネスモデルループを閉じる。
参考スコア（独自算出の注目度）: 35.87794858139959
License: http://creativecommons.org/licenses/by/4.0/
Abstract: AI agent performance depends critically on the runtime harness, comprising the prompts, tools, memory, and control flow that mediate how a model observes, reasons, and acts. Yet today's harnesses remain largely hand-crafted and static: each new model or task still demands bespoke scaffolding, and the rich traces produced during execution are rarely distilled back into systematic improvement. We introduce HarnessX, a foundry for composable, adaptive, and evolvable agent harnesses. HarnessX assembles typed harness primitives via a substitution algebra, adapts them through AEGIS, a trace-driven multi-agent evolution engine grounded in an operational mirror between symbolic adaptation and reinforcement learning, and closes the harness-model loop by turning trajectories into both harness updates and model training signal. Across five benchmarks (ALFWorld, GAIA, WebShop, tau^3-Bench, and SWE-bench Verified), HarnessX yields an average gain of +14.5% (up to +44.0%), with gains largest where baselines are lowest. These results suggest that agent progress need not come from model scaling alone: composing and evolving runtime interfaces from execution feedback is an actionable and complementary lever. The complete codebase will be open-sourced in a future release.
Abstract（参考訳）: AIエージェントのパフォーマンスは、モデルがどのように観察、理由、動作を行うかを仲介するプロンプト、ツール、メモリ、制御フローを含むランタイムハーネスに大きく依存する。しかし、今日のハーネスは、主に手作りで静的であり、新しいモデルやタスクは、未だに、スキャフォールディングを必要としており、実行中に生成された豊富なトレースは、体系的な改善のために蒸留されることはめったにない。本稿では, コンポーザブル, アダプティブ, 進化可能なエージェントハーネスのためのファウンダリーであるHarnessXを紹介する。 HarnessXは、置換代数を介して型付きハーネスプリミティブを組み立て、シンボル適応と強化学習の間の運用ミラーに基礎を置くトレース駆動マルチエージェント進化エンジンであるAEGISを介してそれらを適応させ、軌道をハーネス更新とモデルトレーニング信号の両方に変換することでハーネスモデルループを閉じる。 5つのベンチマーク(ALFWorld, GAIA, WebShop, tau^3-Bench, SWE-bench Verified)で、HarnessXの平均利得は+14.5%(最大+44.0%)で、ベースラインが最も低い。これらの結果は、エージェントの進捗がモデルスケーリングだけでは発生しないことを示している。実行時のフィードバックからランタイムインターフェースを合成し、進化させることは、実行可能な補完的なレバーである。完全なコードベースは、将来のリリースでオープンソース化される予定だ。

論文の概要: HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry

関連論文リスト