Fugu-MT 論文翻訳(概要): What makes a harness a harness: necessary and sufficient conditions for an agent harness

論文の概要: What makes a harness a harness: necessary and sufficient conditions for an agent harness

arxiv url: http://arxiv.org/abs/2606.10106v1
Date: Mon, 08 Jun 2026 19:35:37 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-10 15:40:58.157137
Title: What makes a harness a harness: necessary and sufficient conditions for an agent harness
Title（参考訳）: ハーネスをハーネスにするもの--エージェントハーネスに必要な十分な条件
Authors: Sanderson Oliveira de Macedo,
Abstract要約: エージェントハーネスという用語は、生成人工知能を用いたソフトウェア工学において広く流通している。本稿では,エージェントハーネスとなるシステムに必要な,十分な条件を記述した定義を提案する。この貢献はエージェントハーネスの運用定義であり、共通語彙を持ち、エンジニアリングの実践を導くことができる。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The term agent harness now circulates widely in software engineering with generative artificial intelligence. It names the layer that wraps a language model and turns it into a coding agent able to act on a repository. The usage is loose and polysemous. Sometimes the term denotes the whole product (Claude Code, Codex CLI); sometimes it denotes the evaluation scaffold that runs an agent against tasks (the SWE-bench harness); sometimes it gets conflated with an agent framework, an SDK, an IDE plugin, or an orchestrator. What is missing is a reference definition that works as an instrument, one that includes and excludes cases consistently. We build that definition through a conceptual analysis that combines works with persistent identifiers and primary grey-literature sources, such as official documentation, glossaries, and engineering reports. We reconstruct the genealogy of the term, from the horse's tack to the classic test harness, to the machine-learning evaluation harness, and finally to the agent harness. We then propose a constitutive definition that states the necessary and sufficient conditions for a system to be an agent harness, we operationalize it as an inclusion and exclusion test, and we draw the boundary of the concept against an agent framework, an agent SDK, an IDE plugin, an eval harness, and an orchestrator. We apply the definition to six real harnesses (Claude Code, Codex CLI, Aider, Cline, OpenHands, and SWE-agent) and to deliberate edge cases; the test includes and excludes consistently. We close with a research agenda organized by design tension axes. The contribution is an operational definition of agent harness, with a shared vocabulary, able to guide engineering practice and the scientific comparison of agentic systems.
Abstract（参考訳）: エージェントハーネスという用語は現在、生成人工知能を使ったソフトウェア工学において広く流通している。言語モデルをラップし、それをレポジトリで動作可能なコーディングエージェントに変換するレイヤを名付ける。用途は緩く多様である。時々、この用語は製品全体を表す(Claude Code、Codex CLI、SWE-benchのハーネス)。また、エージェントフレームワーク、SDK、IDEプラグイン、オーケストレータと混同されることもある。欠けているのは、ケースを一貫して含んで除外する、インスツルメンテーションとして機能する参照定義です。我々は、その定義を、永続的な識別子と公式文書、用語集、エンジニアリングレポートなどの主要な灰色文字ソースと組み合わせた概念分析によって構築する。馬のタックから古典的なテストハーネス、機械学習評価ハーネス、最後にエージェントハーネスまで、この用語の系譜を再構築する。次に,エージェント・ハーネスであるシステムに必要な十分な条件を記述した構成的定義を提案し,それを包括的かつ排除的テストとして運用し,エージェント・フレームワーク,エージェント・SDK,IDEプラグイン,エバル・ハーネス,オーケストレータに対して概念の境界線を描く。定義は6つの実ハーネス(Claude Code、Codex CLI、Aider、Cline、OpenHands、SWE-agent)と、意図的にエッジケースに適用します。われわれは、デザインの緊張軸によって組織された研究の議題を締めくくっている。この貢献はエージェントハーネスの運用的定義であり、共有語彙を持ち、エンジニアリングの実践とエージェントシステムの科学的比較をガイドすることができる。

論文の概要: What makes a harness a harness: necessary and sufficient conditions for an agent harness

関連論文リスト