Fugu-MT 論文翻訳(概要): No Accidental Software Agent First Canonical Code for Human Code Entropy Reduction and 30 to 500 times Lower Frontier Model Requirements

論文の概要: No Accidental Software Agent First Canonical Code for Human Code Entropy Reduction and 30 to 500 times Lower Frontier Model Requirements

arxiv url: http://arxiv.org/abs/2606.14357v1
Date: Fri, 12 Jun 2026 11:35:54 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-15 16:00:42.878771
Title: No Accidental Software Agent First Canonical Code for Human Code Entropy Reduction and 30 to 500 times Lower Frontier Model Requirements
Title（参考訳）: コードエントロピー削減のためのアクシデントソフトウェアエージェント第一標準コードとフロンティアモデル要件の30～500倍
Authors: Jepson Taylor,
Abstract要約: 本稿では,日常的な製品ソフトウェアを標準的行動プロファイルに書き換えるエビデンスキャリング基板を提案する。除去可能な事故は、残余の新規性、証拠、ガバナンス、リスク、将来のオプション性が支配されるまで減少する。 Qwen2.5-Coder-14BのQLoRA実験は、64,088の標準軌道が学習可能であり、試験された禁止言語マーカーを抑えることを示した。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: Frontier coding models may spend substantial capacity learning not only program behavior, but also accidental entropy in human repositories. Such repositories contain valuable signals: tests, incidents, migrations, edge cases, product judgment, and operational history. These signals are entangled with framework churn, naming drift, generated-source ambiguity, dependency rituals, CI dialects, weak proof routes, and human-oriented review customs. We propose agent-first canonical code, a proof-carrying substrate that rewrites routine product software into canonical behavior profiles, typed change algebra, proof lanes, constrained edit grammars, semantic patch cells, runtime negative memory, and proof-carrying change objects. The core hypothesis is that quotienting software by behavior equivalence under a declared oracle can collapse equivalent encodings into governed representatives with explicit evidence and proof obligations. The endpoint is amortized cost per verified correct change, including source, context, reasoning, tools, verification, security, provenance, review, failed loops, defects, and foundry cost under a common oracle. Reported reduction bands are hypotheses, not measured frontier results. The proposed limit is a No-Accident Horizon: removable accident decreases until residual novelty, evidence, governance, risk, and future optionality dominate. For supported routine-product distributions, this gives a defensible planning target near 100-fold all-in cost reduction, not a guarantee for all software. Preliminary QLoRA experiments on Qwen2.5-Coder-14B show that 64,088 canonical trajectories are learnable and suppress tested forbidden-language markers, but do not establish behavior preservation, scaling economics, or verified-change cost. The contribution is a falsifiable program centered on minimum functional description length and verified-change cost.
Abstract（参考訳）: フロンティアコーディングモデルは、プログラムの振る舞いだけでなく、人間のリポジトリにおける偶発的エントロピーについてもかなりの能力学習に費やす可能性がある。このようなリポジトリには、テスト、インシデント、マイグレーション、エッジケース、製品判断、運用履歴といった、貴重なシグナルが含まれている。これらのシグナルには、フレームワークのチャーン、命名のドリフト、生成されたソースの曖昧さ、依存関係の儀式、CI方言、弱い証明ルート、人間指向のレビュー習慣が絡み合っている。提案するエージェントファースト・カノニカル・コード(エージェントファースト・カノニカル・コード)は,通常の製品ソフトウェアを標準動作プロファイル,型付き変更代数,証明レーン,制約付き編集文法,セマンティック・パッチ・セル,実行時負メモリ,証明型変更オブジェクトに書き換える。中心となる仮説は、宣言された託宣の下での行動等価性によるソフトウェアの引用は、明確な証拠と証明義務を持つ支配的な代表者に等価なエンコーディングを崩壊させる可能性がある、というものである。エンドポイントは、ソース、コンテキスト、推論、ツール、検証、セキュリティ、証明、レビュー、失敗ループ、欠陥、ファウンデーリコストなど、検証済みの正しい変更毎に償却される。報告されている還元帯は仮説であり、測定されたフロンティアの結果ではない。除去可能な事故は、残余の新規性、証拠、ガバナンス、リスク、将来のオプション性が支配されるまで減少する。サポート対象の定期的な製品分布では、全ソフトウェアの保証ではなく、100倍近い全コスト削減を目標とする。 Qwen2.5-Coder-14BにおけるQLoRA実験は、64,088個の標準軌跡が学習可能であり、テストされた禁止言語マーカーを抑えるが、行動保存、スケーリング経済、検証された変更コストは確立しないことを示した。コントリビューションは、最小機能記述長と検証-変更コストを中心とした、偽装可能なプログラムである。

論文の概要: No Accidental Software Agent First Canonical Code for Human Code Entropy Reduction and 30 to 500 times Lower Frontier Model Requirements

関連論文リスト