Fugu-MT 論文翻訳(概要): Chain of Unit-Physics: A Primitive-Centric Approach to Scientific Code Synthesis

論文の概要: Chain of Unit-Physics: A Primitive-Centric Approach to Scientific Code Synthesis

arxiv url: http://arxiv.org/abs/2512.01010v1
Date: Sun, 30 Nov 2025 18:16:50 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-02 19:46:34.538357
Title: Chain of Unit-Physics: A Primitive-Centric Approach to Scientific Code Synthesis
Title（参考訳）: 単位物理の連鎖:科学コード合成への原始的中心的アプローチ
Authors: Vansh Sharma, Venkat Raman,
Abstract要約: 第一原理(またはプリミティブ)中心のマルチエージェントシステムで、人間の知識をコード生成を明示的に制約する単体物理学テストとしてコード化します。データセットやモデルが進化するにつれて、ゼロショットコードの精度は向上するが、しかしながら、単位-物理フレームワークの連鎖は、科学的なコードの基本となる第一原理の分析を埋め込むことによってさらに進んでいる。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Agentic large language models are proposed as autonomous code generators for scientific computing, yet their reliability in high-stakes problems remains unclear. Developing computational scientific software from natural-language queries remains challenging broadly due to (a) sparse representation of domain codes during training and (b) the limited feasibility of RLHF with a small expert community. To address these limitations, this work conceptualizes an inverse approach to code design, embodied in the Chain of Unit-Physics framework: a first-principles (or primitives)-centric, multi-agent system in which human expert knowledge is encoded as unit-physics tests that explicitly constrain code generation. The framework is evaluated on a nontrivial combustion task, used here as a representative benchmark for scientific problem with realistic physical constraints. Closed-weight systems and code-focused agentic variants fail to produce correct end-to-end solvers, despite tool and web access, exhibiting four recurrent error classes: interface (syntax/API) hallucinations, overconfident assumptions, numerical/physical incoherence, and configuration fragility. Open-weight models with chain-of-thought (CoT) decoding reduce interface errors but still yield incorrect solutions. On the benchmark task, the proposed framework converges within 5-6 iterations, matches the human-expert implementation (mean error of $3.1\times10^{-3}$ %), with a $\sim$33.4 % faster runtime and a $\sim$30 % efficient memory usage at a cost comparable to mid-sized commercial APIs, yielding a practical template for physics-grounded scientific code generation. As datasets and models evolve, zero-shot code accuracy will improve; however, the Chain of Unit-Physics framework goes further by embedding first-principles analysis that is foundational to scientific codes.
Abstract（参考訳）: エージェント型大規模言語モデルは、科学計算のための自律型コード生成器として提案されているが、その信頼性は未定である。自然言語クエリによる計算科学ソフトウェアの開発は、いまだに幅広い課題を抱えている。 (a)訓練中及び訓練中のドメインコードの疎らな表現 (b)RLHFが小規模な専門家コミュニティに限定可能であること。これらの制限に対処するために、この研究は、コード設計の逆アプローチを概念化し、コード生成を明示的に制約する単体物理テストとして人間の知識がコード化される第一原理(またはプリミティブ)中心のマルチエージェントシステムである、単体物理フレームワークの連鎖に具体化している。このフレームワークは非自明な燃焼タスクで評価され、現実的な物理的制約を伴う科学的問題の代表的なベンチマークとしてここで使用される。クローズドウェイトシステムとコード中心のエージェント変種は、ツールとWebアクセスにもかかわらず、正しいエンドツーエンドの解法を生成することができず、繰り返し発生する4つのエラークラス:インタフェース(構文/API)幻覚、過信的な仮定、数値的/物理的不整合、構成上の脆弱性を示す。チェーン・オブ・シンクレット(CoT)デコードによるオープンウェイトモデルでは、インターフェースエラーは低減されるが、誤った解が得られる。ベンチマークタスクでは、提案されたフレームワークは5～6イテレーション以内に収束し、人間-専門家による実装(平均誤差は3.1\times10^{-3}$ %)と、$\sim$33.4%、$\sim$30 %の効率的なメモリ使用率と、中規模の商用APIに匹敵するコストで一致し、物理地上の科学コード生成のための実用的なテンプレートを生成する。データセットやモデルが進化するにつれて、ゼロショットコードの精度は向上するが、しかしながら、単位-物理フレームワークの連鎖は、科学的なコードの基本となる第一原理の分析を埋め込むことによってさらに進んでいる。

論文の概要: Chain of Unit-Physics: A Primitive-Centric Approach to Scientific Code Synthesis

関連論文リスト