Fugu-MT 論文翻訳(概要): EcoScratch: Cost-Effective Multimodal Repair for Scratch Using Execution Feedback

論文の概要: EcoScratch: Cost-Effective Multimodal Repair for Scratch Using Execution Feedback

arxiv url: http://arxiv.org/abs/2603.29624v1
Date: Tue, 31 Mar 2026 11:45:36 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-01 15:25:03.584249
Title: EcoScratch: Cost-Effective Multimodal Repair for Scratch Using Execution Feedback
Title（参考訳）: EcoScratch: 実行フィードバックを用いたスクラッチのコスト効果マルチモーダル修復
Authors: Yuan Si, Ming Wang, Daming Li, Hanyuan Shi, Jialu Zhang,
Abstract要約: EcoScratchは、ライトウェイトなランタイム信号を使用して、次の試みがテキストのみであり続けるか、マルチモーダルプロンプトにエスカレートするかを判断する修復パイプラインである。我々は,100個のScratch補修プロジェクトを4つのコントローラ設定で評価し,4800個の補修軌道を得た。最高世代(30.3%)に到達し、同じ有界軌道予算の下での2つの非適応的マルチモーダルベースライン(テキストのみの最低コストフロア)よりも平均コストと局所実行エネルギーを削減した。
参考スコア（独自算出の注目度）: 3.6908036186618314
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Scratch is the most popular programming environment for novices, with over 1.15 billion projects created worldwide. Unlike traditional languages, correctness in Scratch is defined by visible behavior on the stage rather than by code structure alone, so programs that appear correct in the workspace can still fail at runtime due to timing, event ordering, or cross-sprite interactions. Visual execution evidence such as gameplay videos can therefore be essential for diagnosis and repair. However, capturing and processing this evidence inside an automated repair loop introduces substantial overhead. Probing execution, recording stage behavior, rebuilding executable .sb3 projects, and verifying candidate fixes consume time, monetary cost, and resources across an entire repair trajectory rather than a single model call. We present EcoScratch, a repair pipeline that uses lightweight runtime signals to decide whether the next attempt stays text-only or escalates to multimodal prompting. The controller also sets the JSON Patch budget and verification effort, so evidence choice and repair budget are coupled inside the same decision. EcoScratch rebuilds candidate fixes into executable .sb3 projects and records per-trajectory traces, monetary cost, local-runtime energy. We evaluate 12 models on 100 executable Scratch repair projects under four controller settings, yielding 4800 repair trajectories. In this matrix, a selective multimodal policy gives the strongest observed success-cost-energy tradeoff. It reaches the highest generation success (30.3%) while using less average cost and local-runtime energy than the two non-adaptive multimodal baselines under the same bounded trajectory budget; text-only remains the lowest-cost floor. Across the evaluated matrix, multimodal evidence helps most when it is used to control escalation within a bounded trajectory budget rather than applied uniformly.
Abstract（参考訳）: Scratchは初心者向けの最も人気のあるプログラミング環境であり、世界中で15億以上のプロジェクトが作成されている。従来の言語とは異なり、Scratchの正確性はコード構造だけでではなくステージ上の可視的な振る舞いによって定義されているため、ワークスペースで正しいように見えるプログラムは、タイミング、イベント順序、あるいはクロススプライトインタラクションによって実行時に失敗する可能性がある。したがって、ゲームプレイビデオのような視覚的実行証拠は診断と修復に不可欠である。しかし、この証拠を自動修理ループ内でキャプチャして処理することは、かなりのオーバーヘッドをもたらす。実行のプロービング、ステージの振る舞いの記録、実行可能.NETファイルの再構築。 sb3は、単一のモデル呼び出しではなく、修理軌道全体にわたって時間、金銭的コスト、リソースを消費する。 EcoScratchは、ライトウェイトなランタイム信号を使用して、次の試みがテキストのみであり続けるか、マルチモーダルプロンプトにエスカレートするかを判断する修復パイプラインである。コントローラはJSON Patchの予算と検証の労力も設定するので、エビデンスの選択と修復の予算は、同じ決定の中で結合される。 EcoScratchは、候補修正を実行可能な.NETファイルに再構築する。 sb3は、軌跡ごとのプロジェクトと記録、金銭的コスト、ローカル・ランタイム・エネルギ。我々は,100個のScratch補修プロジェクトを4つのコントローラ設定で評価し,4800個の補修軌道を得た。この行列において、選択的なマルチモーダルポリシーは、最も観測された成功-コスト-エネルギーのトレードオフを与える。最高世代(30.3%)に到達し、同じ有界軌道予算の下での2つの非適応的マルチモーダルベースラインよりも平均コストと局所実行エネルギーを少なくし、テキストのみのフロアは最低価格のままである。評価された行列全体にわたって、マルチモーダルなエビデンスは、一様に適用するのではなく、有界軌道予算内でのエスカレーションを制御するのに最も役立ちます。

論文の概要: EcoScratch: Cost-Effective Multimodal Repair for Scratch Using Execution Feedback

関連論文リスト