Fugu-MT 論文翻訳(概要): Architecture-Aware Multi-Design Generation for Repository-Level Feature Addition

論文の概要: Architecture-Aware Multi-Design Generation for Repository-Level Feature Addition

arxiv url: http://arxiv.org/abs/2603.01814v1
Date: Mon, 02 Mar 2026 12:50:40 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-03 19:50:56.876029
Title: Architecture-Aware Multi-Design Generation for Repository-Level Feature Addition
Title（参考訳）: アーキテクチャを考慮したレポジトリレベル特徴付加のためのマルチデザイン生成
Authors: Mingwei Liu, Zhenxi Chen, Zheng Pei, Zihao Wang, Yanlin Wang, Zibin Zheng,
Abstract要約: RAIMは、リポジトリレベルの機能追加のための、多設計およびアーキテクチャ対応のフレームワークである。複数の多様な実装設計を生成することで、線形パッチから切り離される。 NoCode-bench Verifiedデータセットの実験では、RAIMが新しい最先端のパフォーマンスを確立することが示されている。
参考スコア（独自算出の注目度）: 53.50448142467294
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Implementing new features across an entire codebase presents a formidable challenge for Large Language Models (LLMs). This proactive task requires a deep understanding of the global system architecture to prevent unintended disruptions to legacy functionalities. Conventional pipeline and agentic frameworks often fall short in this area because they suffer from architectural blindness and rely on greedy single-path code generation. To overcome these limitations, we propose RAIM, a multi-design and architecture-aware framework for repository-level feature addition. This framework introduces a localization mechanism that conducts multi-round explorations over a repository-scale code graph to accurately pinpoint dispersed cross-file modification targets. Crucially, RAIM shifts away from linear patching by generating multiple diverse implementation designs. The system then employs a rigorous impact-aware selection process based on static and dynamic analysis to choose the most architecturally sound patch and avoid system regressions. Comprehensive experiments on the NoCode-bench Verified dataset demonstrate that RAIM establishes a new state-of-the-art performance with a 39.47% success rate, achieving a 36.34% relative improvement over the strongest baseline. Furthermore, the approach exhibits robust generalization across various foundation models and empowers open-weight models like DeepSeek-v3.2 to surpass baseline systems powered by leading proprietary models. Detailed ablation studies confirm that the multi-design generation and impact validation modules are critical to effectively managing complex dependencies and reducing code errors. These findings highlight the vital role of structural awareness in automated software evolution.
Abstract（参考訳）: コードベース全体にわたって新機能を実装することは、LLM(Large Language Models)にとって非常に難しい課題である。この積極的なタスクは、レガシー機能に対する意図しない破壊を防止するために、グローバルシステムアーキテクチャの深い理解を必要とする。従来のパイプラインとエージェントフレームワークは、アーキテクチャの盲目に悩まされ、欲張りのシングルパスコード生成に依存しているため、この分野では不足することが多い。これらの制限を克服するため,リポジトリレベルの機能追加のためのマルチ設計およびアーキテクチャ対応フレームワークであるRAIMを提案する。このフレームワークは、分散ファイル修正ターゲットを正確に特定するために、リポジトリスケールのコードグラフ上で複数ラウンドの探索を行うローカライズメカニズムを導入している。 RAIMは、複数の多様な実装設計を生成することで、線形パッチから切り離される。次に、静的および動的解析に基づく厳密な影響認識選択プロセスを用いて、最もアーキテクチャ的に健全なパッチを選択し、システムの回帰を回避する。 NoCode-bench Verifiedデータセットに関する包括的な実験は、RAIMが39.47%の成功率で新しい最先端のパフォーマンスを確立し、最強のベースラインに対して36.34%の相対的な改善を達成していることを示している。さらに、このアプローチは様々な基礎モデルに対して堅牢な一般化を示し、DeepSeek-v3.2のようなオープンウェイトモデルにより、主要なプロプライエタリモデルによるベースラインシステムを超えることができる。詳細なアブレーション調査では、複数設計の生成と影響検証モジュールが、複雑な依存関係を効果的に管理し、コードエラーを減らすために重要であることが確認されている。これらの発見は、ソフトウェアの自動進化における構造的認識の重要な役割を浮き彫りにした。

論文の概要: Architecture-Aware Multi-Design Generation for Repository-Level Feature Addition

関連論文リスト