Fugu-MT paper translation (abstract): Evaluating Classical Software Process Models as Coordination Mechanisms for LLM-Based Software Generation

Paper overview: Evaluating Classical Software Process Models as Coordination Mechanisms for LLM-Based Software Generation

arxiv url: http://arxiv.org/abs/2509.13942v1
Date: Wed, 17 Sep 2025 13:11:49 GMT
Status: translation complete
Last updated in system: 2025-09-18 18:41:50.849035
Title: Evaluating Classical Software Process Models as Coordination Mechanisms for LLM-Based Software Generation
Authors: Duc Minh Ha, Phu Trac Kien, Tho Quan, Anh Nguyen-Duc
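Abstract summary: This study examines how traditional software development processes can be adapted as coordination scaffolds for Large Language Model (LLM)-based multi-agent systems (MAS). The authors executed 11 diverse software projects under three process models and four GPT variants, totaling 132 runs. Both process model and LLM choice significantly affected system performance. Waterfall was the most efficient, V-Model produced the most verbose code, and Agile achieved the highest code quality.
Reference score (attention score computed independently by this site): 4.583390874772685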
License: http://creativecommons.org/licenses/by/4.0/
Abstract: [Background] Large Language Model (LLM)-based multi-agent systems (MAS) are transforming software development by enabling autonomous collaboration. Classical software processes such as Waterfall, V-Model, and Agile offer structured coordination patterns that can be repurposed to guide these agent interactions. [Aims] This study explores how traditional software development processes can be adapted as coordination scaffolds for LLM-based MAS and examines their impact on code quality, cost, and productivity. [Method] We executed 11 diverse software projects under three process models and four GPT variants, totaling 132 runs. Each output was evaluated using standardized metrics for size (files, LOC), cost (execution time, token usage), and quality (code smells, AI- and human-detected bugs). [Results] Both process model and LLM choice significantly affected system performance. Waterfall was most efficient, V-Model produced the most verbose code, and Agile achieved the highest code quality, albeit at higher computational cost. [Conclusions] Classical software processes can be effectively instantiated in LLM-based MAS, but each entails trade-offs across quality, cost, and adaptability. Process selection should reflect project goals, whether prioritizing efficiency, robustness, or structured validation.
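The run count in the abstract can be checked with a short sketch. It assumes one run per combination of project, process model, and GPT variant; the abstract does not say so, but the total (11 × 3 × 4 = 132) is consistent with it. The process model names come from the abstract. The project and variant labels are hypothetical placeholders, since the abstract gives only their counts.

```python
from itertools import product

# Process models named in the abstract.
PROCESS_MODELS = ["Waterfall", "V-Model", "Agile"]

# Hypothetical placeholder labels: the abstract states only the counts
# (11 projects, 4 GPT variants), not their names.
PROJECTS = [f"project_{i:02d}" for i in range(1, 12)]
LLM_VARIANTS = [f"gpt_variant_{i}" for i in range(1, 5)]


def build_run_grid(projects, process_models, llm_variants):
    """Return one (project, process model, LLM variant) tuple per run.

    Assumes a full factorial design with a single run per combination.
    """
    return list(product(projects, process_models, llm_variants))


runs = build_run_grid(PROJECTS, PROCESS_MODELS, LLM_VARIANTS)
print(len(runs))  # 11 * 3 * 4 = 132
```

Under this assumption each process model is exercised in 44 runs and each GPT variant in 33, which is the grouping a per-process or per-model comparison would rely on.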

Related papers