Fugu-MT 論文翻訳(概要): Lifecycle-Aware code generation: Leveraging Software Engineering Phases in LLMs

論文の概要: Lifecycle-Aware code generation: Leveraging Software Engineering Phases in LLMs

arxiv url: http://arxiv.org/abs/2510.24019v1
Date: Tue, 28 Oct 2025 02:54:02 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-29 15:35:36.727666
Title: Lifecycle-Aware code generation: Leveraging Software Engineering Phases in LLMs
Title（参考訳）: ライフサイクル対応コード生成 - LLMにおけるソフトウェアエンジニアリングフェーズの活用
Authors: Xing Xing, Wei Wang, Lipeng Ma, Weidong Yang, Junjie Zheng,
Abstract要約: トレーニング段階と推論段階の両方に中間アーティファクトを組み込んだライフサイクル対応フレームワークを導入する。実験によると、ライフサイクルレベルの微調整は、微調整の前に同じモデルで最大75%の精度でコードの正しさを向上させる。オープンソース LLM は、かつて私たちのフレームワークの下で微調整され、コードで事前訓練されたモデルにマッチするか、わずかに優れています。
参考スコア（独自算出の注目度）: 12.70863561286374
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent progress in large language models (LLMs) has advanced automatic code generation, yet most approaches rely on direct, single-step translation from problem descriptions to code, disregarding structured software engineering practices. We introduce a lifecycle-aware framework that systematically incorporates intermediate artifacts such as requirements analysis, state machine modeling, and pseudocode into both the training and inference stages. This design aligns code generation with standard software development phases and enables more structured reasoning. Experiments show that lifecycle-level fine-tuning improves code correctness by up to 75% over the same model before fine-tuning, with performance gains compounding across intermediate stages. Multi-step inference consistently surpasses single-step generation, demonstrating the effectiveness of intermediate scaffolding. Notably, open-source LLMs, once fine-tuned under our framework, match or slightly outperform models pretrained on code. When applied to DeepSeek-Coder-1.3B, our framework yields relative CodeBLEU improvements of 34.3%, 20.0%, 11.2%, and 22.3% over ChatGPT-3.5, ChatGPT-4o-mini, DeepSeek-R1, and LLaMA-8B, respectively. Our pipeline also proves robust with up to 80\% less training data, confirming its resilience. Ablation studies further reveal that each intermediate artifact contributes distinctly to final code quality, with state machine modeling yielding the most substantial impact. Our source code and detailed experimental data are available at https://anonymous.4open.science/r/Lifecycle-Aware-3CCB.
Abstract（参考訳）: 大規模言語モデル(LLM)の最近の進歩は、高度な自動コード生成が進んでいるが、ほとんどのアプローチは、構造化されたソフトウェアエンジニアリングプラクティスを無視し、問題記述からコードへの直接的な単一ステップの変換に依存している。本稿では,要件分析や状態マシンモデリング,擬似コードといった中間的成果物を,トレーニング段階と推論段階の両方に体系的に組み込むライフサイクルアウェアフレームワークを提案する。この設計は、コード生成を標準的なソフトウェア開発フェーズと整合させ、より構造化された推論を可能にする。実験により、ライフサイクルレベルの微調整は、中間段階にまたがるパフォーマンス向上により、同じモデル上で最大75%のコード正当性を向上することが示された。多段階推論は、一段階生成を一貫して超え、中間足場の有効性を示す。特に、オープンソースのLLMは、かつて私たちのフレームワークの下で微調整され、コードで事前訓練されたモデルにマッチするか、わずかに優れています。 DeepSeek-Coder-1.3Bに適用すると、当社のフレームワークは、ChatGPT-3.5、ChatGPT-4o-mini、DeepSeek-R1、LLaMA-8Bに対して、相対的なCodeBLEUの改善を34.3%、20.0%、11.2%、22.3%とした。当社のパイプラインでは,トレーニングデータの最大80%が堅牢で,レジリエンスの確認も行なっています。アブレーション研究により、各中間アーティファクトが最終的なコード品質に明確に寄与し、状態マシンモデリングが最も実質的な影響をもたらすことが明らかになった。ソースコードと詳細な実験データはhttps://anonymous.4open.science/r/Lifecycle-Aware-3CCBで公開されている。

論文の概要: Lifecycle-Aware code generation: Leveraging Software Engineering Phases in LLMs

関連論文リスト