Fugu-MT Paper Translation (Abstract): On Integrating Large Language Models and Scenario-Based Programming for Improving Software Reliability

Paper overview: On Integrating Large Language Models and Scenario-Based Programming for Improving Software Reliability

arxiv url: http://arxiv.org/abs/2509.09194v1
Date: Thu, 11 Sep 2025 07:10:25 GMT
Status: Translation complete
Last updated in system: 2025-09-12 16:52:24.25662
Title: On Integrating Large Language Models and Scenario-Based Programming for Improving Software Reliability
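Title (reference translation): On Integrating Large Language Models and Scenario-Based Programming to Improve Software Reliability
Authors: Ayelet Berzack, Guy Katz
Abstract summary: Large language models (LLMs) are rapidly becoming indispensable tools for software developers. However, LLMs often introduce significant errors and present incorrect code with persuasive confidence. This work proposes a methodology for combining LLMs with traditional software engineering techniques in a structured way.
Reference score (independently computed attention measure): 2.2058293096044586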
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large Language Models (LLMs) are fast becoming indispensable tools for software developers, assisting or even partnering with them in crafting complex programs. The advantages are evident -- LLMs can significantly reduce development time, generate well-organized and comprehensible code, and occasionally suggest innovative ideas that developers might not conceive on their own. However, despite their strengths, LLMs will often introduce significant errors and present incorrect code with persuasive confidence, potentially misleading developers into accepting flawed solutions. In order to bring LLMs into the software development cycle in a more reliable manner, we propose a methodology for combining them with "traditional" software engineering techniques in a structured way, with the goal of streamlining the development process, reducing errors, and enabling users to verify crucial program properties with increased confidence. Specifically, we focus on the Scenario-Based Programming (SBP) paradigm -- an event-driven, scenario-based approach for software engineering -- to allow human developers to pour their expert knowledge into the LLM, as well as to inspect and verify its outputs. To evaluate our methodology, we conducted a significant case study, and used it to design and implement the Connect4 game. By combining LLMs and SBP we were able to create a highly-capable agent, which could defeat various strong existing agents. Further, in some cases, we were able to formally verify the correctness of our agent. Finally, our experience reveals interesting insights regarding the ease-of-use of our proposed approach. The full code of our case-study will be made publicly available with the final version of this paper.
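Abstract (reference translation): Large language models (LLMs) are rapidly becoming indispensable tools for software developers. LLMs significantly reduce development time and generate well-organized, comprehensible code. Despite these strengths, however, LLMs often introduce significant errors and present incorrect code with persuasive confidence, which can mislead developers into accepting flawed solutions. To bring LLMs into the software development cycle more reliably, we propose a methodology for combining them with "traditional" software engineering techniques in a structured way, with the goal of streamlining the development process, reducing errors, and enabling users to verify crucial program properties with increased confidence. Specifically, we focus on the Scenario-Based Programming (SBP) paradigm, an event-driven, scenario-based approach to software engineering, which lets human developers pour their expert knowledge into the LLM and also inspect and verify its outputs. To evaluate the methodology, we conducted a significant case study and used it to design and implement the Connect4 game. By combining LLMs and SBP, we were able to create a highly capable agent that could defeat various strong existing agents. Furthermore, in some cases we were able to formally verify the correctness of our agent. Finally, our experience reveals interesting insights regarding the ease of use of the proposed approach. The full code of the case study will be made publicly available with the final version of this paper.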

Related papers