Fugu-MT Paper Translation (Abstract): From LLMs to Agents in Programming: The Impact of Providing an LLM with a Compiler

Paper overview: From LLMs to Agents in Programming: The Impact of Providing an LLM with a Compiler

arxiv url: http://arxiv.org/abs/2601.12146v2
Date: Fri, 23 Jan 2026 08:51:50 GMT
Status: Translation complete
Last updated in system: 2026-01-26 14:27:27.290796
Title: From LLMs to Agents in Programming: The Impact of Providing an LLM with a Compiler
Authors: Viktor Kjellberg, Miroslaw Staron, Farnaz Fotrousi
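Abstract summary: Large language models have demonstrated remarkable capabilities in natural language, program generation, and software development. This paper studies the degree to which LLM-based agents benefit from access to software development tools, such as the gcc compiler. The authors evaluate how integration with a compiler shifts the role of the language model from a passive generator to an active agent capable of iteratively developing runnable programs based on feedback from the compiler.
Reference score (independently computed attention score): 2.7400724993677703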
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models have demonstrated a remarkable capability in natural language and program generation and software development. However, the source code generated by LLMs does not always meet quality requirements and may fail to compile. Therefore, many studies have moved toward agents that can reason about the problem before generating the source code for the solution. The goal of this paper is to study the degree to which such agents benefit from access to software development tools, in our case, a gcc compiler. We conduct a computational experiment on the RosettaCode dataset, on 699 programming tasks in C. We evaluate how the integration with a compiler shifts the role of the language model from a passive generator to an active agent capable of iteratively developing runnable programs based on feedback from the compiler. We evaluated 16 language models with sizes ranging from small (135 million) to medium (3 billion) and large (70 billion). Our results show that access to a compiler improved compilation success by 5.3 to 79.4 percentage points without affecting the semantics of the generated program. Syntax errors dropped by 75%, and errors related to undefined references dropped by 87% for the tasks where the agents outperformed the baselines. We also observed that in some cases, smaller models with a compiler outperform larger models with a compiler. We conclude that it is essential for LLMs to have access to software engineering tools to enhance their performance and reduce the need for large models in software engineering, which in turn can reduce our energy footprint.
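The abstract describes an agent that iteratively repairs a program using compiler feedback but does not spell out the loop. The following is a minimal sketch of what such a loop could look like, not the authors' implementation. The names `compile_with_gcc`, `agent_loop`, `generate`, and `max_attempts` are hypothetical, and the plain `gcc file.c -o out` invocation is an assumption about how the compiler might be called.

```python
import os
import subprocess
import tempfile
from typing import Callable, Optional, Tuple


def compile_with_gcc(source: str) -> Tuple[bool, str]:
    """Compile C source with gcc and return (success, compiler diagnostics).

    Assumption: gcc is on PATH and no extra flags are needed.
    """
    with tempfile.TemporaryDirectory() as tmp:
        src_path = os.path.join(tmp, "prog.c")
        out_path = os.path.join(tmp, "prog")
        with open(src_path, "w") as f:
            f.write(source)
        result = subprocess.run(
            ["gcc", src_path, "-o", out_path],
            capture_output=True,
            text=True,
        )
        return result.returncode == 0, result.stderr


def agent_loop(
    task: str,
    generate: Callable[[str, Optional[str]], str],
    compile_fn: Callable[[str], Tuple[bool, str]] = compile_with_gcc,
    max_attempts: int = 5,
) -> Tuple[Optional[str], int]:
    """Ask the model for code, compile it, and feed diagnostics back on failure.

    `generate(task, feedback)` is a hypothetical stand-in for the LLM call;
    `feedback` is None on the first attempt and the compiler output afterwards.
    Returns (compiling source or None, number of attempts used).
    """
    feedback: Optional[str] = None
    for attempt in range(1, max_attempts + 1):
        source = generate(task, feedback)
        ok, diagnostics = compile_fn(source)
        if ok:
            return source, attempt
        feedback = diagnostics  # the next prompt includes the compiler errors
    return None, max_attempts
```

Calling `agent_loop` with `max_attempts=1` corresponds to the passive-generator baseline (a single generation with no feedback), while larger values give the model a chance to react to compiler errors. The default `compile_fn` requires gcc to be installed; any function with the same signature can be substituted.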


Related papers