Fugu-MT 論文翻訳(概要): How Natural Language Proficiency Shapes GenAI Code for Software Engineering Tasks

論文の概要: How Natural Language Proficiency Shapes GenAI Code for Software Engineering Tasks

arxiv url: http://arxiv.org/abs/2511.04115v1
Date: Thu, 06 Nov 2025 07:06:20 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-07 20:17:53.336568
Title: How Natural Language Proficiency Shapes GenAI Code for Software Engineering Tasks
Title（参考訳）: 自然言語の習熟度がソフトウエアエンジニアリングタスクのGenAIコードをどのように形作るか
Authors: Ruksit Rojpaisarnkit, Youmei Fan, Kenichi Matsumoto, Raula Gaikovina Kula,
Abstract要約: 本稿では,Large Language Models (LLMs) が生成するコードの習熟度と正しさに,英語の習熟度自体が影響を及ぼすかを検討する。我々は164のプログラミングタスクにおいて、基本的なプロンプトから高度なプロンプトの英語の習熟度を変化させ、その結果のコード習熟度と正しさを測定した。結果のコード習熟度への影響はモデルに依存したものの、高い習熟率のプロンプトが一貫してすべてのモデルに対してより正確なコードを出力していることが判明した。
参考スコア（独自算出の注目度）: 3.487093088832285
License: http://creativecommons.org/licenses/by/4.0/
Abstract: With the widespread adoption of Foundation Model (FM)-powered tools in software engineering, the natural language prompt has become a critical interface between developers and Large Language Models (LLMs). While much research has focused on prompt structure, the natural language proficiency is an underexplored factor that can influence the quality of generated code. This paper investigates whether the English language proficiency itself independent of the prompting technique affects the proficiency and correctness of code generated by LLMs. Using the HumanEval dataset, we systematically varied the English proficiency of prompts from basic to advanced for 164 programming tasks and measured the resulting code proficiency and correctness. Our findings show that LLMs default to an intermediate (B2) natural language level. While the effect on the resulting code proficiency was model-dependent, we found that higher-proficiency prompts consistently yielded more correct code across all models. These results demonstrate that natural language proficiency is a key lever for controlling code generation, helping developers tailor AI output and improve the reliability of solutions.
Abstract（参考訳）: ソフトウェア工学におけるファンデーションモデル(FM)ベースのツールの普及により、自然言語プロンプトは開発者とLLM(Large Language Models)の間で重要なインターフェースとなっている。多くの研究が迅速な構造に焦点を当てているが、自然言語の習熟度は、生成されたコードの品質に影響を与える未発見の要素である。本稿では,英語の習熟度自体がLLMの習熟度や正確性に影響を及ぼすかどうかを考察する。我々はHumanEvalデータセットを用いて、164のプログラミングタスクの基本から高度なプロンプトの英語の習熟度を体系的に変化させ、その結果のコード習熟度と正しさを測定した。以上の結果から, LLM は中間言語レベル (B2) にデフォルトであることがわかった。結果のコード習熟度への影響はモデルに依存したものの、高い習熟率のプロンプトが一貫してすべてのモデルに対してより正確なコードを出力していることが判明した。これらの結果は、自然言語の習熟度がコード生成を制御するためのキーレバーであり、開発者がAI出力をカスタマイズし、ソリューションの信頼性を向上させるのに役立つことを示している。

論文の概要: How Natural Language Proficiency Shapes GenAI Code for Software Engineering Tasks

関連論文リスト