Fugu-MT 論文翻訳(概要): How do Humans and LLMs Process Confusing Code?

論文の概要: How do Humans and LLMs Process Confusing Code?

arxiv url: http://arxiv.org/abs/2508.18547v1
Date: Mon, 25 Aug 2025 22:50:55 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-27 17:42:38.6209
Title: How do Humans and LLMs Process Confusing Code?
Title（参考訳）: 人間とLLMはどのようにコードを混同するか?
Authors: Youssef Abdelsalam, Norman Peitek, Anna-Maria Maurer, Mariya Toneva, Sven Apel,
Abstract要約: プログラミングアシスタント(LLM)とプログラマがコードを理解する方法の相違は、誤解や非効率性、コード品質の低下、バグにつながる可能性がある。クリーンで紛らわしいコードを解釈し,LLMを人間プログラマと比較した実証的研究を行った。 LLMの急激なスパイクは、場所と振幅の両方において、混乱を示す人間の神経生理学的反応と相関していることがわかった。
参考スコア（独自算出の注目度）: 10.975229558223964
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Already today, humans and programming assistants based on large language models (LLMs) collaborate in everyday programming tasks. Clearly, a misalignment between how LLMs and programmers comprehend code can lead to misunderstandings, inefficiencies, low code quality, and bugs. A key question in this space is whether humans and LLMs are confused by the same kind of code. This would not only guide our choices of integrating LLMs in software engineering workflows, but also inform about possible improvements of LLMs. To this end, we conducted an empirical study comparing an LLM to human programmers comprehending clean and confusing code. We operationalized comprehension for the LLM by using LLM perplexity, and for human programmers using neurophysiological responses (in particular, EEG-based fixation-related potentials). We found that LLM perplexity spikes correlate both in terms of location and amplitude with human neurophysiological responses that indicate confusion. This result suggests that LLMs and humans are similarly confused about the code. Based on these findings, we devised a data-driven, LLM-based approach to identify regions of confusion in code that elicit confusion in human programmers.
Abstract（参考訳）: 現在、人間とプログラミングアシスタントは、大規模な言語モデル(LLM)に基づいた日々のプログラミングタスクで協力しています。明らかに、LLMとプログラマがコードを理解する方法の相違は誤解、非効率性、コード品質の低下、バグにつながる可能性がある。この領域で重要な問題は、人間とLLMが同じ種類のコードで混同されているかどうかである。これは、ソフトウェアエンジニアリングワークフローにLLMを統合するという私たちの選択を導くだけでなく、LLMの改善の可能性についても知らせてくれるでしょう。この目的のために、クリーンで紛らわしいコードを解釈する人間プログラマとLLMを比較した実証的研究を行った。神経生理学的反応(特に脳波による固定関連電位)を用いた人間のプログラマに対しては,LSMのパープレキシティを用いて,LSMの理解を操作した。 LLMの急激なスパイクは、場所と振幅の両方において、混乱を示す人間の神経生理学的反応と相関していることがわかった。この結果は、LLMと人間も同様にコードについて混乱していることを示唆している。これらの知見に基づいて,人間のプログラマに混乱をもたらすコード内の混乱領域を特定するために,データ駆動型LLMベースのアプローチを開発した。

論文の概要: How do Humans and LLMs Process Confusing Code?

関連論文リスト