Fugu-MT 論文翻訳(概要): CodeMapper: A Language-Agnostic Approach to Mapping Code Regions Across Commits

論文の概要: CodeMapper: A Language-Agnostic Approach to Mapping Code Regions Across Commits

arxiv url: http://arxiv.org/abs/2511.05205v1
Date: Fri, 07 Nov 2025 12:50:56 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-10 21:00:44.761577
Title: CodeMapper: A Language-Agnostic Approach to Mapping Code Regions Across Commits
Title（参考訳）: CodeMapper: コミッタ間のコード領域マッピングのための言語に依存しないアプローチ
Authors: Huimin Hu, Michael Pradel,
Abstract要約: 開発者は一般的に、特定のコード領域をコミットから別のコミットにマッピングする問題に直面します。 git diffのような既存のテクニックは、開発者の選択するコード領域にフォーカスするのではなく、ファイルに行われたすべての変更を表示するためだけに、この問題に対処する。本稿では,特定のプログラム要素やプログラミング言語に依存しない方法で,コードマッピング問題に対処するアプローチであるCodeMapperを紹介する。
参考スコア（独自算出の注目度）: 14.265317773238529
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: During software evolution, developers commonly face the problem of mapping a specific code region from one commit to another. For example, they may want to determine how the condition of an if-statement, a specific line in a configuration file, or the definition of a function changes. We call this the code mapping problem. Existing techniques, such as git diff, address this problem only insufficiently because they show all changes made to a file instead of focusing on a code region of the developer's choice. Other techniques focus on specific code elements and programming languages (e.g., methods in Java), limiting their applicability. This paper introduces CodeMapper, an approach to address the code mapping problem in a way that is independent of specific program elements and programming languages. Given a code region in one commit, CodeMapper finds the corresponding region in another commit. The approach consists of two phases: (i) computing candidate regions by analyzing diffs, detecting code movements, and searching for specific code fragments, and (ii) selecting the most likely target region by calculating similarities. Our evaluation applies CodeMapper to four datasets, including two new hand-annotated datasets containing code region pairs in ten popular programming languages. CodeMapper correctly identifies the expected target region in 71.0%--94.5% of all cases, improving over the best available baselines by 1.5--58.8 absolute percent points.
Abstract（参考訳）: ソフトウェア進化の間、開発者は一般的に、特定のコード領域をコミットから別のコミットにマッピングするという問題に直面します。例えば、ifステートメントの条件、設定ファイルの特定の行、関数の定義がどう変わるかを決めたい場合もあります。これをコードマッピング問題と呼びます。 git diffのような既存のテクニックは、開発者の選択するコード領域にフォーカスするのではなく、ファイルに行われたすべての変更を表示するためだけに、この問題に対処する。他のテクニックは、特定のコード要素とプログラミング言語(例えば、Javaのメソッド)に焦点を当て、適用性を制限する。本稿では,特定のプログラム要素やプログラミング言語に依存しない方法で,コードマッピング問題に対処するアプローチであるCodeMapperを紹介する。あるコミットでコード領域が与えられた場合、CodeMapperは別のコミットで対応するリージョンを見つける。アプローチは2つのフェーズで構成されます。一差分を解析し、符号の動きを検出し、特定符号の断片を検索して、候補領域を演算すること。二類似性を計算することにより、最も可能性の高い対象領域を選択すること。評価では、CodeMapperを4つのデータセットに適用し、その中には10の人気のあるプログラミング言語のコード領域ペアを含む2つの手書きのデータセットが含まれています。 CodeMapperは、すべてのケースの71.0%--94.5%で予測対象領域を正しく識別し、最高のベースラインを1.5-58.8絶対値で改善している。

論文の概要: CodeMapper: A Language-Agnostic Approach to Mapping Code Regions Across Commits

関連論文リスト