Fugu-MT 論文翻訳(概要): Detecting Semantic Clones of Unseen Functionality

論文の概要: Detecting Semantic Clones of Unseen Functionality

arxiv url: http://arxiv.org/abs/2510.04143v1
Date: Sun, 05 Oct 2025 10:45:52 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-07 16:52:59.488364
Title: Detecting Semantic Clones of Unseen Functionality
Title（参考訳）: 目に見えない機能を有する意味クローンの検出
Authors: Konstantinos Kitsios, Francesco Sovrano, Earl T. Barr, Alberto Bacchelli,
Abstract要約: 我々は,未確認機能のクローンを検出するタスクにおいて,タスク固有モデルと生成LDMの両方を含む6つの最先端モデルを再評価する。そこで本研究では,既存モデルの非可視機能のクローン上での性能向上を図るために,コントラッシブ・ラーニング(コントラッシブ・ラーニング)の使用法を提案し,評価する。
参考スコア（独自算出の注目度）: 7.660632979515074
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Semantic code clone detection is the task of detecting whether two snippets of code implement the same functionality (e.g., Sort Array). Recently, many neural models achieved near-perfect performance on this task. These models seek to make inferences based on their training data. Consequently, they better detect clones similar to those they have seen during training and may struggle to detect those they have not. Developers seeking clones are, of course, interested in both types of clones. We confirm this claim through a literature review, identifying three practical clone detection tasks in which the model's goal is to detect clones of a functionality even if it was trained on clones of different functionalities. In light of this finding, we re-evaluate six state-of-the-art models, including both task-specific models and generative LLMs, on the task of detecting clones of unseen functionality. Our experiments reveal a drop in F1 of up to 48% (average 31%) for task-specific models. LLMs perform on par with task-specific models without explicit training for clone detection, but generalize better to unseen functionalities, where F1 drops up to 5% (average 3%) instead. We propose and evaluate the use of contrastive learning to improve the performance of existing models on clones of unseen functionality. We draw inspiration from the computer vision and natural language processing fields where contrastive learning excels at measuring similarity between two objects, even if they come from classes unseen during training. We replace the final classifier of the task-specific models with a contrastive classifier, while for the generative LLMs we propose contrastive in-context learning, guiding the LLMs to focus on the differences between clones and non-clones. The F1 on clones of unseen functionality is improved by up to 26% (average 9%) for task-specific models and up to 5% (average 3%) for LLMs.
Abstract（参考訳）: セマンティックコードクローン検出(Semantic code clone detection)は、2つのコードスニペットが同じ機能(例: Sort Array)を実装しているかどうかを検出するタスクである。近年,多くのニューラルモデルがこの課題に対してほぼ完全な性能を達成している。これらのモデルは、トレーニングデータに基づいて推論を試みる。その結果、彼らは訓練中に見たクローンとよく似たクローンを検知し、まだ検出していないクローンを検出するのに苦労する可能性がある。クローンを探しているデベロッパーは、もちろんどちらのタイプのクローンにも興味がある。この主張を文献レビューを通じて確認し、異なる機能のクローンで訓練された場合でも、モデルの目的が機能のクローンを検出することである3つの実用的なクローン検出タスクを特定した。この発見を踏まえて、未確認機能のクローンを検出するタスクにおいて、タスク固有モデルと生成LDMの両方を含む6つの最先端モデルを再評価する。実験の結果,タスク固有モデルではF1が最大48%(平均31%)減少していることがわかった。 LLMは、クローン検出のための明示的なトレーニングをすることなく、タスク固有のモデルと同等に動作するが、F1が5%(平均3%)まで低下する、見知らぬ機能に最適化される。そこで本研究では,既存モデルの非可視機能のクローン上での性能向上を図るために,コントラッシブ・ラーニング(コントラッシブ・ラーニング)の利用法を提案する。コンピュータビジョンや自然言語処理の分野からインスピレーションを得て,2つのオブジェクト間の類似度を,たとえ学習中に見つからないクラスから来たとしても,対照的な学習が優れている。タスク固有モデルの最終分類をコントラスト型分類器に置き換える一方、生成型LLMではコントラスト型インコンテキスト学習を提案し、LLMはクローンと非クローンの違いに焦点をあてる。未確認機能のクローン上のF1は、タスク固有のモデルでは最大26%(平均9%)、LLMでは最大5%(平均3%)改善されている。

論文の概要: Detecting Semantic Clones of Unseen Functionality

関連論文リスト