Fugu-MT 論文翻訳(概要): R2ComSync: Improving Code-Comment Synchronization with In-Context Learning and Reranking

論文の概要: R2ComSync: Improving Code-Comment Synchronization with In-Context Learning and Reranking

arxiv url: http://arxiv.org/abs/2510.21106v1
Date: Fri, 24 Oct 2025 02:48:59 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-28 09:00:15.363224
Title: R2ComSync: Improving Code-Comment Synchronization with In-Context Learning and Reranking
Title（参考訳）: R2ComSync: インコンテキスト学習によるコードコミュニティ同期の改善とリグレード
Authors: Zhen Yang, Hongyi Lin, Xiao Yu, Jacky Wai Keung, Shuo Liu, Pak Yuen Patrick Chan, Yicheng Sun, Fengji Zhang,
Abstract要約: ICLベースのコード・コンメント同期手法であるR2ComSyncをRetrievalとRe-levelで拡張した。大規模な実験は、R2ComSyncの他のアプローチに対する優れたパフォーマンスを示している。
参考スコア（独自算出の注目度）: 14.480581715739975
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Code-Comment Synchronization (CCS) aims to synchronize the comments with code changes in an automated fashion, thereby significantly reducing the workload of developers during software maintenance and evolution. While previous studies have proposed various solutions that have shown success, they often exhibit limitations, such as a lack of generalization ability or the need for extensive task-specific learning resources. This motivates us to investigate the potential of Large Language Models (LLMs) in this area. However, a pilot analysis proves that LLMs fall short of State-Of-The-Art (SOTA) CCS approaches because (1) they lack instructive demonstrations for In-Context Learning (ICL) and (2) many correct-prone candidates are not prioritized.To tackle the above challenges, we propose R2ComSync, an ICL-based code-Comment Synchronization approach enhanced with Retrieval and Re-ranking. Specifically, R2ComSync carries corresponding two novelties: (1) Ensemble hybrid retrieval. It equally considers the similarity in both code-comment semantics and change patterns when retrieval, thereby creating ICL prompts with effective examples. (2) Multi-turn re-ranking strategy. We derived three significant rules through large-scale CCS sample analysis. Given the inference results of LLMs, it progressively exploits three re-ranking rules to prioritize relatively correct-prone candidates. We evaluate R2ComSync using five recent LLMs on three CCS datasets covering both Java and Python programming languages, and make comparisons with five SOTA approaches. Extensive experiments demonstrate the superior performance of R2ComSync against other approaches. Moreover, both quantitative and qualitative analyses provide compelling evidence that the comments synchronized by our proposal exhibit significantly higher quality.}
Abstract（参考訳）: Code-Comment Synchronization (CCS)は、コメントとコードの変更を自動で同期させることを目的としている。これまでの研究では、成功を示す様々なソリューションが提案されているが、一般化能力の欠如やタスク固有の学習リソースの必要性といった制限がしばしば現れている。このことは、この分野におけるLarge Language Models(LLM)の可能性を探る動機となっている。しかし、パイロット分析では、(1)インコンテキスト学習(ICL)の指導的実証が欠けていること、(2)多くの正しい候補が優先されないことなどから、LCMがステートオフ・ザ・アーツ(SOTA)のアプローチに不足していることが証明され、上記の課題に対処するために、リトリーバルとリグレードによって強化されたICLベースのコードコム同期アプローチであるR2ComSyncを提案する。具体的には、R2ComSyncは、(1)エンサンブルハイブリッド検索という2つの新しい特徴を持つ。コード記述のセマンティクスと検索時の変更パターンの両方の類似性を同様に考慮し、効果的な例でICLプロンプトを作成する。 2) マルチターン・リグレード戦略。我々は大規模CCSサンプル分析により3つの重要なルールを導出した。 LLMの推測結果を考えると、相対的に正しい候補を優先順位付けするために3つの規則を段階的に活用する。我々は、JavaとPythonのプログラミング言語をカバーする3つのCSデータセット上で、5つの最近のLCMを用いてR2ComSyncを評価し、5つのSOTAアプローチと比較した。大規模な実験は、R2ComSyncの他のアプローチに対する優れたパフォーマンスを示している。さらに,定量分析と定性解析の両方が,提案手法によって同期されたコメントが著しく高い品質を示すことを示す説得力のある証拠を提供する。 ※

論文の概要: R2ComSync: Improving Code-Comment Synchronization with In-Context Learning and Reranking

関連論文リスト