Fugu-MT 論文翻訳(概要): Improving Code Translation with Syntax-Guided and Semantic-aware Preference Optimization

論文の概要: Improving Code Translation with Syntax-Guided and Semantic-aware Preference Optimization

arxiv url: http://arxiv.org/abs/2605.13229v1
Date: Wed, 13 May 2026 09:19:39 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-14 23:30:27.939524
Title: Improving Code Translation with Syntax-Guided and Semantic-aware Preference Optimization
Title（参考訳）: Syntax-Guided and Semantic-Aware Preference Optimization によるコード翻訳の改善
Authors: Yuhan Wu, Huan Zhang, Wei Cheng, Chen Shen, Jingyue Yang, Wei Hu,
Abstract要約: 我々は、ソースコードから直接、コード翻訳に対する堅牢なセマンティック報酬を導き出さなければならないと論じている。本稿では,構文ガイダンスとセマンティック・アウェア・プライオリティ最適化によるコード翻訳改善のためのCTOを提案する。
参考スコア（独自算出の注目度）: 22.90890448332095
License: http://creativecommons.org/licenses/by/4.0/
Abstract: LLMs have shown immense potential for code translation, yet they often struggle to ensure both syntactic correctness and semantic consistency. While preference-based learning offers a promising alignment strategy, it is hindered by unreliable semantic rewards derived from sparse test cases or restrictive reference translations. We argue that a robust semantic reward for code translation must be derived directly from the source code. In this paper, we propose CTO to improve code translation with syntax-guided and semantic-aware preference optimization. Through contrastive learning, we train a cross-lingual semantic model to directly assess functional equivalence between source and translated code. By formulating code translation as a multi-objective optimization problem, this robust semantic signal is seamlessly unified with compiler-based syntactic feedback within the direct preference optimization framework. Extensive experiments on C++, Java, and Python translations demonstrate that CTO significantly outperforms existing baselines and alternative preference optimization strategies.
Abstract（参考訳）: LLMはコード翻訳に大きな可能性を示しているが、構文的正当性と意味的整合性の両方を保証するのに苦労することが多い。嗜好に基づく学習は、有望なアライメント戦略を提供するが、スパーステストケースや制限付き参照翻訳から派生した信頼できないセマンティック報酬によって妨げられる。我々は、ソースコードから直接、コード翻訳に対する堅牢なセマンティック報酬を導き出さなければならないと論じている。本稿では,構文ガイダンスとセマンティック・アウェア・プライオリティ最適化によるコード翻訳改善のためのCTOを提案する。コントラスト学習を通じて、ソースコードと翻訳コードの機能的等価性を直接評価するために、言語間セマンティックモデルを訓練する。多目的最適化問題としてコード翻訳を定式化することにより、この堅牢なセマンティック信号は、直接選好最適化フレームワーク内のコンパイラベースの構文フィードバックとシームレスに統合される。 C++、Java、Pythonの翻訳に関する大規模な実験は、CTOが既存のベースラインと代替の最適化戦略を大幅に上回っていることを示している。

論文の概要: Improving Code Translation with Syntax-Guided and Semantic-aware Preference Optimization

関連論文リスト