Fugu-MT 論文翻訳(概要): EffiReasonTrans: RL-Optimized Reasoning for Code Translation

論文の概要: EffiReasonTrans: RL-Optimized Reasoning for Code Translation

arxiv url: http://arxiv.org/abs/2510.18863v1
Date: Tue, 21 Oct 2025 17:55:39 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 03:08:14.121174
Title: EffiReasonTrans: RL-Optimized Reasoning for Code Translation
Title（参考訳）: EffiReasonTrans:コード翻訳のためのRL最適化推論
Authors: Yanlin Wang, Rongyi Ou, Yanli Wang, Mingwei Liu, Jiachi Chen, Ensheng Shi, Xilin Liu, Yuchi Ma, Zibin Zheng,
Abstract要約: EffiReasonTransは、推論レイテンシのバランスを保ちながら翻訳精度を向上させるために設計されたトレーニングフレームワークである。 EffiReasonTransは、生成トークンの数を減らしながら、翻訳精度(ベースモデルと比較して+49.2% CAと+27.8% CodeBLEU)を一貫して改善している。
参考スコア（独自算出の注目度）: 48.862761776440635
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Code translation is a crucial task in software development and maintenance. While recent advancements in large language models (LLMs) have improved automated code translation accuracy, these gains often come at the cost of increased inference latency, hindering real-world development workflows that involve human-in-the-loop inspection. To address this trade-off, we propose EffiReasonTrans, a training framework designed to improve translation accuracy while balancing inference latency. We first construct a high-quality reasoning-augmented dataset by prompting a stronger language model, DeepSeek-R1, to generate intermediate reasoning and target translations. Each (source code, reasoning, target code) triplet undergoes automated syntax and functionality checks to ensure reliability. Based on this dataset, we employ a two-stage training strategy: supervised fine-tuning on reasoning-augmented samples, followed by reinforcement learning to further enhance accuracy and balance inference latency. We evaluate EffiReasonTrans on six translation pairs. Experimental results show that it consistently improves translation accuracy (up to +49.2% CA and +27.8% CodeBLEU compared to the base model) while reducing the number of generated tokens (up to -19.3%) and lowering inference latency in most cases (up to -29.0%). Ablation studies further confirm the complementary benefits of the two-stage training framework. Additionally, EffiReasonTrans demonstrates improved translation accuracy when integrated into agent-based frameworks. Our code and data are available at https://github.com/DeepSoftwareAnalytics/EffiReasonTrans.
Abstract（参考訳）: コード翻訳は、ソフトウェア開発とメンテナンスにおいて重要なタスクです。大規模言語モデル(LLM)の最近の進歩は、自動コード翻訳の精度を向上しているが、これらの向上は、推論レイテンシの増加と、実際の開発ワークフローの妨げとなることが多い。このトレードオフに対処するために,推論遅延のバランスを保ちながら翻訳精度を向上させるためのトレーニングフレームワークであるEffiReasonTransを提案する。まず、より強力な言語モデルであるDeepSeek-R1を誘導し、中間的推論とターゲット翻訳を生成することによって、高品質な推論拡張データセットを構築する。それぞれの(ソースコード、推論、ターゲットコード)トリプルトは、自動シンタックスと機能チェックを実行し、信頼性を確保します。このデータセットに基づいて、推論強化サンプルの微調整を監督し、さらに精度を高め、推論遅延のバランスをとるための強化学習を行う2段階のトレーニング戦略を採用する。 EffiReasonTransを6つの翻訳対で評価した。実験の結果、翻訳精度(ベースモデルと比較して+49.2% CAと+27.8% CodeBLEU)を一貫して改善し、生成トークンの数(19.3%まで)を減らし、ほとんどの場合(-29.0%まで)推論遅延を減らした。アブレーション研究は、2段階のトレーニングフレームワークの相補的な利点をさらに確認する。さらに、EffiReasonTransでは、エージェントベースのフレームワークに統合された場合の翻訳精度が改善されている。私たちのコードとデータはhttps://github.com/DeepSoftwareAnalytics/EffiReasonTransで公開されています。

論文の概要: EffiReasonTrans: RL-Optimized Reasoning for Code Translation

関連論文リスト