Fugu-MT 論文翻訳(概要): LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens

論文の概要: LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens

arxiv url: http://arxiv.org/abs/2510.11919v1
Date: Mon, 13 Oct 2025 20:41:01 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-15 19:02:32.090568
Title: LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens
Title（参考訳）: 機械翻訳のためのLLM推論:思考トークンによる合成データ生成
Authors: Armel Zebaze, Rachel Bawden, Benoît Sagot,
Abstract要約: シンキングトークン」は、LRMが機械翻訳をより良く実行するのに役立ちません。合成CoT説明によるモデル微調整は、標準入力出力微調整よりは良くない。以上の結果から,教師が目標翻訳を洗練したり,並列コーパスを拡張することは,CoTの説明を「思考」MTモデルに蒸留するよりも影響が大きいことが示唆された。
参考スコア（独自算出の注目度）: 25.257363122413395
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large reasoning models (LRMs) have led to new possibilities in terms of problem-solving, through the devising of a natural language thought process prior to answering a query. While their capabilities are well known across mathematics and coding tasks, their impact on the task of machine translation (MT) remains underexplored. In this work, we explore the benefits of the generation of intermediate tokens when performing MT across multiple language pairs of different levels of resourcedness and multiple setups. We find that "thinking tokens" do not help LRMs better perform MT. This result generalizes to models fine-tuned to reason before translating using distilled chain of thought (CoT) inspired by human translators' practices. Specifically, fine-tuning a model with synthetic CoT explanations detailing how to translate step-by-step does not outperform standard input-output fine-tuning. However, constructing the intermediate tokens by combining the outputs of modular translation-specific prompting strategies results in improvements. Our findings underscore that the contribution of intermediate tokens during fine-tuning highly depends on the presence of translation attempts within them. More broadly, our results suggest that using a teacher to refine target translations or to expand parallel corpora is more impactful than distilling their CoT explanations into "thinking" MT models.
Abstract（参考訳）: 大きな推論モデル(LRM)は、クエリに応答する前に自然言語の思考プロセスを開発することによって、問題解決の観点から新たな可能性をもたらしている。それらの能力は数学やコーディングのタスクでよく知られているが、機械翻訳(MT)のタスクに対する影響はいまだ解明されていない。本研究では,複数の言語対にまたがる複数のリソースと複数のセットアップでMTを実行する場合の中間トークン生成の利点について検討する。この結果は、人間の翻訳者の実践にインスパイアされた蒸留された思考の連鎖(CoT)を用いて翻訳する前に、合理的に微調整されたモデルに一般化される。具体的には、ステップバイステップの翻訳方法を詳述した合成CoT説明によるモデル微調整は、標準入力出力微調整よりも優れていない。しかし、モジュール翻訳固有のプロンプト戦略の出力を組み合わせて中間トークンを構築することにより、改善がもたらされる。その結果, 微調整中の中間トークンの寄与は, 翻訳の試みの有無に大きく依存していることが判明した。より広範に,本研究の結果から,教師が目標翻訳を洗練したり,並列コーパスを拡張することは,CoTの説明を「思考」MTモデルに蒸留するよりも影響が大きいことが示唆された。

論文の概要: LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens

関連論文リスト