Fugu-MT 論文翻訳(概要): Think Multilingual, Not Harder: A Data-Efficient Framework for Teaching Reasoning Models to Code-Switch

論文の概要: Think Multilingual, Not Harder: A Data-Efficient Framework for Teaching Reasoning Models to Code-Switch

arxiv url: http://arxiv.org/abs/2604.15490v1
Date: Thu, 16 Apr 2026 19:44:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-20 22:00:19.638003
Title: Think Multilingual, Not Harder: A Data-Efficient Framework for Teaching Reasoning Models to Code-Switch
Title（参考訳）: マルチ言語を思い浮かべる - コードスイッチに推論モデルを教えるためのデータ効率のよいフレームワーク
Authors: Eleanor M. Lin, David Jurgens,
Abstract要約: 我々は,大規模言語モデルにおける有益なコード切替推論行動を特定するための,言語的かつ行動的に動機付けられたファインチューニングフレームワークを紹介した。まず、さまざまなモデル、言語、タスク、ドメインからの推論トレースのデータセットを分析し、既存の推論モデルに見られるコードスイッチングの振る舞いのタイプを理解する。我々は、既存のモデルで有用な振る舞いの観察に基づいて、推論モデルにコードスウィッチを教える微調整の介入を開発する。
参考スコア（独自算出の注目度）: 24.894866118925847
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent developments in reasoning capabilities have enabled large language models to solve increasingly complex mathematical, symbolic, and logical tasks. Interestingly, while reasoning models are often trained to generate monolingual text, these models have also been observed to code-switch (i.e., mix languages). Prior works have either viewed code-switching as an undesirable error, attempted to control code-switching through modifications to input prompts or the output decoding process, or focus on narrow subsets of languages, domains, tasks, and models. We address these gaps by introducing the first linguistically and behaviorally motivated fine-tuning framework for identifying beneficial code-switched reasoning behaviors in large language models and teaching these models to code-switch more effectively for reasoning. First, we create and systematically analyze a dataset of reasoning traces from diverse models, languages, tasks, and domains to understand the types of code-switching behaviors found in existing reasoning models. Then, we develop fine-tuning interventions that teach reasoning models to code-switch based on our observations of helpful behaviors in existing models. We find that our framework can significantly increase beneficial code-switched reasoning behaviors in a data-efficient manner. Interestingly, we also find that code-switching behaviors in reasoning models can be modified by fine-tuning for tasks that do not directly demonstrate code-switching in reasoning (e.g., machine translation). Our work suggests that data-efficient interventions can instill helpful forms of code-switching behavior in reasoning models.
Abstract（参考訳）: 近年の推論能力の発達により、大規模言語モデルはますます複雑な数学的、記号的、論理的タスクを解けるようになった。興味深いことに、推論モデルは単言語テキストを生成するためにしばしば訓練されるが、これらのモデルはコードスウィッチ(ミックス言語)にも観察されている。以前の作業では、コードスイッチングを望ましくないエラーと見なしていたり、入力プロンプトや出力デコードプロセスの変更によるコードスイッチングの制御を試みたり、言語、ドメイン、タスク、モデルの狭いサブセットにフォーカスしたりしていた。これらのギャップに対処するために,我々は,大規模言語モデルにおいて有益なコードスイッチによる推論行動を特定するための,言語的かつ行動的に動機付けられた初歩的な微調整フレームワークを導入し,それらのモデルに推論をより効果的に教えることによって,これらのギャップに対処する。まず、既存の推論モデルに見られるコードスイッチングの振る舞いのタイプを理解するために、さまざまなモデル、言語、タスク、ドメインからの推論トレースのデータセットを作成し、体系的に分析する。そこで我々は,既存のモデルにおける有用な行動の観察に基づいて,推論モデルをコードスウィッチに教える微調整の介入を開発する。当社のフレームワークは,データ効率のよいコード変更による推論行動を大幅に向上させることができる。興味深いことに、推論モデルにおけるコードスイッチングの動作は、推論におけるコードスイッチングを直接示さないタスク(例えば機械翻訳)を微調整することで修正できる。我々の研究は、データ効率のよい介入が推論モデルに有用なコードスイッチング行動をもたらすことを示唆している。

論文の概要: Think Multilingual, Not Harder: A Data-Efficient Framework for Teaching Reasoning Models to Code-Switch

関連論文リスト