Fugu-MT 論文翻訳(概要): Rotation-Preserving Supervised Fine-Tuning

論文の概要: Rotation-Preserving Supervised Fine-Tuning

arxiv url: http://arxiv.org/abs/2605.10973v1
Date: Fri, 08 May 2026 20:20:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-13 21:48:56.269875
Title: Rotation-Preserving Supervised Fine-Tuning
Title（参考訳）: 回転保存型微細調整
Authors: Hangzhan Jin, Tianwei Ni, Lu Li, Pierre-Luc Bacon, Mohammad Hamdaqa, Doina Precup,
Abstract要約: Supervised Fine-tuning (SFT) はドメイン内のパフォーマンスを改善するが、ドメイン外の一般化を分解することができる。本稿では,魚の感覚方向の効率的なプロキシとして,事前訓練された特異部分空間における投影回転を保存することを提案する。 RPSFTは、各事前訓練された重み行列の投影された最高値の特異ベクトルブロックの変化を罰し、タスク適応を保ちながら不要な回転を制限する。
参考スコア（独自算出の注目度）: 39.442074320811585
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: Supervised fine-tuning (SFT) improves in-domain performance but can degrade out-of-domain (OOD) generalization. Prior work suggests that this degradation is related to changes in dominant singular subspaces of pretrained weight matrices. However, directly identifying loss-sensitive directions with Hessian or Fisher information is computationally expensive at LLM scale. In this work, we propose preserving projected rotations in pretrained singular subspaces as an efficient proxy for Fisher-sensitive directions, which we call Rotation-Preserving Supervised Fine-Tuning (RPSFT). RPSFT penalizes changes in the projected top-$k$ singular-vector block of each pretrained weight matrix, limiting unnecessary rotation while preserving task adaptation. Across model families and sizes trained on math reasoning data, RPSFT improves the in-domain/OOD trade-off over standard SFT and strong SFT baselines, better preserves pretrained representations, and provides stronger initializations for downstream RL fine-tuning. Code is available at \href{https://github.com/jinhangzhan/RPSFT.git}{https://github.com/jinhangzhan/RPSFT}.
Abstract（参考訳）: Supervised Fine-tuning (SFT) はドメイン内のパフォーマンスを向上するが、外部ドメイン(OOD)の一般化を低下させることができる。以前の研究は、この分解が事前訓練された重み行列の支配的な特異部分空間の変化と関連していることを示唆している。しかし、ロスセンシティブな方向をHessianまたはFisher情報で直接識別することは、LLMスケールで計算的に高価である。そこで本研究では,魚の捕食方向の効率的なプロキシとして,事前訓練された特異部分空間における投影された回転を保存することを提案し,これを回転保存スーパーバイザード・ファインタニング(RPSFT)と呼ぶ。 RPSFTは、各事前訓練された重み行列の投影された最高値の特異ベクトルブロックの変化を罰し、タスク適応を保ちながら不要な回転を制限する。 RPSFTは、数学推論データに基づいて訓練されたモデルファミリーとサイズにわたって、標準SFTと強力なSFTベースラインとのドメイン内/OODトレードオフを改善し、事前訓練された表現をより良く保存し、下流RL微調整のためのより強力な初期化を提供する。コードは \href{https://github.com/jinhangzhan/RPSFT.git}{https://github.com/jinhangzhan/RPSFT} で公開されている。

論文の概要: Rotation-Preserving Supervised Fine-Tuning

関連論文リスト