Fugu-MT paper translation (abstract): SFT Doesn't Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs

Paper abstract: SFT Doesn't Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs

arxiv url: http://arxiv.org/abs/2509.20758v1
Date: Thu, 25 Sep 2025 05:28:22 GMT
Status: Translation complete
System update date: 2025-09-26 20:58:12.708271
Title: SFT Doesn't Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs
Title (reference translation): SFT Does Not Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs
Authors: Jiacheng Lin, Zhongruo Wang, Kun Qian, Tian Wang, Arvind Srinivasan, Hansi Zeng, Ruochen Jiao, Xie Zhou, Jiri Gesi, Dakuo Wang, Yufan Guo, Kai Zhong, Weiqi Zhang, Sujay Sanghavi, Changyou Chen, Hyokun Yun, Lihong Li
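Abstract summary: Supervised Fine-Tuning (SFT) is a common approach for adapting Large Language Models (LLMs) to specialized tasks. Using a smaller learning rate can substantially mitigate general performance degradation.
Reference score (attention score computed independently by this site): 53.77646961962239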
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Supervised Fine-Tuning (SFT) on domain-specific datasets is a common approach to adapt Large Language Models (LLMs) to specialized tasks but is often believed to degrade their general capabilities. In this work, we revisit this trade-off and present both empirical and theoretical insights. First, we show that SFT does not always hurt: using a smaller learning rate can substantially mitigate general performance degradation while preserving comparable target-domain performance. We then provide a theoretical analysis that explains these phenomena and further motivates a new method, Token-Adaptive Loss Reweighting (TALR). Building on this, and recognizing that smaller learning rates alone do not fully eliminate general-performance degradation in all cases, we evaluate a range of strategies for reducing general capability loss, including L2 regularization, LoRA, model averaging, FLOW, and our proposed TALR. Experimental results demonstrate that while no method completely eliminates the trade-off, TALR consistently outperforms these baselines in balancing domain-specific gains and general capabilities. Finally, we distill our findings into practical guidelines for adapting LLMs to new domains: (i) using a small learning rate to achieve a favorable trade-off, and (ii) when a stronger balance is further desired, adopt TALR as an effective strategy.
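Abstract (reference translation): Supervised Fine-Tuning (SFT) on domain-specific datasets is a common approach for adapting Large Language Models (LLMs) to specialized tasks, but it is often believed to degrade their general capabilities. This work revisits that trade-off and presents both empirical and theoretical findings. First, it shows that SFT does not always hurt: a smaller learning rate can substantially mitigate general performance degradation while maintaining comparable target-domain performance. Next, it provides a theoretical analysis that explains these phenomena and motivates a new method, Token-Adaptive Loss Reweighting (TALR). Building on this, and recognizing that a smaller learning rate alone does not fully eliminate general-performance degradation in all cases, it evaluates a range of strategies for reducing general capability loss, including L2 regularization, LoRA, model averaging, FLOW, and the proposed TALR. Experimental results show that, although no method completely eliminates the trade-off, TALR consistently outperforms these baselines in balancing domain-specific gains and general capabilities. Finally, the findings are distilled into practical guidelines for adapting LLMs to new domains: (i) use a small learning rate to achieve a favorable trade-off, and (ii) when a stronger balance is desired, adopt TALR as an effective strategy.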
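
Illustrative sketch (not from the paper): the abstract names several strategies for limiting general-capability loss but gives no formulas. The Python below shows toy versions of three of them, under stated assumptions. `average_models` assumes "model averaging" means linear interpolation between base and fine-tuned weights. `l2_to_base_penalty` assumes the L2 penalty is taken toward the pretrained weights. `reweighted_token_loss` uses a hypothetical weighting rule (weight proportional to p^(1/tau)) purely to show what token-level loss reweighting looks like; it should not be read as the TALR formula. All function names, the `Params` type, and the `alpha`, `lam`, and `tau` parameters are invented for this example.

```python
import math
from typing import Dict, List

# Toy stand-in for a model state dict: parameter name -> flat list of weights.
Params = Dict[str, List[float]]


def average_models(base: Params, tuned: Params, alpha: float = 0.5) -> Params:
    """Linear interpolation: (1 - alpha) * base + alpha * tuned, per weight."""
    if base.keys() != tuned.keys():
        raise ValueError("base and tuned models must share parameter names")
    return {
        name: [(1 - alpha) * b + alpha * t for b, t in zip(base[name], tuned[name])]
        for name in base
    }


def l2_to_base_penalty(base: Params, current: Params, lam: float) -> float:
    """Assumed form: lam * squared distance between current and base weights."""
    return lam * sum(
        (c - b) ** 2
        for name in base
        for b, c in zip(base[name], current[name])
    )


def reweighted_token_loss(token_probs: List[float], tau: float = 1.0) -> float:
    """Weighted negative log-likelihood over the target tokens.

    Hypothetical rule: weight_i is proportional to p_i ** (1 / tau), normalized
    to sum to 1, so low-probability (high-loss) tokens contribute less.
    """
    if not token_probs or any(p <= 0.0 or p > 1.0 for p in token_probs):
        raise ValueError("token_probs must be non-empty with values in (0, 1]")
    losses = [-math.log(p) for p in token_probs]
    raw = [p ** (1.0 / tau) for p in token_probs]
    total = sum(raw)
    return sum((w / total) * loss for w, loss in zip(raw, losses))
```

In this toy setup, `alpha` closer to 0 keeps the averaged model nearer to the base weights, and a smaller `tau` concentrates the loss on tokens the model already predicts well.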

