Fugu-MT Paper Translation (Abstract): Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates

Paper overview: Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates

arxiv url: http://arxiv.org/abs/2512.04844v1
Date: Thu, 04 Dec 2025 14:28:14 GMT
Status: Translation complete
System update date: 2025-12-05 21:11:46.217869
Title: Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates
Title (reference translation): Mitigating Catastrophic Forgetting in Target-Language Adaptation of LLMs with Source-Shielded Updates
Authors: Atsuki Yamaguchi, Terufumi Morishita, Aline Villavicencio, Nikolaos Aletras
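Abstract summary: This paper introduces Source-Shielded Updates (SSU), a parameter update strategy that proactively preserves source knowledge. SSU successfully mitigates catastrophic forgetting: it reduces performance degradation on monolingual source tasks to 3.4% (7B) and 2.8% (13B) on average, in contrast to 20.3% and 22.3% for full fine-tuning.
Reference score (independently computed attention score): 36.05883134265614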
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Expanding the linguistic diversity of instruct large language models (LLMs) is crucial for global accessibility but is often hindered by the reliance on costly specialized target language labeled data and catastrophic forgetting during adaptation. We tackle this challenge under a realistic, low-resource constraint: adapting instruct LLMs using only unlabeled target language data. We introduce Source-Shielded Updates (SSU), a selective parameter update strategy that proactively preserves source knowledge. Using a small set of source data and a parameter importance scoring method, SSU identifies parameters critical to maintaining source abilities. It then applies a column-wise freezing strategy to protect these parameters before adaptation. Experiments across five typologically diverse languages and 7B and 13B models demonstrate that SSU successfully mitigates catastrophic forgetting. It reduces performance degradation on monolingual source tasks to just 3.4% (7B) and 2.8% (13B) on average, a stark contrast to the 20.3% and 22.3% from full fine-tuning. SSU also achieves target-language performance highly competitive with full fine-tuning, outperforming it on all benchmarks for 7B models and the majority for 13B models.
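Abstract (reference translation): Expanding the linguistic diversity of instruct large language models (LLMs) is important for global accessibility, but it is often hindered by reliance on costly, specialized labeled data in the target language and by catastrophic forgetting during adaptation. We address this challenge under a realistic, low-resource constraint: adapting instruct LLMs using only unlabeled target-language data. This paper introduces Source-Shielded Updates (SSU), a parameter update strategy that proactively preserves source knowledge. Using a small set of source data and a parameter importance scoring method, SSU identifies the parameters essential to maintaining source abilities. It then applies a column-wise freezing strategy to protect these parameters before adaptation. Experiments across five typologically diverse languages and 7B and 13B models show that SSU successfully mitigates catastrophic forgetting. It reduces performance degradation on monolingual source tasks to 3.4% (7B) and 2.8% (13B) on average, in contrast to 20.3% and 22.3% for full fine-tuning. SSU also achieves target-language performance highly competitive with full fine-tuning, outperforming it on all benchmarks for 7B models and on the majority for 13B models.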
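
The column-wise freezing idea in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation. The abstract does not name the importance scoring method, so the sketch assumes a simple first-order heuristic (|weight × gradient| computed from source-data gradients), sums it per column, and freezes the highest-scoring columns. The function names `column_freeze_mask` and `masked_sgd_step` and the `freeze_ratio` parameter are hypothetical, introduced only for illustration.

```python
import numpy as np


def column_freeze_mask(weight, source_grad, freeze_ratio=0.5):
    """Return a boolean mask shaped like `weight`; True marks trainable entries.

    Assumption: importance is |weight * gradient| on source data. The paper's
    actual scoring method is not specified in the abstract.
    """
    importance = np.abs(weight * source_grad)
    # Aggregate element scores into one score per column.
    col_scores = importance.sum(axis=0)
    n_freeze = int(round(freeze_ratio * weight.shape[1]))
    # Columns with the highest scores are treated as source-critical.
    frozen_cols = np.argsort(col_scores)[::-1][:n_freeze]
    mask = np.ones(weight.shape, dtype=bool)
    mask[:, frozen_cols] = False
    return mask


def masked_sgd_step(weight, target_grad, mask, lr=0.1):
    """Plain SGD step on target-language gradients; frozen columns get no update."""
    return weight - lr * target_grad * mask


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.normal(size=(4, 6))
    g_source = rng.normal(size=(4, 6))   # stand-in for gradients on source data
    g_target = rng.normal(size=(4, 6))   # stand-in for gradients on target data

    mask = column_freeze_mask(w, g_source, freeze_ratio=0.5)
    w_new = masked_sgd_step(w, g_target, mask)

    frozen = ~mask.any(axis=0)
    print("frozen columns:", np.flatnonzero(frozen))
    print("frozen columns unchanged:", np.array_equal(w_new[:, frozen], w[:, frozen]))
```

The mask is computed once, before adaptation, which mirrors the abstract's statement that protection is applied before training on target-language data. Freezing whole columns rather than individual entries is the structural choice the abstract describes; the 50% ratio here is an arbitrary placeholder.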


Related papers