Fugu-MT 論文翻訳(概要): MixSD: Mixed Contextual Self-Distillation for Knowledge Injection

論文の概要: MixSD: Mixed Contextual Self-Distillation for Knowledge Injection

arxiv url: http://arxiv.org/abs/2605.16865v2
Date: Thu, 21 May 2026 07:04:19 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-22 20:14:18.422779
Title: MixSD: Mixed Contextual Self-Distillation for Knowledge Injection
Title（参考訳）: MixSD:知識注入のための混合文脈自己蒸留
Authors: Jiarui Liu, Lechen Zhang, Yongjin Yang, Yinghui He, Yingheng Wang, Weihao Xuan, Zhijing Jin, Mona Diab,
Abstract要約: Supervised Fine-tuning (SFT) は言語モデルに新しい知識を注入するために広く使われている。このことは、人間や外部システムからの微調整対象が、モデルの自動回帰分布から逸脱するためである、と我々は主張する。分布整合型知識注入のための簡易な外部教師なし手法であるMixSDを提案する。
参考スコア（独自算出の注目度）: 29.7616760417696
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Supervised fine-tuning (SFT) is widely used to inject new knowledge into language models, but it often degrades pretrained capabilities such as reasoning and general-domain performance. We argue this forgetting arises because fine-tuning targets from humans or external systems diverge from the model's autoregressive distribution, forcing the optimizer to imitate low-probability token sequences. To address this problem, we propose MixSD, a simple external-teacher-free method for distribution-aligned knowledge injection. Instead of training on fixed targets, MixSD constructs supervision dynamically by mixing tokens from two conditionals of the base model itself: an expert conditional that observes the injected fact in context, and a naive conditional that reflects the model's original prior. The resulting supervision sequences preserve the factual learning signal while remaining substantially closer to the base model's distribution. We evaluate MixSD on two synthetic corpora that we construct to study factual recall and arithmetic function acquisition in a controlled setting, together with established benchmarks for open-domain factual question answering and knowledge editing. Across multiple model scales and settings, MixSD consistently achieves a better memorization-retention trade-off compared to SFT and on-policy self distillation baselines, retaining up to 100% of the base model's held-out capability while maintaining near-perfect training accuracy, whereas standard SFT retains as little as 1%. We further show that MixSD produces substantially lower-NLL supervision targets under the base model and reduces harmful movement along Fisher-sensitive parameter directions. These results suggest that aligning supervision with the model's native generation distribution is a simple and effective principle for knowledge injection that mitigates catastrophic forgetting.
Abstract（参考訳）: Supervised Fine-tuning (SFT) は言語モデルに新しい知識を注入するために広く使われているが、推論や一般ドメインのパフォーマンスなどの事前訓練能力は劣化することが多い。このことは、人間や外部システムからの微調整対象が自己回帰分布から逸脱し、オプティマイザに低確率トークンシーケンスを模倣させてしまうためである、と我々は主張する。この問題に対処するために、分布整合型知識注入のためのシンプルな外部教師なし手法であるMixSDを提案する。固定されたターゲットをトレーニングする代わりに、MixSDは、ベースモデル自体の2つの条件からトークンを混合することによって、動的に監視を構築する。結果として得られた監視シーケンスは、ベースモデルの分布にかなり近づきながら、実際の学習信号を保存する。そこで我々は,MixSDを2つの合成コーパス上で評価し,実ミスリコールと算術関数の獲得を制御環境で研究すると共に,オープンドメインの事実質問応答と知識編集のための確立されたベンチマークを作成した。複数のモデルスケールと設定にわたって、MixSDはSFTやオンラインの自己蒸留ベースラインと比較して、記憶と保持のトレードオフを一貫して達成し、ベースモデルの保持能力の最大100%を維持しながら、ほぼ完璧なトレーニング精度を維持しているのに対して、標準のSFTは1%しか保持していない。さらに、MixSDはベースモデルの下でかなり低NLL監視目標を生成し、フィッシャー感受性パラメータの方向に沿って有害な動きを減少させることを示す。これらの結果から,モデル固有の世代分布の調整は,破滅的忘れを緩和する知識注入の簡便かつ効果的な原理であることが示唆された。

論文の概要: MixSD: Mixed Contextual Self-Distillation for Knowledge Injection

関連論文リスト