Fugu-MT 論文翻訳(概要): MLLM Machine Unlearning via Visual Knowledge Distillation

論文の概要: MLLM Machine Unlearning via Visual Knowledge Distillation

arxiv url: http://arxiv.org/abs/2512.11325v1
Date: Fri, 12 Dec 2025 06:51:02 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-15 15:48:11.684277
Title: MLLM Machine Unlearning via Visual Knowledge Distillation
Title（参考訳）: 視覚的知識蒸留によるMLLM機械の学習
Authors: Yuhang Wang, Zhenxing Niu, Haoxuan Ji, Guangyu He, Haichang Gao, Gang Hua,
Abstract要約: テキスト知識を保存しながら,対象とする視覚的知識を選択的に消去する専用手法を提案する。出力レベルの監視に依存する従来の未学習手法とは異なり,本手法では視覚的知識蒸留方式を導入している。我々は,再学習攻撃に対するMLLMアンラーニングの頑健さを初めて評価した。
参考スコア（独自算出の注目度）: 29.266057382659728
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recently, machine unlearning approaches have been proposed to remove sensitive information from well-trained large models. However, most existing methods are tailored for LLMs, while MLLM-oriented unlearning remains at its early stage. Inspired by recent studies exploring the internal mechanisms of MLLMs, we propose to disentangle the visual and textual knowledge embedded within MLLMs and introduce a dedicated approach to selectively erase target visual knowledge while preserving textual knowledge. Unlike previous unlearning methods that rely on output-level supervision, our approach introduces a Visual Knowledge Distillation (VKD) scheme, which leverages intermediate visual representations within the MLLM as supervision signals. This design substantially enhances both unlearning effectiveness and model utility. Moreover, since our method only fine-tunes the visual components of the MLLM, it offers significant efficiency advantages. Extensive experiments demonstrate that our approach outperforms state-of-the-art unlearning methods in terms of both effectiveness and efficiency. Moreover, we are the first to evaluate the robustness of MLLM unlearning against relearning attacks.
Abstract（参考訳）: 近年,よく訓練された大規模モデルからセンシティブな情報を除去するための機械学習手法が提案されている。しかし、既存のほとんどの手法はLSM向けに調整されているが、MLLM指向の未学習はまだ初期段階にある。 MLLMの内部メカニズムを探求する最近の研究から着想を得て,MLLMの内部に埋め込まれた視覚的知識とテキスト的知識を分離し,テキスト的知識を保存しながら,対象の視覚的知識を選択的に消去する専用のアプローチを提案する。出力レベルの監視に依存する従来の未学習手法とは異なり,本手法では,MLLM内の中間的視覚表現を監視信号として活用する視覚的知識蒸留(VKD)方式を導入している。この設計により、未学習の有効性とモデルの有用性が大幅に向上する。さらに,本手法はMLLMの視覚成分を微調整するだけであるため,高い効率性が得られる。大規模な実験により,本手法は,有効性と効率の両面で,最先端の未学習手法より優れていることが示された。さらに,再学習攻撃に対するMLLMアンラーニングの頑健さを初めて評価した。

論文の概要: MLLM Machine Unlearning via Visual Knowledge Distillation

関連論文リスト