Fugu-MT 論文翻訳(概要): Forget What Matters, Keep the Rest: Selective Unlearning of Informative Tokens

論文の概要: Forget What Matters, Keep the Rest: Selective Unlearning of Informative Tokens

arxiv url: http://arxiv.org/abs/2604.17785v1
Date: Mon, 20 Apr 2026 04:20:29 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.687535
Title: Forget What Matters, Keep the Rest: Selective Unlearning of Informative Tokens
Title（参考訳）: 何が重要か忘れる, 残りを保て - インフォーマティブ・トークンの選択的アンラーニング
Authors: Seunghee Koh, Sunghyun Baek, Youngdong Kim, Junmo Kim,
Abstract要約: トークンレベルの未学習正規化器として,Entropy-Guided Token Weighting (ETW)を提案する。 ETWは、予測分布のエントロピーをトークン情報性のプロキシとして利用する。情報トークンはエントロピーが高い傾向を示す一方、構造トークンはエントロピーが低い傾向を示す。
参考スコア（独自算出の注目度）: 9.36751822006717
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Unlearning in large language models (LLMs) has emerged as a promising safeguard against adversarial behaviors. When the forgetting loss is applied uniformly without considering token-level semantic importance, model utility can be unnecessarily degraded. Recent studies have explored token-wise loss regularizers that prioritize informative tokens, but largely rely on ground-truth confidence or external linguistic parsers, which limits their ability to capture contextual information or the model's overall predictive state. Intuitively, function words like "the" primarily serve syntactic roles and are highly predictable with little ambiguity, but informative words admit multiple plausible alternatives with greater uncertainty. Based on this intuition, we propose Entropy-guided Token Weighting (ETW), a token-level unlearning regularizer that uses entropy of the predictive distribution as a proxy for token informativeness. We demonstrate that informative tokens tend to have higher entropy, whereas structural tokens tend to have lower entropy. This behavior enables ETW to achieve more effective unlearning while better preserving model utility than existing token-level approaches.
Abstract（参考訳）: 大規模言語モデル(LLM)における未学習は、敵の行動に対する有望な保護として現れている。トークンレベルのセマンティックな重要性を考慮せずに、忘れる損失を均一に適用する場合、モデルユーティリティは不要に劣化させることができる。近年の研究では、情報的トークンを優先するトークンワイド・ロス・レギュレータについて検討されているが、多くの場合は、文脈情報やモデル全体の予測状態を捉える能力を制限する、地道的信頼や外部言語パーサに依存している。直感的には、"the"のような関数語は主に統語的役割を担い、曖昧さがほとんどないほど高い予測性を持つが、情報的な単語はより不確実な複数の可算的な選択肢を認める。この直感に基づいて,予測分布のエントロピーを用いたトークンレベルの未学習正規化器であるEntropy-Guided Token Weighting (ETW)を提案する。情報トークンはエントロピーが高い傾向を示す一方、構造トークンはエントロピーが低い傾向を示す。この振る舞いにより、ETWは既存のトークンレベルのアプローチよりもモデルの有用性を保ちながら、より効果的な未学習を実現することができる。

論文の概要: Forget What Matters, Keep the Rest: Selective Unlearning of Informative Tokens

関連論文リスト