Fugu-MT 論文翻訳(概要): Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning

論文の概要: Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning

arxiv url: http://arxiv.org/abs/2509.13755v1
Date: Wed, 17 Sep 2025 07:12:35 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-18 18:41:50.751398
Title: Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning
Title（参考訳）: Scrub it out! 機械学習によるコード言語モデルの感性記憶の消去
Authors: Zhaoyang Chu, Yao Wan, Zhikun Zhang, Di Wang, Zhou Yang, Hongyu Zhang, Pan Zhou, Xuanhua Shi, Hai Jin, David Lo,
Abstract要約: Code Language Models (CLMs)は、機密性のあるトレーニングデータの意図しない記憶を示し、具体的に指示された場合に機密情報の冗長な再現を可能にする。 CodeEraserは、コードの構造的整合性と機能的正当性を保ちながら、センシティブな記憶されたセグメントを選択的にコードに解放する高度な変種である。
参考スコア（独自算出の注目度）: 50.45435841411193
License: http://creativecommons.org/licenses/by/4.0/
Abstract: While Code Language Models (CLMs) have demonstrated superior performance in software engineering tasks such as code generation and summarization, recent empirical studies reveal a critical privacy vulnerability: these models exhibit unintended memorization of sensitive training data, enabling verbatim reproduction of confidential information when specifically prompted. To address this issue, several approaches, including training data de-duplication and differential privacy augmentation, have been proposed. However, these methods require full-model retraining for deployed CLMs, which incurs substantial computational costs. In this paper, we aim to answer the following research question: Can sensitive information memorized by CLMs be erased effectively and efficiently? We conduct a pioneering investigation into erasing sensitive memorization in CLMs through machine unlearning - a post-hoc modification method that removes specific information from trained models without requiring full retraining. Specifically, we first quantify the memorization risks of sensitive data within CLM training datasets and curate a high-risk dataset of 50,000 sensitive memorized samples as unlearning targets. We study two widely used gradient ascent-based unlearning approaches: the vanilla and constraint-based methods, and introduce CodeEraser, an advanced variant that selectively unlearns sensitive memorized segments in code while preserving the structural integrity and functional correctness of the surrounding code. Extensive experiments on three families of CLMs, i.e., CodeParrot, CodeGen-Mono, and Qwen2.5-Coder, validate the effectiveness and efficiency of CodeEraser in erasing targeted sensitive memorization while maintaining model utility.
Abstract（参考訳）: コード言語モデル(CLM)は、コード生成や要約などのソフトウェアエンジニアリングタスクにおいて優れたパフォーマンスを示しているが、最近の実証的研究は、重大なプライバシー上の脆弱性を明らかにしている。この問題に対処するため、データ重複のトレーニングや差分プライバシー強化など、いくつかのアプローチが提案されている。しかし、これらの手法はCLMのフルモデル再訓練を必要とするため、かなりの計算コストがかかる。本稿では,CLMが記憶する機密情報を効果的かつ効率的に消去できるか,という課題に対処することを目的とする。完全再トレーニングを必要とせずに、トレーニングされたモデルから特定の情報を除去するポストホックな修正手法である、機械学習によるCLMのセンシティブな記憶の消去に関する先駆的な調査を行う。具体的には、まず、CLMトレーニングデータセット内の機密データの記憶リスクを定量化し、5万件の機密記憶サンプルの高リスクデータセットを学習対象としてキュレートする。我々は、バニラ法と制約法という2つの広く使われている勾配上昇に基づくアンラーニング手法について検討し、周辺コードの構造的整合性と機能的正当性を保ちながら、コード中のセンシティブな記憶セグメントを選択的に解放する先進的な変種であるCodeEraserを紹介した。 CLMの3つのファミリー、すなわちCodeParrot、CodeGen-Mono、Qwen2.5-Coderに対する大規模な実験は、モデルユーティリティを維持しながら、ターゲットの機密記憶を消去するCodeEraserの有効性と効率を検証する。

論文の概要: Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning

関連論文リスト