Fugu-MT 論文翻訳(概要): DP-SelFT: Differentially Private Selective Fine-Tuning for Large Language Models

論文の概要: DP-SelFT: Differentially Private Selective Fine-Tuning for Large Language Models

arxiv url: http://arxiv.org/abs/2605.17432v1
Date: Sun, 17 May 2026 12:55:11 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-19 17:57:48.015604
Title: DP-SelFT: Differentially Private Selective Fine-Tuning for Large Language Models
Title（参考訳）: DP-SelFT:大規模言語モデルのための微分プライベート選択ファインチューニング
Authors: Haichao Sha, Zihao Wang, Yuncheng Wu, Hong Chen, Wei Dong,
Abstract要約: 大規模言語モデル(LLM)の個人選択微調整のためのフレームワークであるDP-SelFTを提案する。 DP-SelFTは、パラメータ選択におけるDP固有の3つの課題に対処している。実験の結果、DP-SelFTはプライバシー保証の下で、既存のDPファインチューニングベースラインに対するプライバシーとユーティリティのトレードオフを一貫して改善していることがわかった。
参考スコア（独自算出の注目度）: 28.62057098410814
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) are commonly adapted to downstream tasks through fine-tuning, but fine-tuning data often contains sensitive information that may be leaked by the resulting model. Differential privacy (DP) offers formal protection against such leakage, yet DP fine-tuning of LLMs still suffers from substantial utility degradation due to gradient clipping and noise injection. Existing work improves this trade-off by combining DP with parameter-efficient fine-tuning methods such as LoRA, which constrain the form of updates. In this work, we study a complementary direction: selective fine-tuning, which constrains where updates are applied. We propose DP-SelFT, a framework for differentially private selective fine-tuning of LLMs. DP-SelFT addresses three DP-specific challenges in parameter selection: avoiding repeated privacy cost, improving stability under noisy estimates, and selecting parameters that remain useful under clipped and noisy updates. It first constructs a lightweight DP synthetic dataset and performs selection only on this synthetic data, so the selection stage incurs no additional privacy cost. It then conducts layer-level selection by temporarily training candidate layer subsets on a synthetic training split and evaluating them on a synthetic validation split. Crucially, this temporary training is performed under a perturbation regime matched to downstream DP fine-tuning, with worst-case perturbations of the same scale as DP noise. This favors layer subsets that are not only learnable but also robust to noisy private updates. Experiments on benchmark tasks show that DP-SelFT consistently improves the privacy--utility trade-off over existing DP fine-tuning baselines under the same privacy guarantees.
Abstract（参考訳）: 大規模言語モデル(LLM)は、細調整によって下流タスクに適応することが多いが、微調整データには、結果のモデルによってリークされる可能性のある機密情報が含まれていることが多い。差分プライバシー(DP)は、そのような漏洩に対する公式な保護を提供するが、DPによるLCMの微調整は、勾配クリッピングやノイズ注入による実用性低下に悩まされている。既存の作業は、DPとLoRAのようなパラメータ効率のよい微調整手法を組み合わせることで、このトレードオフを改善している。本研究では,更新が適用される制約である選択的微調整について,補完的な方向について検討する。 DP-SelFTは, LLMの選択的微調整のためのフレームワークである。 DP-SelFTは、パラメータ選択におけるDP固有の3つの課題に対処している。まず、軽量なDP合成データセットを構築し、この合成データに対してのみ選択を行うため、選択段階では追加のプライバシコストが発生しない。次に、合成トレーニングスプリット上で候補層サブセットを一時的に訓練し、合成検証スプリット上で評価することにより、層レベルの選択を行う。重要なことは、この一時的な訓練は、下流のDP微調整と一致する摂動体制の下で行われ、最悪ケースの摂動はDPノイズと同じ規模である。これにより、学習可能なだけでなく、ノイズの多いプライベートアップデートにも堅牢なレイヤサブセットが好まれる。ベンチマークタスクの実験では、DP-SelFTは同じプライバシー保証の下で、既存のDPファインチューニングベースラインに対するプライバシーとユーティリティのトレードオフを一貫して改善している。

論文の概要: DP-SelFT: Differentially Private Selective Fine-Tuning for Large Language Models

関連論文リスト