Fugu-MT 論文翻訳(概要): EditMF: Drawing an Invisible Fingerprint for Your Large Language Models

論文の概要: EditMF: Drawing an Invisible Fingerprint for Your Large Language Models

arxiv url: http://arxiv.org/abs/2508.08836v1
Date: Tue, 12 Aug 2025 10:52:48 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-13 21:07:34.398163
Title: EditMF: Drawing an Invisible Fingerprint for Your Large Language Models
Title（参考訳）: EditMF:大きな言語モデルのための見えないフィンガープリントを描く
Authors: Jiaxuan Wu, Yinghan Zhou, Wanli Peng, Yiming Xue, Juan Wen, Ping Zhong,
Abstract要約: EditMFはトレーニング不要なフィンガープリントのパラダイムであり、最小の計算オーバーヘッドで非常に受け入れ難いフィンガープリントの埋め込みを実現する。 EditMF は,LoRA ベースの指紋認証をはるかに超越した堅牢性を提供しながら,認識不能なモデルの性能損失と高い認識性を兼ね備えていることを示す。
参考スコア（独自算出の注目度）: 11.691985114214162
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Training large language models (LLMs) is resource-intensive and expensive, making protecting intellectual property (IP) for LLMs crucial. Recently, embedding fingerprints into LLMs has emerged as a prevalent method for establishing model ownership. However, existing back-door-based methods suffer from limited stealth and efficiency. To simultaneously address these issues, we propose EditMF, a training-free fingerprinting paradigm that achieves highly imperceptible fingerprint embedding with minimal computational overhead. Ownership bits are mapped to compact, semantically coherent triples drawn from an encrypted artificial knowledge base (e.g., virtual author-novel-protagonist facts). Causal tracing localizes the minimal set of layers influencing each triple, and a zero-space update injects the fingerprint without perturbing unrelated knowledge. Verification requires only a single black-box query and succeeds when the model returns the exact pre-embedded protagonist. Empirical results on LLaMA and Qwen families show that EditMF combines high imperceptibility with negligible model's performance loss, while delivering robustness far beyond LoRA-based fingerprinting and approaching that of SFT embeddings. Extensive experiments demonstrate that EditMF is an effective and low-overhead solution for secure LLM ownership verification.
Abstract（参考訳）: 大規模言語モデル(LLM)の訓練は資源集約的で高価であり、LLMの知的財産権(IP)を保護することが重要である。近年, LLM に指紋を埋め込むことが, モデルオーナシップを確立する方法として広く普及している。しかし、既存のバックドア方式では、ステルスと効率が制限されている。これらの問題に同時に対処するために,我々は,計算オーバーヘッドを最小限に抑止できる指紋埋め込みを実現する,トレーニング不要なフィンガープリントパラダイムであるEditMFを提案する。オーナシップビットは、暗号化された人工的な知識ベースから引き出されたコンパクトでセマンティックにコヒーレントなトリプルにマップされる。因果トレースは、各トリプルに影響を与える最小限の層をローカライズし、ゼロスペース更新は無関係な知識を邪魔することなく指紋を注入する。検証には単一のブラックボックスクエリしか必要とせず、モデルが正確な埋め込み済みのプロタゴニストを返すと成功する。 LLaMAとQwenファミリーの実証的な結果から、EditMFは認識不能なモデルの性能損失と高い非認識性を組み合わせ、LoRAベースの指紋認証を超えて堅牢性を提供し、SFT埋め込みに近づいた。大規模な実験により、EditMFは安全なLLMオーナシップ検証のための有効かつ低オーバヘッドのソリューションであることが示された。

論文の概要: EditMF: Drawing an Invisible Fingerprint for Your Large Language Models

関連論文リスト