Fugu-MT 論文翻訳(概要): SeedPrints: Fingerprints Can Even Tell Which Seed Your Large Language Model Was Trained From

論文の概要: SeedPrints: Fingerprints Can Even Tell Which Seed Your Large Language Model Was Trained From

arxiv url: http://arxiv.org/abs/2509.26404v1
Date: Tue, 30 Sep 2025 15:34:08 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-01 17:09:04.598937
Title: SeedPrints: Fingerprints Can Even Tell Which Seed Your Large Language Model Was Trained From
Title（参考訳）: SeedPrints:フィンガープリントは、大きめの言語モデルを訓練したことを教えてくれる
Authors: Yao Tong, Haonan Wang, Siquan Li, Kenji Kawaguchi, Tianyang Hu,
Abstract要約: 我々は,LDMフィンガープリントのより強く,より本質的な概念であるSeedPrintsを提案する。トレーニングされていないモデルでは,パラメータのみに依存した再現可能なトークン選択バイアスが示される。 LLaMAスタイルとQwenスタイルのモデルの実験では、SeedPrintsはシードレベルの識別性を実現し、バイオメトリック指紋に似た生来からライフサイクルの識別認証を提供する。
参考スコア（独自算出の注目度）: 65.75182441010327
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Fingerprinting Large Language Models (LLMs) is essential for provenance verification and model attribution. Existing methods typically extract post-hoc signatures based on training dynamics, data exposure, or hyperparameters -- properties that only emerge after training begins. In contrast, we propose a stronger and more intrinsic notion of LLM fingerprinting: SeedPrints, a method that leverages random initialization biases as persistent, seed-dependent identifiers present even before training. We show that untrained models exhibit reproducible token selection biases conditioned solely on their parameters at initialization. These biases are stable and measurable throughout training, enabling our statistical detection method to recover a model's lineage with high confidence. Unlike prior techniques, unreliable before convergence and vulnerable to distribution shifts, SeedPrints remains effective across all training stages and robust under domain shifts or parameter modifications. Experiments on LLaMA-style and Qwen-style models show that SeedPrints achieves seed-level distinguishability and can provide birth-to-lifecycle identity verification akin to a biometric fingerprint. Evaluations on large-scale pretrained models and fingerprinting benchmarks further confirm its effectiveness under practical deployment scenarios. These results suggest that initialization itself imprints a unique and persistent identity on neural language models, forming a true ''Galtonian'' fingerprint.
Abstract（参考訳）: フィンガープリンティング大規模言語モデル(LLM)は、証明の検証とモデルの帰属に不可欠である。既存のメソッドは通常、トレーニングの開始後にのみ現れる特性であるトレーニングダイナミクス、データ露出、ハイパーパラメータに基づいて、ポストホックシグネチャを抽出する。対照的に、私たちはLDMフィンガープリントのより強く、より本質的な概念を提唱する: SeedPrintsは、ランダム初期化バイアスをトレーニング前にも持続的で、種に依存しない識別子として活用する手法である。未学習モデルでは初期化時にパラメータにのみ条件付きで再現可能なトークン選択バイアスを示す。これらのバイアスはトレーニングを通じて安定して測定可能であり、統計的検出手法によりモデルの系統を高い信頼性で復元することができる。収束前の信頼性が低く、分散シフトに弱い従来のテクニックとは異なり、SeedPrintsはすべてのトレーニングステージで有効であり、ドメインシフトやパラメータ修正の下で堅牢である。 LLaMAスタイルとQwenスタイルのモデルの実験では、SeedPrintsはシードレベルの識別性を実現し、バイオメトリック指紋に似た生来からライフサイクルの識別認証を提供する。大規模な事前訓練モデルとフィンガープリントベンチマークの評価により、実際のデプロイシナリオ下での有効性がさらに確認される。これらの結果は、初期化自体がニューラルネットワークモデルにユニークで永続的なアイデンティティを印字し、真の'ガルトン'指紋を形成することを示唆している。

論文の概要: SeedPrints: Fingerprints Can Even Tell Which Seed Your Large Language Model Was Trained From

関連論文リスト