Fugu-MT 論文翻訳(概要): SimKey: A Semantically Aware Key Module for Watermarking Language Models

論文の概要: SimKey: A Semantically Aware Key Module for Watermarking Language Models

arxiv url: http://arxiv.org/abs/2510.12828v1
Date: Sat, 11 Oct 2025 20:07:54 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-16 20:13:28.337969
Title: SimKey: A Semantically Aware Key Module for Watermarking Language Models
Title（参考訳）: SimKey: ウォーターマーキング言語モデルのためのセマンティックなキーモジュール
Authors: Shingo Kodama, Haya Diwan, Lucas Rosenblatt, R. Teal Witter, Niv Cohen,
Abstract要約: 透かしの堅牢性を強化するセマンティックキーモジュールであるSimKeyを紹介する。 SimKeyはセマンティック埋め込みよりも局所性に敏感なハッシュを使用して、パラフレーズ付きテキストが同じ透かしキーを生成する。言い換えや翻訳に対する透かしの堅牢性を向上し、有害な内容が偽の帰属を防ぎます。
参考スコア（独自算出の注目度）: 19.115617392855768
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The rapid spread of text generated by large language models (LLMs) makes it increasingly difficult to distinguish authentic human writing from machine output. Watermarking offers a promising solution: model owners can embed an imperceptible signal into generated text, marking its origin. Most leading approaches seed an LLM's next-token sampling with a pseudo-random key that can later be recovered to identify the text as machine-generated, while only minimally altering the model's output distribution. However, these methods suffer from two related issues: (i) watermarks are brittle to simple surface-level edits such as paraphrasing or reordering; and (ii) adversaries can append unrelated, potentially harmful text that inherits the watermark, risking reputational damage to model owners. To address these issues, we introduce SimKey, a semantic key module that strengthens watermark robustness by tying key generation to the meaning of prior context. SimKey uses locality-sensitive hashing over semantic embeddings to ensure that paraphrased text yields the same watermark key, while unrelated or semantically shifted text produces a different one. Integrated with state-of-the-art watermarking schemes, SimKey improves watermark robustness to paraphrasing and translation while preventing harmful content from false attribution, establishing semantic-aware keying as a practical and extensible watermarking direction.
Abstract（参考訳）: 大規模言語モデル (LLM) が生成するテキストの急速な拡散により, 機械出力との区別がますます困難になっている。モデル所有者は、認識できない信号を生成されたテキストに埋め込むことができ、その起源をマークすることができる。ほとんどの主要なアプローチでは、LLMの次点サンプリングを擬似ランダムキーでシードし、後になってテキストをマシン生成として識別できるが、モデルの出力分布を最小限に変化させるだけである。しかし、これらの手法は2つの問題に悩まされている。 (i)透かしは、言い換えや並べ替えなどの簡単な表面の編集に脆弱である。 (二敵は、無関係で潜在的に有害なテキストを付加することができ、その透かしを継承し、モデル所有者に評判を損なうおそれがある。これらの問題に対処するため、SimKeyというセマンティックキーモジュールを導入し、キー生成を事前のコンテキストの意味に結びつけることで、透かしの堅牢性を強化する。 SimKeyはセマンティック埋め込みよりも局所性に敏感なハッシュを使用して、パラフレーズ付きテキストが同じ透かしキーを出力することを保証する。シムキーは最先端の透かし方式と統合され、言い換えや翻訳に対する透かしの堅牢性を改善しつつ、有害な内容を偽の帰属から防ぎ、意味認識キーを実用的で拡張可能な透かし方向として確立する。

論文の概要: SimKey: A Semantically Aware Key Module for Watermarking Language Models

関連論文リスト