Fugu-MT 論文翻訳(概要): HeavyWater and SimplexWater: Watermarking Low-Entropy Text Distributions

論文の概要: HeavyWater and SimplexWater: Watermarking Low-Entropy Text Distributions

arxiv url: http://arxiv.org/abs/2506.06409v1
Date: Fri, 06 Jun 2025 13:52:34 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-10 16:33:10.266858
Title: HeavyWater and SimplexWater: Watermarking Low-Entropy Text Distributions
Title（参考訳）: HeavyWaterとSimplexWater:低エントロピーテキスト分布の透かし
Authors: Dor Tsur, Carol Xuan Long, Claudio Mayrink Verdun, Hsiang Hsu, Chen-Fu Chen, Haim Permuter, Sajani Vithana, Flavio P. Calmon,
Abstract要約: 大きな言語モデル(LLM)の透かしは、テキストの証明の認証、機械生成されたテキストの誤用の抑制、AIシステムの信頼の促進を可能にする。 LLMウォーターマーキングは、コーディングのような低エントロピー生成タスクでは困難である。我々のゴールは、透かし検出の可能性を最大化し、生成したテキストの歪みを最小限に抑えるために、ランダムな側情報をどのように効果的に利用するかを理解することである。
参考スコア（独自算出の注目度）: 9.08291061147965
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language model (LLM) watermarks enable authentication of text provenance, curb misuse of machine-generated text, and promote trust in AI systems. Current watermarks operate by changing the next-token predictions output by an LLM. The updated (i.e., watermarked) predictions depend on random side information produced, for example, by hashing previously generated tokens. LLM watermarking is particularly challenging in low-entropy generation tasks - such as coding - where next-token predictions are near-deterministic. In this paper, we propose an optimization framework for watermark design. Our goal is to understand how to most effectively use random side information in order to maximize the likelihood of watermark detection and minimize the distortion of generated text. Our analysis informs the design of two new watermarks: HeavyWater and SimplexWater. Both watermarks are tunable, gracefully trading-off between detection accuracy and text distortion. They can also be applied to any LLM and are agnostic to side information generation. We examine the performance of HeavyWater and SimplexWater through several benchmarks, demonstrating that they can achieve high watermark detection accuracy with minimal compromise of text generation quality, particularly in the low-entropy regime. Our theoretical analysis also reveals surprising new connections between LLM watermarking and coding theory. The code implementation can be found in https://github.com/DorTsur/HeavyWater_SimplexWater
Abstract（参考訳）: 大きな言語モデル(LLM)の透かしは、テキストの証明の認証、機械生成されたテキストの誤用の抑制、AIシステムの信頼の促進を可能にする。現在の透かしは、LCMによって出力される次のトーケン予測を変更して動作する。更新された(すなわち透かし付き)予測は、例えば、以前に生成されたトークンをハッシュすることによって生成されたランダムな側情報に依存する。コーディングのような低エントロピー生成タスクでは、次のトーケン予測がほぼ決定論的である場合、LLMウォーターマーキングは特に困難である。本稿では,透かし設計のための最適化フレームワークを提案する。我々のゴールは、透かし検出の可能性を最大化し、生成したテキストの歪みを最小限に抑えるために、ランダムな側情報をどのように効果的に利用するかを理解することである。本分析では,HavyWaterとSimplexWaterの2つの新しい透かしの設計について報告する。両方の透かしは調整可能で、検出精度とテキスト歪みの間に優雅にトレードオフがある。 LLMにも適用可能で、サイド情報生成に非依存である。いくつかのベンチマークでヘビーウォーターとSimplexWaterの性能を検証し、特に低エントロピー状態においてテキスト生成品質の最小限の妥協で高い透かし検出精度を達成できることを実証した。我々の理論解析は、LLM透かしと符号化理論の間の驚くべき新しい関係も明らかにしている。コードの実装はhttps://github.com/DorTsur/HeavyWater_SimplexWaterで確認できる。

論文の概要: HeavyWater and SimplexWater: Watermarking Low-Entropy Text Distributions

関連論文リスト