Fugu-MT 論文翻訳(概要): Optimized Couplings for Watermarking Large Language Models

論文の概要: Optimized Couplings for Watermarking Large Language Models

arxiv url: http://arxiv.org/abs/2505.08878v1
Date: Tue, 13 May 2025 18:08:12 GMT
ステータス: 翻訳完了
システム内更新日: 2025-05-15 21:44:09.263903
Title: Optimized Couplings for Watermarking Large Language Models
Title（参考訳）: 大規模言語モデルの透かしのための最適結合
Authors: Dor Tsur, Carol Xuan Long, Claudio Mayrink Verdun, Hsiang Hsu, Haim Permuter, Flavio P. Calmon,
Abstract要約: 大規模言語モデル(LLM)は、多くの場合、人間が生成したコンテンツと区別できないようなテキストを生成することができる。本稿では,テキスト透かしをワンショットで解析する。
参考スコア（独自算出の注目度）: 8.585779208433465
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large-language models (LLMs) are now able to produce text that is, in many cases, seemingly indistinguishable from human-generated content. This has fueled the development of watermarks that imprint a ``signal'' in LLM-generated text with minimal perturbation of an LLM's output. This paper provides an analysis of text watermarking in a one-shot setting. Through the lens of hypothesis testing with side information, we formulate and analyze the fundamental trade-off between watermark detection power and distortion in generated textual quality. We argue that a key component in watermark design is generating a coupling between the side information shared with the watermark detector and a random partition of the LLM vocabulary. Our analysis identifies the optimal coupling and randomization strategy under the worst-case LLM next-token distribution that satisfies a min-entropy constraint. We provide a closed-form expression of the resulting detection rate under the proposed scheme and quantify the cost in a max-min sense. Finally, we provide an array of numerical results, comparing the proposed scheme with the theoretical optimum and existing schemes, in both synthetic data and LLM watermarking. Our code is available at https://github.com/Carol-Long/CC_Watermark
Abstract（参考訳）: 大規模言語モデル(LLM)は、多くの場合、人間が生成したコンテンツと区別できないようなテキストを生成することができる。これにより、LLMの出力を最小限の摂動で LLM 生成テキストに ``signal'' を印字する透かしの開発が加速された。本稿では,テキスト透かしをワンショットで解析する。副次的情報を用いた仮説テストのレンズを用いて,テキスト品質における透かし検出力と歪みの基本的なトレードオフを定式化し,解析する。透かし設計における重要な要素は、透かし検出器とLLM語彙のランダムな分割とで共有される側情報とを結合させることである。本分析では,最小エントロピー制約を満たす最短ケースLLM次トーケン分布下での最適結合とランダム化の戦略を明らかにした。提案手法で得られた検出率のクローズドフォーム表現を行い,最大値でコストを定量化する。最後に,提案手法と理論的最適スキームと既存スキームを比較し,合成データとLLM透かしの両方で数値的な結果を示す。私たちのコードはhttps://github.com/Carol-Long/CC_Watermarkで利用可能です。

論文の概要: Optimized Couplings for Watermarking Large Language Models

関連論文リスト