Fugu-MT paper translation (abstract): Invisible Entropy: Towards Safe and Efficient Low-Entropy LLM Watermarking

Paper abstract: Invisible Entropy: Towards Safe and Efficient Low-Entropy LLM Watermarking

arxiv url: http://arxiv.org/abs/2505.14112v1
Date: Tue, 20 May 2025 09:19:06 GMT
Status: translation complete
System update date: 2025-05-21 14:49:52.961417
Title: Invisible Entropy: Towards Safe and Efficient Low-Entropy LLM Watermarking
Authors: Tianle Gu, Zongqi Wang, Kexin Huang, Yuanqi Yao, Xiangliang Zhang, Yujiu Yang, Xiuying Chen
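Abstract summary: Invisible Entropy (IE) is a watermarking paradigm designed to enhance both safety and efficiency. IE reduces parameter size by 99% while achieving performance on par with state-of-the-art methods.
Reference score (independently computed attention measure): 48.26359966929394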
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Logit-based LLM watermarking traces and verifies AI-generated content by maintaining green and red token lists and increasing the likelihood of green tokens during generation. However, it fails in low-entropy scenarios, where predictable outputs make green token selection difficult without disrupting natural text flow. Existing approaches address this by assuming access to the original LLM to calculate entropy and selectively watermark high-entropy tokens. However, these methods face two major challenges: (1) high computational costs and detection delays due to reliance on the original LLM, and (2) potential risks of model leakage. To address these limitations, we propose Invisible Entropy (IE), a watermarking paradigm designed to enhance both safety and efficiency. Instead of relying on the original LLM, IE introduces a lightweight feature extractor and an entropy tagger to predict whether the entropy of the next token is high or low. Furthermore, based on theoretical analysis, we develop a threshold navigator that adaptively sets entropy thresholds. It identifies a threshold where the watermark ratio decreases as the green token count increases, enhancing the naturalness of the watermarked text and improving detection robustness. Experiments on HumanEval and MBPP datasets demonstrate that IE reduces parameter size by 99% while achieving performance on par with state-of-the-art methods. Our work introduces a safe and efficient paradigm for low-entropy watermarking. https://github.com/Carol-gutianle/IE https://huggingface.co/datasets/Carol0110/IE-Tagger
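
The mechanism the abstract describes (green/red token lists, a logit boost for green tokens, and skipping low-entropy positions) can be illustrated with a minimal Python sketch. This is not the authors' implementation: every function name, the hash-based seeding, and the parameters `gamma` (green-list fraction), `delta` (logit boost), and `tau` (entropy threshold) are assumptions chosen for illustration. The gate below computes entropy from the full next-token distribution, which is the approach the abstract attributes to existing methods; IE instead predicts a high/low entropy label with a lightweight tagger, and that component is not sketched here.

```python
import hashlib
import math
import random


def green_list(prev_token, vocab_size, gamma=0.5):
    """Pick a pseudo-random 'green' subset of the vocabulary, seeded by the previous token."""
    digest = hashlib.sha256(str(prev_token).encode()).digest()
    rng = random.Random(int.from_bytes(digest[:8], "big"))
    return set(rng.sample(range(vocab_size), int(gamma * vocab_size)))


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def watermark_logits(logits, prev_token, delta=2.0, tau=1.0, gamma=0.5):
    """Boost green-token logits by delta, but only at high-entropy positions.

    Returns (new_logits, was_watermarked). Positions whose entropy is below
    tau are left untouched so predictable tokens are not disturbed.
    """
    if entropy(softmax(logits)) < tau:
        return list(logits), False
    greens = green_list(prev_token, len(logits), gamma)
    boosted = [x + delta if i in greens else x for i, x in enumerate(logits)]
    return boosted, True


def z_score(green_hits, scored_tokens, gamma=0.5):
    """One-proportion z-test: are green tokens over-represented among scored tokens?"""
    if scored_tokens == 0:
        return 0.0
    expected = gamma * scored_tokens
    return (green_hits - expected) / math.sqrt(gamma * (1 - gamma) * scored_tokens)
```

In this sketch a detector would count green hits only at positions it judges to be high-entropy and then compare `z_score` against a chosen cutoff. Deciding which positions count is exactly where access to the original LLM, or a substitute such as the tagger proposed in the abstract, becomes necessary.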

Related papers