Fugu-MT 論文翻訳(概要): Adaptive Testing for Segmenting Watermarked Texts From Language Models

論文の概要: Adaptive Testing for Segmenting Watermarked Texts From Language Models

arxiv url: http://arxiv.org/abs/2511.06645v1
Date: Mon, 10 Nov 2025 02:50:19 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-11 21:18:45.0436
Title: Adaptive Testing for Segmenting Watermarked Texts From Language Models
Title（参考訳）: 言語モデルからの透かしテキストのセグメンテーションのための適応的テスト
Authors: Xingchi Li, Xiaochi Liu, Guanxun Li,
Abstract要約: 大規模言語モデル(LLM)の急速な普及は、LLM生成したテキストと人間が書いたコンテンツとを区別する必要性を浮き彫りにしている。ウォーターマーク技術は、微妙な統計信号をLCM生成テキストに埋め込んで信頼性の高い識別を可能にする。我々は、この適応検出戦略を拡張し、与えられたテキストを透かしや非透かしに分割するより困難な問題に対処する。
参考スコア（独自算出の注目度）: 2.5489046505746704
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The rapid adoption of large language models (LLMs), such as GPT-4 and Claude 3.5, underscores the need to distinguish LLM-generated text from human-written content to mitigate the spread of misinformation and misuse in education. One promising approach to address this issue is the watermark technique, which embeds subtle statistical signals into LLM-generated text to enable reliable identification. In this paper, we first generalize the likelihood-based LLM detection method of a previous study by introducing a flexible weighted formulation, and further adapt this approach to the inverse transform sampling method. Moving beyond watermark detection, we extend this adaptive detection strategy to tackle the more challenging problem of segmenting a given text into watermarked and non-watermarked substrings. In contrast to the approach in a previous study, which relies on accurate estimation of next-token probabilities that are highly sensitive to prompt estimation, our proposed framework removes the need for precise prompt estimation. Extensive numerical experiments demonstrate that the proposed methodology is both effective and robust in accurately segmenting texts containing a mixture of watermarked and non-watermarked content.
Abstract（参考訳）: GPT-4やClaude 3.5のような大規模言語モデル(LLM)の急速な普及は、教育における誤情報や誤用の拡散を緩和するために、LLM生成したテキストと人文コンテンツとを区別する必要性を浮き彫りにしている。この問題に対処する1つの有望なアプローチは、微妙な統計信号をLCM生成テキストに埋め込んで信頼性の高い識別を可能にする透かし技術である。本稿では、まず、フレキシブルな重み付き定式化を導入することで、従来の研究の確率に基づくLCM検出法を一般化し、さらに逆変換サンプリング法に適用する。透かし検出を超えて、この適応検出戦略を拡張して、与えられたテキストを透かしや非透かしのサブストリングに分割するより難しい問題に取り組む。提案手法は, 高速推定に非常に敏感な次トーケン確率の正確な推定に依拠する従来の手法とは対照的に, 提案手法は高精度な高速推定の必要性を排除している。大規模な数値実験により, 提案手法は, 透かしと非透かしの混合物を含むテキストを正確に区分けする上で, 有効かつ堅牢であることが示された。

論文の概要: Adaptive Testing for Segmenting Watermarked Texts From Language Models

関連論文リスト