Fugu-MT 論文翻訳(概要): Luminol-AIDetect: Fast Zero-shot Machine-Generated Text Detection based on Perplexity under Text Shuffling

論文の概要: Luminol-AIDetect: Fast Zero-shot Machine-Generated Text Detection based on Perplexity under Text Shuffling

arxiv url: http://arxiv.org/abs/2604.25860v1
Date: Tue, 28 Apr 2026 16:58:55 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-29 16:49:17.965898
Title: Luminol-AIDetect: Fast Zero-shot Machine-Generated Text Detection based on Perplexity under Text Shuffling
Title（参考訳）: Luminol-AIDetect:テキストシャッフル下でのパープレキシティに基づく高速ゼロショットマシン生成テキスト検出
Authors: Lucio La Cava, Andrea Tagarelli,
Abstract要約: 我々は,機械生成テキスト(MGT)検出のための新しいゼロショット統計手法であるLuminol-AIDetectを提案する。単純なランダム化テキストシャッフル手法を適用することで、結果として生じるパープレキシティの変化が、原則的、モデルに依存しない識別要因となることを示す。我々は,Luminol-AIDetectが最先端性能を示し,FPRの最大17倍の低下を示し,従来の手法よりも安価であることを示した。
参考スコア（独自算出の注目度）: 9.241565393225953
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Machine-generated text (MGT) detection requires identifying structurally invariant signals across generation models, rather than relying on model-specific fingerprints. In this respect, we hypothesize that while large language models excel at local semantic consistency, their autoregressive nature results in a specific kind of structural fragility compared to human writing. We propose Luminol-AIDetect, a novel, zero-shot statistical approach that exposes this fragility through coherence disruption. By applying a simple randomized text-shuffling procedure, we demonstrate that the resulting shift in perplexity serves as a principled, model-agnostic discriminant, as MGT displays a characteristic dispersion in perplexity-under-shuffling that differs markedly from the more stable structural variability of human-written text. Luminol-AIDetect leverages this distinction to inform its decision process, where a handful of perplexity-based scalar features are extracted from an input text and its shuffled version, then detection is performed via density estimation and ensemble-based prediction. Evaluated across 8 content domains, 11 adversarial attack types, and 18 languages, Luminol-AIDetect demonstrates state-of-the-art performance, with gains up to 17x lower FPR while being cheaper than prior methods.
Abstract（参考訳）: 機械生成テキスト(MGT)検出は、モデル固有の指紋に頼るのではなく、世代モデル間で構造的に不変な信号を特定する必要がある。この観点から,大規模言語モデルは局所的な意味的一貫性に優れるが,その自己回帰的な性質は,人間の記述と比較して特定の構造的脆弱性をもたらすと仮定する。我々は,コヒーレンス・ディスラプションによるこの脆弱性を明らかにする新しいゼロショット統計手法であるLuminol-AIDetectを提案する。単純なランダム化テキストシャッフル手法を適用することで、MGTは人間のテキストのより安定した構造的変動と著しく異なるパープレキシティ・アンダーシャッフルの特徴的な分散を示すため、結果として生じるパープレキシティのシフトが、原則的、モデルに依存しない識別要因となることを示した。 Luminol-AIDetectはこの区別を利用し、入力テキストとそのシャッフルバージョンからわずかなパープレキシティベースのスカラー特徴を抽出し、密度推定とアンサンブルベースの予測によって検出を行う。 8つのコンテンツドメイン、11の逆攻撃タイプ、および18の言語で評価され、Luminol-AIDetectは最先端のパフォーマンスを示し、従来の方法よりも17倍低いFPRが得られた。

論文の概要: Luminol-AIDetect: Fast Zero-shot Machine-Generated Text Detection based on Perplexity under Text Shuffling

関連論文リスト