Fugu-MT 論文翻訳(概要): Machine Text Detectors are Membership Inference Attacks

論文の概要: Machine Text Detectors are Membership Inference Attacks

arxiv url: http://arxiv.org/abs/2510.19492v1
Date: Wed, 22 Oct 2025 11:39:01 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 03:08:15.743649
Title: Machine Text Detectors are Membership Inference Attacks
Title（参考訳）: 機械テキスト検出器はメンバーシップ推論攻撃である
Authors: Ryuto Koike, Liam Dugan, Masahiro Kaneko, Chris Callison-Burch, Naoaki Okazaki,
Abstract要約: 我々は,あるタスクに対してもともと開発された手法が,他方でどのように機能するかを理論的,実証的に検討する。 7つの最先端MIA法と5つの最先端マシンテキスト検出器を含む大規模実験は、クロスタスク性能において非常に強いランク相関(rho >)を示す。この結果から,両研究コミュニティ間のクロスタスク意識とコラボレーションの必要性が示唆された。
参考スコア（独自算出の注目度）: 55.07733196689313
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Although membership inference attacks (MIAs) and machine-generated text detection target different goals, identifying training samples and synthetic texts, their methods often exploit similar signals based on a language model's probability distribution. Despite this shared methodological foundation, the two tasks have been independently studied, which may lead to conclusions that overlook stronger methods and valuable insights developed in the other task. In this work, we theoretically and empirically investigate the transferability, i.e., how well a method originally developed for one task performs on the other, between MIAs and machine text detection. For our theoretical contribution, we prove that the metric that achieves the asymptotically highest performance on both tasks is the same. We unify a large proportion of the existing literature in the context of this optimal metric and hypothesize that the accuracy with which a given method approximates this metric is directly correlated with its transferability. Our large-scale empirical experiments, including 7 state-of-the-art MIA methods and 5 state-of-the-art machine text detectors across 13 domains and 10 generators, demonstrate very strong rank correlation (rho > 0.6) in cross-task performance. We notably find that Binoculars, originally designed for machine text detection, achieves state-of-the-art performance on MIA benchmarks as well, demonstrating the practical impact of the transferability. Our findings highlight the need for greater cross-task awareness and collaboration between the two research communities. To facilitate cross-task developments and fair evaluations, we introduce MINT, a unified evaluation suite for MIAs and machine-generated text detection, with implementation of 15 recent methods from both tasks.
Abstract（参考訳）: メンバシップ推論攻撃(MIA)と機械生成テキスト検出は異なる目標を目標とし、トレーニングサンプルと合成テキストを識別するが、それらの手法は言語モデルの確率分布に基づいた類似した信号を利用することが多い。この共通方法論の基礎にもかかわらず、この2つのタスクは独立して研究され、これは、他のタスクで開発されたより強力な方法や価値ある洞察を見落としている結論につながる可能性がある。本研究は,MIAと機械テキスト検出の間において,一方のタスクに対して開発された手法が他方のタスクに対してどの程度うまく機能するかを理論的,実証的に検討する。理論的貢献のために、両課題における漸近的に高い性能を達成する計量は同じであることを示す。我々は、この最適計量の文脈において、既存の文献の大部分を統一し、与えられたメソッドがこの計量に近似した精度が、その伝達可能性と直接相関していると仮定する。 7つの最先端MIA法と、13のドメインと10のジェネレータにまたがる5つの最先端マシンテキスト検出器を含む大規模な実験は、クロスタスク性能において非常に強いランク相関(rho > 0.6)を示す。特に,Binocularsはもともと機械テキスト検出用に設計されており,MIAベンチマーク上での最先端性能を実現し,転送可能性の実践的影響を実証している。この結果から,両研究コミュニティ間のクロスタスク意識とコラボレーションの必要性が示唆された。クロスタスク開発と公平な評価を容易にするため,MIAと機械生成テキスト検出のための統合評価スイートであるMINTを導入し,両タスクから15の手法を実装した。

論文の概要: Machine Text Detectors are Membership Inference Attacks

関連論文リスト