Fugu-MT 論文翻訳(概要): Cross-Prompt Generalization in Detecting AI-Generated Fake News Using Interpretable Linguistic Features

論文の概要: Cross-Prompt Generalization in Detecting AI-Generated Fake News Using Interpretable Linguistic Features

arxiv url: http://arxiv.org/abs/2606.04199v1
Date: Tue, 02 Jun 2026 20:33:56 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-04 20:44:18.375074
Title: Cross-Prompt Generalization in Detecting AI-Generated Fake News Using Interpretable Linguistic Features
Title（参考訳）: 解釈可能な言語特徴を用いたAI生成フェイクニュースの検出におけるクロスプロンプト一般化
Authors: Aya Vera-Jimenez, Samuel Jaeger, Calvin Ibenye, Dhrubajyoti Ghosh,
Abstract要約: 本稿では,異なるプロンプトで生成されたAI生成記事の3つのデータセットを用いて,偽ニュース検出におけるクロスプロンプト一般化について検討する。語彙の多様性,可読性,感情に基づく特徴を捉えた解釈可能な言語特徴を抽出する。特徴分布の分析により、AI生成したテキストは語彙の多様性を高め、読みやすさを低下させ、全体のデータセットと比較して感情的な強度を大幅に低下させることが示された。
参考スコア（独自算出の注目度）: 0.13999481573773068
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: The increasing use of large language models has raised concerns about the spread of AI-generated fake news, particularly under varying prompting strategies. Most existing detection models are trained and evaluated under a single generation setting, leaving their ability to generalize across unseen prompts unclear. In this study, we investigate cross-prompt generalization in fake news detection using three datasets of AI-generated articles produced under distinct prompts, combined with real news articles. We extract interpretable linguistic features capturing lexical diversity, readability, and emotion-based characteristics and evaluate a random forest classifier under a cross-prompt framework, where models trained on one prompt are tested on another. Across all six train-test combinations, performance remains consistently high, with AUC values ranging from 0.988 to 1.000. Analysis of feature distributions shows that AI-generated text exhibits increased lexical diversity, reduced readability, and substantially lower emotional intensity compared to the overall dataset, with variations across prompts. Despite these distributional shifts, the classifier maintains strong performance, indicating that these features capture stable properties of AI-generated text that generalize across prompting strategies. These findings suggest that feature-based approaches can provide robust detection of AI-generated fake news under prompt variability.
Abstract（参考訳）: 大規模な言語モデルの使用の増加は、AI生成のフェイクニュースの拡散、特に様々なプロンプト戦略の下での懸念を引き起こしている。既存の検出モデルは、単一の世代設定でトレーニングされ、評価され、目に見えないプロンプトをまたいで一般化する能力は不明確のままである。本研究では,異なるプロンプトで生成されたAI生成記事の3つのデータセットと実ニュース記事を組み合わせた偽ニュース検出におけるクロスプロンプト一般化について検討した。我々は、語彙の多様性、可読性、感情に基づく特徴を抽出した解釈可能な言語特徴を抽出し、クロスプロンプトの枠組みの下でランダムな森林分類器を評価し、あるプロンプトで訓練されたモデルを別のプロンプトでテストする。 6つの列車とテストの組合せの中で、AUCの値は0.988から1.000まで、常に高い性能を保っている。特徴分布の分析により、AI生成したテキストは、レキシカルな多様性の増加、可読性の低下、全体のデータセットと比較して感情的な強度の低下を示す。これらの分散シフトにもかかわらず、分類器は強力な性能を維持しており、これらの特徴は、プロンプト戦略をまたいだ一般化されたAI生成テキストの安定した特性を捉えていることを示している。これらの結果は、特徴に基づくアプローチが、即時変動の下でAIが生成した偽ニュースを堅牢に検出することを可能にすることを示唆している。

論文の概要: Cross-Prompt Generalization in Detecting AI-Generated Fake News Using Interpretable Linguistic Features

関連論文リスト