Fugu-MT 論文翻訳(概要): Evaluating the Performance of AI Text Detectors, Few-Shot and Chain-of-Thought Prompting Using DeepSeek Generated Text

論文の概要: Evaluating the Performance of AI Text Detectors, Few-Shot and Chain-of-Thought Prompting Using DeepSeek Generated Text

arxiv url: http://arxiv.org/abs/2507.17944v1
Date: Wed, 23 Jul 2025 21:26:33 GMT
ステータス: 翻訳完了
システム内更新日: 2025-07-25 15:10:42.625442
Title: Evaluating the Performance of AI Text Detectors, Few-Shot and Chain-of-Thought Prompting Using DeepSeek Generated Text
Title（参考訳）: ディープシーク生成テキストを用いたAIテキスト検出器,Few-Shot,Chain-of-Thought Promptingの性能評価
Authors: Hulayyil Alshammari, Praveen Rao,
Abstract要約: 標準および人為的パラフレージングのようなアドリバーサ攻撃は、検出者がテキストを検出する能力を阻害する。我々は、DeepSeekが生成したテキストを、一般的なAI Text、Content Detector AI、Copyleaks、QuillBot、GPT-2、GPTZeroの6つが一貫して認識できるかどうかを調査する。
参考スコア（独自算出の注目度）: 2.942616054218564
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) have rapidly transformed the creation of written materials. LLMs have led to questions about writing integrity, thereby driving the creation of artificial intelligence (AI) detection technologies. Adversarial attacks, such as standard and humanized paraphrasing, inhibit detectors' ability to detect machine-generated text. Previous studies have mainly focused on ChatGPT and other well-known LLMs and have shown varying accuracy across detectors. However, there is a clear gap in the literature about DeepSeek, a recently published LLM. Therefore, in this work, we investigate whether six generally accessible AI detection tools -- AI Text Classifier, Content Detector AI, Copyleaks, QuillBot, GPT-2, and GPTZero -- can consistently recognize text generated by DeepSeek. The detectors were exposed to the aforementioned adversarial attacks. We also considered DeepSeek as a detector by performing few-shot prompting and chain-of-thought reasoning (CoT) for classifying AI and human-written text. We collected 49 human-authored question-answer pairs from before the LLM era and generated matching responses using DeepSeek-v3, producing 49 AI-generated samples. Then, we applied adversarial techniques such as paraphrasing and humanizing to add 196 more samples. These were used to challenge detector robustness and assess accuracy impact. While QuillBot and Copyleaks showed near-perfect performance on original and paraphrased DeepSeek text, others -- particularly AI Text Classifier and GPT-2 -- showed inconsistent results. The most effective attack was humanization, reducing accuracy to 71% for Copyleaks, 58% for QuillBot, and 52% for GPTZero. Few-shot and CoT prompting showed high accuracy, with the best five-shot result misclassifying only one of 49 samples (AI recall 96%, human recall 100%).
Abstract（参考訳）: 大規模言語モデル(LLM)は、書物の作成を急速に変化させてきた。 LLMは、整合性の記述に関する疑問を引き起こし、人工知能(AI)検出技術の開発を推進している。標準的なパラフレーズや人為的なパラフレーズのような敵攻撃は、検知器が機械生成テキストを検出する能力を阻害する。従来の研究は主にChatGPTや他のよく知られたLCMに焦点を合わせており、検出器間での精度が変化している。しかし、最近出版されたLLMであるDeepSeekに関する文献には明らかなギャップがある。そこで本研究では,AI Text Classifier, Content Detector AI, Copyleaks, QuillBot, GPT-2, GPTZeroの6つの一般的なAI検出ツールが,DeepSeekが生成したテキストを一貫して認識できるかどうかを検討する。検出器は前述の敵の攻撃にさらされた。私たちはDeepSeekを、AIと人文テキストを分類するための、数発のプロンプトとチェーン・オブ・シークレット推論(CoT)を実行することで、検出対象とみなした。我々は,LLM時代以前の質問応答対49点を収集し,DeepSeek-v3を用いてマッチング応答を生成し,49個のAI生成サンプルを生成した。次に, パラフレージングやヒューマライゼーションといった対人的手法を適用し, 196個のサンプルを加味した。これらは検出器の堅牢性に挑戦し、精度への影響を評価するために使用された。 QuillBotとCopyleaksはオリジナルおよびパラフレーズのDeepSeekテキストでほぼ完璧なパフォーマンスを示したが、AI Text ClassifierとGPT-2は一貫性のない結果を示した。最も効果的な攻撃は人間化であり、Copyleaksは71%、QuillBotは58%、GPTZeroは52%に精度が低下した。 Few-shotとCoTのプロンプトは精度が高く、ベスト5ショットの結果は49のサンプルのうち1つだけを誤分類した(AIリコールは96%、人間リコールは100%)。

論文の概要: Evaluating the Performance of AI Text Detectors, Few-Shot and Chain-of-Thought Prompting Using DeepSeek Generated Text

関連論文リスト