Fugu-MT 論文翻訳(概要): Authorship Impersonation via LLM Prompting does not Evade Authorship Verification Methods

論文の概要: Authorship Impersonation via LLM Prompting does not Evade Authorship Verification Methods

arxiv url: http://arxiv.org/abs/2603.29454v1
Date: Tue, 31 Mar 2026 08:59:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-01 15:25:03.375404
Title: Authorship Impersonation via LLM Prompting does not Evade Authorship Verification Methods
Title（参考訳）: LLM Promptingによるオーサリングの偽装は,オーサリングの検証方法に悪影響を及ぼさない
Authors: Baoyi Zeng, Andrea Nini,
Abstract要約: 著者の検証は法言語学の重要な部分である。大規模言語モデル(LLM)の最近の進歩は、新しい課題を提起している。本研究は, LLMが説得力のある権威的偽造を生成できるかどうかを考察する。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Authorship verification (AV), the task of determining whether a questioned text was written by a specific individual, is a critical part of forensic linguistics. While manual authorial impersonation by perpetrators has long been a recognized threat in historical forensic cases, recent advances in large language models (LLMs) raise new challenges, as adversaries may exploit these tools to impersonate another's writing. This study investigates whether prompted LLMs can generate convincing authorial impersonations and whether such outputs can evade existing forensic AV systems. Using GPT-4o as the adversary model, we generated impersonation texts under four prompting conditions across three genres: emails, text messages, and social media posts. We then evaluated these outputs against both non-neural AV methods (n-gram tracing, Ranking-Based Impostors Method, LambdaG) and neural approaches (AdHominem, LUAR, STAR) within a likelihood-ratio framework. Results show that LLM-generated texts failed to sufficiently replicate authorial individuality to bypass established AV systems. We also observed that some methods achieved even higher accuracy when rejecting impersonation texts compared to genuine negative samples. Overall, these findings indicate that, despite the accessibility of LLMs, current AV systems remain robust against entry-level impersonation attempts across multiple genres. Furthermore, we demonstrate that this counter-intuitive resilience stems, at least in part, from the higher lexical diversity and entropy inherent in LLM-generated texts.
Abstract（参考訳）: 著者検証(AV)とは、特定の個人によって書かれたテキストかどうかを判断するタスクであり、法医学的言語学の重要な部分である。加害者による手動の権威的偽装は、長い間、歴史的法医学的事件では認識されている脅威であったが、近年の大規模言語モデル(LLM)の進歩は、敵がこれらのツールを利用して他人の著作を偽装する可能性があるため、新たな課題を提起している。本研究では, LLMが説得力のある権威的偽造を生成できるか, 既存の法医学的AVシステムを回避することができるかを検討する。 GPT-4oを相手モデルとして,メール,テキストメッセージ,ソーシャルメディア投稿という3つのジャンルにまたがる4つの条件下で,偽造テキストを生成した。次に, ニューラルAV法 (n-gram Trace, Ranking-Based Impostors Method, LambdaG) とニューラルアプローチ (AdHominem, LUAR, STAR) の両手法を, 確率比で評価した。その結果、LLM生成したテキストは、確立されたAVシステムを回避するために、権威的個人性を十分に再現できなかった。また,偽造テキストを否定する手法が,真正の陰性サンプルよりも高い精度で達成されていることも見いだした。これらの結果は,LSMのアクセシビリティにもかかわらず,現在のAVシステムは,複数のジャンルにわたるエントリーレベルの偽造の試みに対して頑健であることを示している。さらに、この反直感的レジリエンスは、少なくとも部分的には、LLM生成テキストに固有の高い語彙的多様性とエントロピーから生じることを実証する。

論文の概要: Authorship Impersonation via LLM Prompting does not Evade Authorship Verification Methods

関連論文リスト