Fugu-MT 論文翻訳(概要): Decoding AI Authorship: Can LLMs Truly Mimic Human Style Across Literature and Politics?

論文の概要: Decoding AI Authorship: Can LLMs Truly Mimic Human Style Across Literature and Politics?

arxiv url: http://arxiv.org/abs/2603.23219v1
Date: Tue, 24 Mar 2026 13:58:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-25 19:53:37.51226
Title: Decoding AI Authorship: Can LLMs Truly Mimic Human Style Across Literature and Politics?
Title（参考訳）: AIオーサリングをデコードする - LLMは文学と政治にまたがって、真にミミックな人間スタイルを実現できるか?
Authors: Nasser A Alsadhan,
Abstract要約: 本研究では,現在最先端の大規模言語モデル(LLM)が,著名な文学的・政治的人物の著作的署名をエミュレートする能力について検討する。我々は、変換器に基づく分類(BERT)と解釈可能な機械学習(XGBoost)を組み合わせた補完的枠組みを用いて評価した合成コーパスを作成した。提案手法は,AI生成テキストと人間認証テキストの相違を評価するために,言語問合せ,単語数マーカー,難易度,可読性指標を統合した。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Amidst the rising capabilities of generative AI to mimic specific human styles, this study investigates the ability of state-of-the-art large language models (LLMs), including GPT-4o, Gemini 1.5 Pro, and Claude Sonnet 3.5, to emulate the authorial signatures of prominent literary and political figures: Walt Whitman, William Wordsworth, Donald Trump, and Barack Obama. Utilizing a zero-shot prompting framework with strict thematic alignment, we generated synthetic corpora evaluated through a complementary framework combining transformer-based classification (BERT) and interpretable machine learning (XGBoost). Our methodology integrates Linguistic Inquiry and Word Count (LIWC) markers, perplexity, and readability indices to assess the divergence between AI-generated and human-authored text. Results demonstrate that AI-generated mimicry remains highly detectable, with XGBoost models trained on a restricted set of eight stylometric features achieving accuracy comparable to high-dimensional neural classifiers. Feature importance analyses identify perplexity as the primary discriminative metric, revealing a significant divergence in the stochastic regularity of AI outputs compared to the higher variability of human writing. While LLMs exhibit distributional convergence with human authors on low-dimensional heuristic features, such as syntactic complexity and readability, they do not yet fully replicate the nuanced affective density and stylistic variance inherent in the human-authored corpus. By isolating the specific statistical gaps in current generative mimicry, this study provides a comprehensive benchmark for LLM stylistic behavior and offers critical insights for authorship attribution in the digital humanities and social media.
Abstract（参考訳）: GPT-4o、Gemini 1.5 Pro、Claude Sonnet 3.5といった最先端の大規模言語モデル(LLM)が、文学や政治の著名な人物の権威的な署名を模倣する能力について調査する。厳密なセマンティックアライメントを備えたゼロショットプロンプトフレームワークを用いて,変換器ベース分類(BERT)と解釈可能な機械学習(XGBoost)を組み合わせた補完的フレームワークを用いて評価した合成コーパスを作成した。提案手法では,言語問合せと単語数(LIWC)マーカー,パープレキシティ,可読性指標を統合し,AI生成テキストと人間によるテキストの相違を評価する。その結果、XGBoostモデルは、高次元のニューラル分類器に匹敵する精度を達成し、8つのスタイル的特徴の限定セットに基づいて訓練されている。特徴的重要度分析は,AI出力の確率的正則性において,人間の文章の多様性よりも有意なばらつきを示す主要な識別指標としてパープレキシティを同定する。 LLMは、構文的複雑性や可読性などの低次元ヒューリスティックな特徴について、ヒトの著者と分布的収束を示すが、ヒトが認可したコーパスに固有のニュアンスな感情密度とスタイル的差異を完全には再現していない。本研究は,現生模倣における特定の統計的ギャップを分離することにより,LLMのスタイリスティックな行動に関する総合的なベンチマークを提供し,デジタル人文科学やソーシャルメディアにおける著者帰属に関する重要な知見を提供する。

論文の概要: Decoding AI Authorship: Can LLMs Truly Mimic Human Style Across Literature and Politics?

関連論文リスト