Fugu-MT 論文翻訳(概要): Cognitive Surgery: The Awakening of Implicit Territorial Awareness in LLMs

論文の概要: Cognitive Surgery: The Awakening of Implicit Territorial Awareness in LLMs

arxiv url: http://arxiv.org/abs/2508.14408v1
Date: Wed, 20 Aug 2025 04:08:18 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-21 16:52:41.334935
Title: Cognitive Surgery: The Awakening of Implicit Territorial Awareness in LLMs
Title（参考訳）: 認知外科 : LLMにおける意図しない領域認識の覚醒
Authors: Yinghan Zhou, Weifeng Zhu, Juan Wen, Wanli Peng, Zhengxian Wu, Yiming Xue,
Abstract要約: 我々は,大規模言語モデル (LLM) が,個人表現パラダイム (IPP) に基づく自己生成テキストと他の生成テキストとの区別に苦慮していることを示す。次に、この失敗の原因を調査し、Implicit Territorial Awareness (ITA) という現象とみなす。本研究では, LLM の ITA を実現するために, 表現抽出, 領域構築, 著者識別, 認知編集の4つの主要モジュールからなる新しいフレームワークである認知手術(CoSur)を提案する。
参考スコア（独自算出の注目度）: 11.274791012904263
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) have been shown to possess a degree of self-recognition capability-the ability to identify whether a given text was generated by themselves. Prior work has demonstrated that this capability is reliably expressed under the Pair Presentation Paradigm (PPP), where the model is presented with two texts and asked to choose which one it authored. However, performance deteriorates sharply under the Individual Presentation Paradigm (IPP), where the model is given a single text to judge authorship. Although this phenomenon has been observed, its underlying causes have not been systematically analyzed. In this paper, we first replicate existing findings to confirm that LLMs struggle to distinguish self- from other-generated text under IPP. We then investigate the reasons for this failure and attribute it to a phenomenon we term Implicit Territorial Awareness (ITA)-the model's latent ability to distinguish self- and other-texts in representational space, which remains unexpressed in its output behavior. To awaken the ITA of LLMs, we propose Cognitive Surgery (CoSur), a novel framework comprising four main modules: representation extraction, territory construction, authorship discrimination and cognitive editing. Experimental results demonstrate that our proposed method improves the performance of three different LLMs in the IPP scenario, achieving average accuracies of 83.25%, 66.19%, and 88.01%, respectively.
Abstract（参考訳）: 大規模言語モデル(LLM)は、与えられたテキストが自分自身によって生成されたかどうかを識別する能力である、ある程度の自己認識能力を持っていることが示されている。以前の研究では、この能力はPair Presentation Paradigm (PPP)の下で確実に表現され、そこでモデルは2つのテキストで示され、どれを作成したかを選択するように求められていた。しかし、個人プレゼンテーションパラダイム(IPP)では、著者を判断する単一テキストが与えられるため、パフォーマンスが急激に悪化する。この現象は観測されているが、その根本原因は体系的に分析されていない。本稿では,IPP 下での自己生成テキストと自己生成テキストの区別に LLM が苦労していることを確認するために,既存の知見を最初に複製した。次に、この失敗の原因を調査し、モデルが表現空間における自己および他文を識別する潜在能力を示す現象をImplicit Territorial Awareness (ITA) という現象とみなす。本研究では, LLM の ITA を実現するために, 表現抽出, 領域構築, 著者識別, 認知編集の4つの主要モジュールからなる新しいフレームワークである認知手術(CoSur)を提案する。実験の結果,提案手法はIPPシナリオにおける3種類のLCMの性能を改善し,平均精度は83.25%,66.19%,88.01%であった。

論文の概要: Cognitive Surgery: The Awakening of Implicit Territorial Awareness in LLMs

関連論文リスト