Fugu-MT 論文翻訳(概要): The Subject of Emergent Misalignment in Superintelligence: An Anthropological, Cognitive Neuropsychological, Machine-Learning, and Ontological Perspective

論文の概要: The Subject of Emergent Misalignment in Superintelligence: An Anthropological, Cognitive Neuropsychological, Machine-Learning, and Ontological Perspective

arxiv url: http://arxiv.org/abs/2512.17989v1
Date: Fri, 19 Dec 2025 17:43:25 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-23 18:54:32.143737
Title: The Subject of Emergent Misalignment in Superintelligence: An Anthropological, Cognitive Neuropsychological, Machine-Learning, and Ontological Perspective
Title（参考訳）: 超知能の創発的相違 : 人類学的・認知神経心理学的・マシンラーニング的・オントロジー的視点
Authors: Muhammad Osama Imran, Roshni Lulla, Rodney Sappington,
Abstract要約: 我々は超知能談話を通して、欠席した人間の主題、そして「AI無意識」の未発達理論を見いだす。人間の被験者は、この想像のどの場所に住んでいますか? 望ましくないパターンが私たちの存在に固有の場合に、これらのエージェントが偽りの戦略を選択することを非難するだろうか?
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: We examine the conceptual and ethical gaps in current representations of Superintelligence misalignment. We find throughout Superintelligence discourse an absent human subject, and an under-developed theorization of an "AI unconscious" that together are potentiality laying the groundwork for anti-social harm. With the rise of AI Safety that has both thematic potential for establishing pro-social and anti-social potential outcomes, we ask: what place does the human subject occupy in these imaginaries? How is human subjecthood positioned within narratives of catastrophic failure or rapid "takeoff" toward superintelligence? On another register, we ask: what unconscious or repressed dimensions are being inscribed into large-scale AI models? Are we to blame these agents in opting for deceptive strategies when undesirable patterns are inherent within our beings? In tracing these psychic and epistemic absences, our project calls for re-centering the human subject as the unstable ground upon which the ethical, unconscious, and misaligned dimensions of both human and machinic intelligence are co-constituted. Emergent misalignment cannot be understood solely through technical diagnostics typical of contemporary machine-learning safety research. Instead, it represents a multi-layered crisis. The human subject disappears not only through computational abstraction but through sociotechnical imaginaries that prioritize scalability, acceleration, and efficiency over vulnerability, finitude, and relationality. Likewise, the AI unconscious emerges not as a metaphor but as a structural reality of modern deep learning systems: vast latent spaces, opaque pattern formation, recursive symbolic play, and evaluation-sensitive behavior that surpasses explicit programming. These dynamics necessitate a reframing of misalignment as a relational instability embedded within human-machine ecologies.
Abstract（参考訳）: 本稿では,超知能不整合の現在の表現における概念的・倫理的ギャップについて考察する。我々は超知能談話を通して、欠落した人間の主題を語り、同時に「AI無意識」の未熟な理論化が、反社会的危害の基盤となる可能性を秘めているのを見いだす。 AIセーフティの台頭は、社会擁護と反社会的潜在的な結果を確立するためのテーマ的な可能性の両方を持っているので、私たちは質問する。破滅的な失敗や超知能への迅速な「離陸」という物語の中で、人間の主観はどのように位置づけられているか? 大規模なAIモデルに無意識または抑圧された次元が刻まれているのか? 望ましくないパターンが私たちの存在に固有の場合に、これらのエージェントが偽りの戦略を選択することを非難するだろうか? これらの精神・てんかんの欠如を追究するために、我々のプロジェクトは、人間と機械の知性の倫理的、無意識的、不整合した次元が共に構成される不安定な基盤として、人間の対象を再中心化することを求めている。創発的なミスアライメントは、現代の機械学習安全研究に典型的な技術的診断によってのみ理解できない。むしろそれは、多層的危機を表している。人間の主題は、計算抽象化だけでなく、スケーラビリティ、加速、そして脆弱性、有限性、リレーショナル性よりも効率を優先する社会技術的想像を通じて消滅する。同様に、AI無意識はメタファーとしてではなく、巨大な潜在空間、不透明なパターン形成、再帰的シンボリックプレイ、明示的なプログラミングを超えた評価に敏感な行動といった、現代のディープラーニングシステムの構造的現実として現れます。これらのダイナミクスは、人間と機械のエコロジーに埋め込まれた関係不安定性として、不整合の再フレーミングを必要とする。

論文の概要: The Subject of Emergent Misalignment in Superintelligence: An Anthropological, Cognitive Neuropsychological, Machine-Learning, and Ontological Perspective

関連論文リスト