Fugu-MT 論文翻訳(概要): The Causally Emergent Alignment Hypothesis: Causal Emergence Aligns with and Predicts Final Reward in Reinforcement Learning Agents

論文の概要: The Causally Emergent Alignment Hypothesis: Causal Emergence Aligns with and Predicts Final Reward in Reinforcement Learning Agents

arxiv url: http://arxiv.org/abs/2605.06746v1
Date: Thu, 07 May 2026 15:00:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-11 19:43:38.507604
Title: The Causally Emergent Alignment Hypothesis: Causal Emergence Aligns with and Predicts Final Reward in Reinforcement Learning Agents
Title（参考訳）: 因果的創発的アライメント仮説:強化学習エージェントにおける因果的創発的アライメントと最終リワードの予測
Authors: Federico Pigozzi, Michael Levin,
Abstract要約: 因果的出現は、エージェントがその将来に固有の予測力を発揮する度合いを測定する。近年の発見によると、生物学的エージェントは、最小限のエージェントでさえ、新しい記憶を学習した後、因果関係の出現を増大させる。本研究では,ニューラルネットワークエージェントの環境条件に対する強化学習に着目した。
参考スコア（独自算出の注目度）: 1.0742675209112622
License: http://creativecommons.org/licenses/by/4.0/
Abstract: A hallmark of life on Earth is the ability of agents to exert causal power and be drivers of subsequent events. This is key to cognition at all scales. Causal emergence, measuring the degree to which an agent exerts unique predictive power on its future, is one consequence of causal power. Indeed, recent discoveries have shown that biological agents, even minimal ones, increase their causal emergence after learning new memories. However, there is a major knowledge gap regarding how causally emergent artificial agents are. We focused on Reinforcement Learning (RL) of neural-network agents across an array of environmental conditions, encompassing different algorithms, agent architectures, and six environments arranged on a complexity spectrum. For consistency, we computed the causal emergence of their latent-space representations over their lifetimes. We used the recently proposed ΦID to estimate causal emergence and tested how it related to learning performance. Our results suggested a Causally Emergent Alignment Hypothesis: successful agents exhibited causal emergence that was consistently predictive of final reward early in training and whose representational dynamics aligned with reward improvement in most tasks. This idea suggests that causal emergence may be a previously undisclosed axis of reorganization of neural representations in RL agents, with the potential to establish causal relationships and interventions that will lead to better RL agents. Our work also highlights the alignment between causal emergence and learning as another way biological and artificial creatures compare.
Abstract（参考訳）: 地球上の生命の目印は、エージェントが因果力を発揮し、その後の出来事の原動力となる能力である。これはあらゆる規模において認知の鍵です。因果的出現は、エージェントがその将来に固有の予測力を発揮する程度を測定するものであり、因果的力の結果である。事実、最近の発見では、生物学的エージェントは、最小限のエージェントでさえ、新しい記憶を学習した後、因果関係の出現を増大させる。しかし、因果的創発的な人工エージェントがどうあるかについては、大きな知識ギャップがある。我々は、さまざまなアルゴリズム、エージェントアーキテクチャ、複雑スペクトル上に配置された6つの環境を網羅した、ニューラルネットワークエージェントの強化学習(RL)に焦点を当てた。一貫性のために、我々は彼らの生涯における潜在空間表現の因果的出現を計算した。我々は、最近提案された sID を用いて因果関係の出現を推定し、それが学習性能とどのように関係しているかを検証した。本研究は,訓練の初期段階において最終報酬を常に予測し,ほとんどのタスクにおいて報酬改善に一致した表現力学を有する因果関係の出現を成功エージェントが示すことを示し,因果関係の仮説を提唱した。この考えは、因果発生は、RLエージェントにおける神経表現の再編成の以前には明らかにされていなかった軸であり、より優れたRLエージェントにつながる因果関係や介入を確立する可能性があることを示唆している。私たちの研究は、生物学的生物と人工生物の比較方法として、因果関係の出現と学習の整合性も強調しています。

論文の概要: The Causally Emergent Alignment Hypothesis: Causal Emergence Aligns with and Predicts Final Reward in Reinforcement Learning Agents

関連論文リスト