Fugu-MT 論文翻訳(概要): Toward AI That Understands Self and Others: A World-Model Theory of Cognitive Diversity and Alignment

論文の概要: Toward AI That Understands Self and Others: A World-Model Theory of Cognitive Diversity and Alignment

arxiv url: http://arxiv.org/abs/2605.29930v2
Date: Tue, 02 Jun 2026 04:31:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-03 18:57:50.163989
Title: Toward AI That Understands Self and Others: A World-Model Theory of Cognitive Diversity and Alignment
Title（参考訳）: 自己や他者を理解するAIに向けて:認知多様性とアライメントの世界モデル理論
Authors: Toru Takahashi,
Abstract要約: 本稿は、すでに不一致は後期現象であると主張している。中心的な前提は単純だが自明ではない。本稿では,認知の多様性とアライメントに関する世界モデル理論を開発する。
参考スコア（独自算出の注目度）: 0.6768558752130311
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Modern societies possess more information than ever before, yet they do not converge toward a single shared understanding. The same events, facts, laws, technologies, or risks can be interpreted as evidence of freedom, danger, exclusion, injustice, responsibility, or unrealized possibility. Existing discussions often treat such disagreement as a conflict of values, preferences, or beliefs. This paper argues that disagreement is already a late-stage phenomenon. The central premise is simple but not trivial: observation is not yet inference. Not every observation becomes inferentially relevant, and not every possible object in an observation sequence becomes an estimation target. A possible target becomes admissible only when a state representation can be constructed that is approximately sufficient for prediction, evaluation, or action with respect to that target. This paper develops a world-model theory of cognitive diversity and alignment by reconstructing recognition as the construction of such approximate sufficient statistics under finite informational, representational, observational, and action constraints. It formulates this position as the Multi-Phase Inference Assumption (MIA) and defines its core internal mechanism as the Multi-Phase Inference Mechanism (MIM). The framework introduces alignment maps and transformation loss to analyze how heterogeneous world models communicate without being collapsed into a single representation. World-model alignment is therefore processability, not agreement: the design of AI systems that help heterogeneous forms of intelligence remain mutually processable while preserving their distinct error-detection capacities.
Abstract（参考訳）: 現代の社会はこれまでになく多くの情報を持っているが、それらは単一の共有された理解に収束しない。同じ出来事、事実、法律、技術、またはリスクは、自由、危険、排除、不正、責任、あるいは非現実的な可能性の証拠として解釈することができる。既存の議論はしばしば、そのような意見の相違を価値、好み、信念の相反として扱う。本稿は、すでに不一致は後期現象であると主張している。中心的な前提は単純だが自明ではない。すべての観測が推論的に関連づけられるわけではないし、観測シーケンス内の全ての可能なオブジェクトが推定対象になるわけではない。可能なターゲットは、そのターゲットに対する予測、評価、またはアクションにほぼ十分である状態表現を構築することができる場合にのみ許容される。本稿では, 有限情報, 表現, 観察, 行動制約下での十分な統計量の構築として認識を再構築することにより, 認知の多様性とアライメントに関する世界モデル理論を開発する。この位置をMulti-Phase Inference Assumption (MIA)として定式化し、その中核となる内部メカニズムをMulti-Phase Inference Mechanism (MIM)として定義する。このフレームワークはアライメントマップと変換損失を導入し、異種世界モデルが単一の表現に分解されることなくどのように通信するかを分析する。したがって、ワールドモデルアライメントはプロセス可能性であり、合意ではない。異質なインテリジェンスを支援するAIシステムの設計は、異なるエラー検出能力を維持しながら、相互に処理可能である。

関連論文リスト

Causal Bias Detection in Generative Artificial Intelligence [15.736899098702972]
我々は、生成AIにおける因果フェアネスの問題を形式化し、共通の理論的枠組みの下で標準ML設定と統一する。我々は, (a) 異なる因果経路と (b) 生成モデルのメカニズムによる実世界のメカニズムの置き換えの両方に沿って, 公正な影響の粒度の定量化を可能にする新たな因果分解結果を導出した。
論文参考訳（メタデータ） (2026-05-12T00:36:53Z)
Position: General Alignment Has Hit a Ceiling; Edge Alignment Must Be Taken Seriously [51.03213216886717]
我々は、一般的なアライメントの支配的なパラダイムが、矛盾する値の設定において構造的な天井に達するという立場を取る。エッジアライメント(Edge Alignment)は,多次元の値構造を保持するシステムにおいて,異なるアプローチである。
論文参考訳（メタデータ） (2026-02-23T16:51:43Z)
A Multimodal Framework for Aligning Human Linguistic Descriptions with Visual Perceptual Data [0.0]
人間の参照解釈の中核的な側面をモデル化する計算フレームワークを提案する。スタンフォード・リピート・レファレンス・ゲーム・コーパス(Stanford Repeated Reference Game corpus)のモデルを評価する。その結果, 比較的単純な知覚言語的アライメント機構は, 人間の競争行動をもたらすことが示唆された。
論文参考訳（メタデータ） (2026-02-23T07:20:11Z)
Exploring Syntropic Frameworks in AI Alignment: A Philosophical Investigation [0.0]
AIアライメントは、プロセスベース、マルチエージェント、開発メカニズムを通じて、シントロピックで理由対応のエージェントを設計するものとして再認識されるべきである、と私は主張する。コンテンツベースの値仕様が構造的に不安定なように見える理由を示す、仕様トラップの議論を明確にする。マルチエージェントアライメントのダイナミクスを理解するための情報理論の枠組みとして, シントロピーを提案する。
論文参考訳（メタデータ） (2025-11-19T23:31:29Z)
Modeling Open-World Cognition as On-Demand Synthesis of Probabilistic Models [93.1043186636177]
我々は、人々が分散表現と象徴表現の組み合わせを使って、新しい状況に合わせた見知らぬ精神モデルを構築するという仮説を探求する。モデル合成アーキテクチャ」という概念の計算的実装を提案する。我々は、新しい推論データセットに基づく人間の判断のモデルとして、MSAを評価した。
論文参考訳（メタデータ） (2025-07-16T18:01:03Z)
PRISM: Perspective Reasoning for Integrated Synthesis and Mediation as a Multi-Perspective Framework for AI Alignment [0.0]
Perspective Reasoning for Integrated Synthesis and Mediation (PRISM)は、AIアライメントにおける永続的な課題に対処するフレームワークである。 PRISMは道徳的懸念を7つの「基本世界観」にまとめ、それぞれが人間の道徳的認知の異なる次元を捉えていると仮定している。現実の展開や形式的検証など,今後の方向性を概説するとともに,マルチパースペクティブな合成とコンフリクトの仲介に重点を置きながら,今後の方向性を概説する。
論文参考訳（メタデータ） (2025-02-05T02:13:57Z)
Position: Towards Bidirectional Human-AI Alignment [109.57781720848669]
我々は、人間とAIの双方向的・動的関係を説明するために、研究コミュニティは「調整」を明確に定義し、批判的に反映すべきであると主張する。このフレームワークは、AIと人間の価値を整合させる従来の取り組みを取り入れているだけでなく、人間とAIを整合させるという、重要で未解明の次元も導入しています。
論文参考訳（メタデータ） (2024-06-13T16:03:25Z)
Attacks in Adversarial Machine Learning: A Systematic Survey from the Life-cycle Perspective [69.25513235556635]
敵対的機械学習(英: Adversarial Machine Learning、AML)は、機械学習の逆行現象を研究する。機械学習システムの異なる段階で発生するこの敵対現象を探求するために、いくつかのパラダイムが最近開発された。既存の攻撃パラダイムをカバーするための統一的な数学的枠組みを提案する。
論文参考訳（メタデータ） (2023-02-19T02:12:21Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。