Fugu-MT 論文翻訳(概要): Towards Platonic Representation for Table Reasoning: A Foundation for Permutation-Invariant Retrieval

論文の概要: Towards Platonic Representation for Table Reasoning: A Foundation for Permutation-Invariant Retrieval

arxiv url: http://arxiv.org/abs/2604.12133v1
Date: Mon, 13 Apr 2026 23:33:43 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-15 19:11:32.15872
Title: Towards Platonic Representation for Table Reasoning: A Foundation for Permutation-Invariant Retrieval
Title（参考訳）: 表推論のためのプラトン表現に向けて:置換不変検索の基礎
Authors: Willy Carlos Tchuitcheu, Tan Lu, Ann Dooms,
Abstract要約: 我々は,表表表現学習(TRL)に対する歴史的アプローチが,自然言語処理(NLP)のシーケンシャルパラダイムを広く採用していることを論じる。本稿では、テーブルに対するプラトン表現仮説(PRH)を導入し、テーブル推論のための意味論的に堅牢な潜在空間は本質的に置換不変量(PI)でなければならないと仮定する。本稿では,セルヘッダアライメントの認知原理を明示する,構造を意識したTRLエンコーダアーキテクチャを提案する。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Historical approaches to Table Representation Learning (TRL) have largely adopted the sequential paradigms of Natural Language Processing (NLP). We argue that this linearization of tables discards their essential geometric and relational structure, creating representations that are brittle to layout permutations. This paper introduces the Platonic Representation Hypothesis (PRH) for tables, positing that a semantically robust latent space for table reasoning must be intrinsically Permutation Invariant (PI). To ground this hypothesis, we first conduct a retrospective analysis of table-reasoning tasks, highlighting the pervasive serialization bias that compromises structural integrity. We then propose a formal framework to diagnose this bias, introducing two principled metrics based on Centered Kernel Alignment (CKA): (i) PI, which measures embedding drift under complete structural derangement, and (ii) rho, a Spearman-based metric that tracks the convergence of latent structures toward a canonical form as structural information is incrementally restored. Our empirical analysis quantifies an expected flaw in modern Large Language Models (LLMs): even minor layout permutations induce significant, disproportionate semantic shifts in their table embeddings. This exposes a fundamental vulnerability in RAG systems, in which table retrieval becomes fragile to layout-dependent noise rather than to semantic content. In response, we present a novel, structure-aware TRL encoder architecture that explicitly enforces the cognitive principle of cell header alignment. This model demonstrates superior geometric stability and moves towards the PI ideal. Our work provides both a foundational critique of linearized table encoders and the theoretical scaffolding for semantically stable, permutation invariant retrieval, charting a new direction for table reasoning in information systems.
Abstract（参考訳）: 表表現学習(TRL)の歴史的アプローチは、自然言語処理(NLP)のシーケンシャルパラダイムを多く採用している。このテーブルの線形化は、その基本的な幾何学的および関係的な構造を捨て、レイアウトの置換に脆弱な表現を生み出していると論じる。本稿では、テーブルに対するプラトン表現仮説(PRH)を導入し、テーブル推論のための意味論的に堅牢な潜在空間は本質的に置換不変量(PI)でなければならないと仮定する。この仮説を基礎として、まずテーブル推論タスクの振り返り分析を行い、構造的整合性を損なう広範囲な直列化バイアスを強調した。次に、このバイアスを診断するための正式なフレームワークを提案し、CKA(Centered Kernel Alignment)に基づく2つの原則付きメトリクスを導入します。一完全な構造上の乱れの下での埋没流を測定するPI及び (ii) 構造情報としての正準形式への潜伏構造の収束を追跡するスピアマン測度が漸進的に復元される。我々の経験的分析は、現代のLarge Language Models (LLMs) の期待する欠陥を定量化します。これは、テーブル検索がセマンティックコンテンツよりもレイアウトに依存したノイズに脆弱になる、RAGシステムにおける根本的な脆弱性を露呈する。そこで本研究では,セルヘッダアライメントの認知原理を明示的に適用した,構造を意識したTRLエンコーダアーキテクチャを提案する。このモデルは優れた幾何学的安定性を示し、PIイデアルに向かう。本研究は,線形化テーブルエンコーダの基本的批判と,意味論的に安定な変分不変検索のための理論的足場を提供し,情報システムにおけるテーブル推論の新しい方向を図示する。

論文の概要: Towards Platonic Representation for Table Reasoning: A Foundation for Permutation-Invariant Retrieval

関連論文リスト