Fugu-MT 論文翻訳(概要): OpenRFM: Dissecting Relational In-Context Learning

論文の概要: OpenRFM: Dissecting Relational In-Context Learning

arxiv url: http://arxiv.org/abs/2606.04320v1
Date: Wed, 03 Jun 2026 00:48:44 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-04 20:44:18.451247
Title: OpenRFM: Dissecting Relational In-Context Learning
Title（参考訳）: OpenRFM:リレーショナルインコンテキストラーニングの分離
Authors: Zhikai Chen, Junyu Yin, Jialiang Gu, Siheng Xiong, Xiaoze Liu, Ruowang Zhang, Keren Zhou, Kai Guo,
Abstract要約: In-context Learning (ICL) を通じて1回のパスで返される事前学習された予測器は,ラベルとセルの関係が疎い場合に故障することを示す。本稿では,リレーショナルバックボーンと,事前学習した表層基礎モデルから持ち上げられたバッチレベルICL層を結合して,関係レベルのラベル不足を克服する2段ICLアーキテクチャを提案する。
参考スコア（独自算出の注目度）: 15.623084178502053
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Relational Foundation Models (RFMs) promise a single pre-trained predictor that, given any relational database, returns predictions in one forward pass via relational in-context learning (ICL). Yet a substantial gap separates open RFMs from their commercial counterparts, and the origin of this gap has not been systematically understood. We dissect a representative framework, the Relational Transformer (RT), from two perspectives. Model side: we show that RT performs relation-level ICL, and a kernel regression view shows it fails when sparse label-cell coverage yields an underdetermined regression. Data side: we ablate RT's pre-training source and find that existing synthetic-only pre-training and in-distribution pre-training drive the same architecture into different regimes, lazy vs. feature-learning. Probing this gap reveals that the missing ingredient is a support-identifiable relational latent in the label-generation process. These two diagnoses translate into (1) a dual-stage ICL architecture that combines the relational backbone with a batch-level ICL layer lifted from a pre-trained tabular foundation model to overcome relation-level label scarcity, and (2) a homophily-aware synthetic plus continual real-data pre-training mixture, augmented with a prototype-based regularization. These choices define OpenRFM, a simple yet effective RFM that improves average task performance by approximately 30% over the RT backbone and surpasses the commercial model KumoRFMv1 on a large set of evaluation tasks.
Abstract（参考訳）: リレーショナル・ファンデーション・モデル(RFM)は、任意のリレーショナル・データベースが与えられた場合、リレーショナル・イン・コンテキスト・ラーニング(ICL)を介して1つのフォワード・パスで予測を返す1つの事前訓練された予測器を約束する。しかし、実質的なギャップは、オープンRAMと商業的ギャップを分離し、このギャップの起源は体系的に理解されていない。我々は2つの観点から、代表的なフレームワークであるリレーショナルトランスフォーマー(RT)を識別する。モデル側: RT が関係レベル ICL を実行することを示し、カーネルレグレッションビューは、スパースラベルセルのカバレッジが未決定のレグレッションをもたらすときに失敗することを示している。データ側: RTの事前トレーニングソースを廃止し、既存の合成専用事前トレーニングと非配布事前トレーニングが、同じアーキテクチャを、遅延対フィーチャーラーニングという異なるレシエーションに導くことを発見した。このギャップは、ラベル生成プロセスにおいて、サポート識別可能なリレーショナル潜伏剤が欠落していることを示している。これらの2つの診断は,(1)関係バックボーンと,(2)関係レベルラベルの不足を克服するためにトレーニング済みの表層モデルから持ち上げられたバッチレベルのICL層を結合した2段階のICLアーキテクチャ,(2)プロトタイプベースの正規化で強化されたホモフィア認識合成および連続的な実データ事前学習混合物に翻訳される。 OpenRFMは、RTバックボーン上で平均タスク性能を約30%改善し、多数の評価タスクにおいて商用モデルであるKumoRFMv1を上回る、シンプルで効果的なRAMである。

論文の概要: OpenRFM: Dissecting Relational In-Context Learning

関連論文リスト