Fugu-MT Paper Translation (Abstract): SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

Paper overview: SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

arxiv url: http://arxiv.org/abs/2605.27367v1
Date: Tue, 26 May 2026 17:59:20 GMT
Status: Translation complete
Last updated in system: 2026-05-27 17:51:42.59399
Title: SpatialBench: Is Your Spatial Foundation Model an All-Round Player?
Title (reference translation): SpatialBench: Is Your Spatial Foundation Model an All-Round Player?
Authors: Haosong Peng, Hao Li, Jiaqi Chen, Yuhao Pan, Runmao Yao, Yalun Dai, Fushuo Huo, Fangzhou Hong, Zhaoxi Chen, Haozhao Wang, Dingwen Zhang, Ziwei Liu, Wenchao Xu
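Abstract summary: SpatialBench is a cross-paradigm, domain-diverse benchmark for spatial foundation models with deterministic sampling. It comprehensively evaluates 41 models across 6 paradigms on 5 task suites under 4 different input density settings. The authors show that strict domain alignment and high data quality are far more critical to performance than simple dataset scaling.
Reference score (independently computed attention score): 92.031716744172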
License: http://creativecommons.org/licenses/by/4.0/
Abstract: While spatial foundation models have demonstrated impressive performance on standard datasets, a critical question remains: are they truly all-round players capable of generalizing robustly across diverse downstream tasks, arbitrary viewpoints, shifting scene domains, varying input densities, and specific hardware constraints? Answering this overarching question requires a holistic assessment, yet current models are mainly evaluated on specific domains for which they were specifically designed or trained. Such evaluations are intrinsically limited by narrow paradigm coverage, limited scene domains, and arbitrary frame sampling, making it fundamentally difficult to assess their true generalization capabilities. To address this gap, we present SpatialBench, a cross-paradigm, domain-diverse benchmark for spatial foundation models with deterministic sampling. SpatialBench features unprecedented scale and rigorous deterministic design, comprising 19 datasets and 546 scenes across 5 diverse spatial domains. It comprehensively evaluates 41 models across 6 paradigms on 5 task suites under 4 different input density settings. Our extensive evaluation reveals that current models are not yet all-round players, and uncovers crucial insights for future advancement. Specifically, we demonstrate that full-context attention maximizes accuracy while bounded-memory strategies unlock long-sequence scalability. Moreover, our empirical evaluations in challenging embodied and egocentric tasks demonstrate that strict domain alignment and high data quality are far more critical to performance than simple dataset scaling. Furthermore, to address the largest data gap identified in our analysis, we go beyond evaluation by introducing a large-scale dataset, DA-Next-5M, and a strong baseline model, DA-Next, pushing the boundaries of spatial representation learning.
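Abstract (reference translation): Spatial foundation models have shown impressive performance on standard datasets, but an important question remains: are they truly all-round players that can generalize robustly across diverse downstream tasks, arbitrary viewpoints, shifting scene domains, varying input densities, and specific hardware constraints? Answering this overarching question requires a holistic assessment, yet current models are mainly evaluated on the specific domains for which they were designed or trained. Such evaluations are inherently limited by narrow paradigm coverage, limited scene domains, and arbitrary frame sampling, which makes it fundamentally difficult to assess true generalization capability. To address this gap, the authors present SpatialBench, a cross-paradigm, domain-diverse benchmark for spatial foundation models with deterministic sampling. SpatialBench features unprecedented scale and a rigorous deterministic design, comprising 19 datasets and 546 scenes across 5 diverse spatial domains. It comprehensively evaluates 41 models across 6 paradigms on 5 task suites under 4 different input density settings. The extensive evaluation reveals that current models are not yet all-round players and uncovers insights that are crucial for future progress. Specifically, it shows that full-context attention maximizes accuracy, while bounded-memory strategies unlock long-sequence scalability. Moreover, empirical evaluations on challenging embodied and egocentric tasks show that strict domain alignment and high data quality are far more critical to performance than simple dataset scaling. Finally, to address the largest data gap identified in the analysis, the authors go beyond evaluation by introducing a large-scale dataset, DA-Next-5M, and a strong baseline model, DA-Next, pushing the boundaries of spatial representation learning.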


Related papers