Fugu-MT 論文翻訳(概要): StereoFactory: A Unified Merging Framework for Robust Stereo Matching

論文の概要: StereoFactory: A Unified Merging Framework for Robust Stereo Matching

arxiv url: http://arxiv.org/abs/2606.17475v1
Date: Tue, 16 Jun 2026 03:36:59 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-17 17:15:32.251156
Title: StereoFactory: A Unified Merging Framework for Robust Stereo Matching
Title（参考訳）: StereoFactory:ロバストなステレオマッチングのための統合マージフレームワーク
Authors: Xianda Guo, Pinhan Fu, Ruilin Wang, Wenke Huang, Mang Ye, Qin Zou,
Abstract要約: ステレオマッチングは、大規模なデータセットでトレーニングされた基礎モデルを通じて進歩しているが、このパラダイムはスケーラビリティのボトルネックに悩まされている。モデルマージは、ソースチェックポイントが利用可能になった後、特別なモデルからの知識を統合することで、スケーラブルなポストホックな代替手段を提供する。本稿では,適応モデルマージのための粗大な進化的フレームワークであるStereoFactoryを提案する。
参考スコア（独自算出の注目度）: 61.973843344605655
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Stereo matching has advanced through foundation models trained on large-scale datasets, yet this paradigm suffers from a scalability bottleneck: incorporating new data requires costly joint retraining. Model merging offers a scalable post-hoc alternative by integrating knowledge from specialized models after source checkpoints are available. However, existing merging methods typically retain all available models or rely on greedy inclusion, which can preserve harmful task-vector interference. We propose StereoFactory, a coarse-to-fine evolutionary framework for adaptive model merging. Stage~1 employs a genetic algorithm to search the combinatorial space of model subsets, determining which models should participate. Stage~2 addresses module-level knowledge specialization (different functional modules exhibit distinct preferences for knowledge sources) through CMA-ES optimization of architecture-adaptive routing over the selected task vectors, with optional module-level scaling. Experiments across two architectures and four benchmarks demonstrate that StereoFactory consistently achieves the best four-benchmark average under the same checkpoint pool, reducing the average error from 3.80 to 3.30 on NMRF and from 2.88 to 2.19 on FoundationStereo relative to the strongest controlled baseline. The post-hoc search requires only 2.7--3.7\% of the corresponding joint-retraining wall-clock time. Analysis reveals that knowledge contributions are inherently module-specific, and selected subsets can transfer across architectures with minimal degradation. Code will be publicly released upon acceptance at: https://github.com/XiandaGuo/StereoFactory.
Abstract（参考訳）: ステレオマッチングは、大規模なデータセットでトレーニングされた基礎モデルを通じて進歩しているが、このパラダイムはスケーラビリティのボトルネックに悩まされている。モデルマージは、ソースチェックポイントが利用可能になった後、特別なモデルからの知識を統合することで、スケーラブルなポストホックな代替手段を提供する。しかし、既存のマージ手法は一般的にすべての利用可能なモデルを保持するか、有害なタスクとベクターの干渉を保ちうるグレディなインクルージョンに依存している。本稿では,適応モデルマージのための粗大な進化的フレームワークであるStereoFactoryを提案する。 Stage~1は、モデルサブセットの組合せ空間を探索するために遺伝的アルゴリズムを使用し、どのモデルに参加するべきかを決定する。 Stage~2は、選択したタスクベクトル上のアーキテクチャ適応ルーティングのCMA-ES最適化を通じて、モジュールレベルの知識専門化(異なる機能的モジュールは知識ソースに対して異なる好みを示す)に対処し、オプションのモジュールレベルのスケーリングを行う。 2つのアーキテクチャと4つのベンチマークの実験により、ステレオファクトリーは一貫して同じチェックポイントプールの下で最高の4つのベンチマーク平均を達成し、NMRFでは平均誤差を3.80から3.30に、FoundationStereoでは2.88から2.19に下げた。ポストホックサーチは、対応する共同規制壁時計の2.7--3.7\%しか必要としない。分析によると、知識の貢献は本質的にモジュール固有であり、選択されたサブセットは最小限の劣化でアーキテクチャ間で転送可能である。コードは https://github.com/XiandaGuo/StereoFactory.com で公開される。

論文の概要: StereoFactory: A Unified Merging Framework for Robust Stereo Matching

関連論文リスト